SOCC | SFU Opinion and Comments Corpus | Natural Language Processing library

 by   sfu-discourse-lab Python Version: Current License: Non-SPDX

kandi X-RAY | SOCC Summary

kandi X-RAY | SOCC Summary

SOCC is a Python library typically used in Artificial Intelligence, Natural Language Processing applications. SOCC has no bugs, it has no vulnerabilities and it has low support. However SOCC build file is not available and it has a Non-SPDX License. You can download it from GitHub.

SFU Opinion and Comments Corpus. The SFU Opinion and Comments Corpus (SOCC) is a corpus for the analysis of online news comments. Our corpus contains comments and the articles from which the comments originated. The articles are all opinion articles, not hard news articles. The corpus is larger than any other currently available comments corpora, and has been collected with attention to preserving reply structures and other metadata. In addition to the raw corpus, we also present annotations for four different phenomena: constructiveness, toxicity, negation and its scope, and appraisal. For more information about this work, please see our papers.
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              SOCC has a low active ecosystem.
              It has 79 star(s) with 21 fork(s). There are 11 watchers for this library.
              OutlinedDot
              It had no major release in the last 6 months.
              SOCC has no issues reported. There are no pull requests.
              It has a neutral sentiment in the developer community.
              The latest version of SOCC is current.

            kandi-Quality Quality

              SOCC has 0 bugs and 0 code smells.

            kandi-Security Security

              SOCC has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
              SOCC code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              SOCC has a Non-SPDX License.
              Non-SPDX licenses can be open source with a non SPDX compliant license, or non open source licenses, and you need to review them closely before use.

            kandi-Reuse Reuse

              SOCC releases are not available. You will need to build from source code and install.
              SOCC has no build file. You will be need to create the build yourself to build the component from source.
              Installation instructions are available. Examples and code snippets are not available.
              SOCC saves you 942 person hours of effort in developing the same functionality from scratch.
              It has 2148 lines of code, 35 functions and 8 files.
              It has low code complexity. Code complexity directly impacts maintainability of the code.

            Top functions reviewed by kandi - BETA

            kandi has reviewed SOCC and discovered the below as its top functions. This is intended to give you an instant insight into SOCC implemented functionality, and help decide if they suit your requirements.
            • Combine annotation files
            • Lookup the label for a given column
            • Simplify dataframe
            • Read data from a project file
            • Get all the labels from a dataframe
            • Get the labels from a dataframe
            • Count the number of reactions in the dataframe
            • Find all reactions in the DataFrame
            • Find all reactions in reaction_list
            • Clean filenames
            • Combine two lists
            • Remove comments from files and write comments
            • Removes comment lines from a file
            • Count the number of neg votes in the given DataFrame
            • Return all comments in a DataFrame
            • Divide two lists
            • Count the number of votes in the post author
            • Return the contents of a directory
            • Lookup the label for the given column
            • Get the labelinds from a dataframe
            • Takes a DataFrame and returns the number of posts that belong to the thread
            • Create a pandas DataFrame containing the yearly count of all comments
            • Read a given project file
            • Read in a project file with blank entries
            • Massage files
            Get all kandi verified functions for this library.

            SOCC Key Features

            No Key Features are available at this moment for SOCC.

            SOCC Examples and Code Snippets

            No Code Snippets are available at this moment for SOCC.

            Community Discussions

            QUESTION

            IEnumerable doesn't contain definition for ExpMonthYr
            Asked 2019-Aug-27 at 19:47

            I added @model IEnumerable instead of @model TravelSOCC.Models.LansingMileage which will no longer allow me to access my ExpMonthYr

            I know according to the documentation these two functions should be seperated as they are performing different task just was trying to make the end user have fewer pages to navigate.

            What this is supposed to do is allow a user to select a date from the datepicker and then update the database and have it be re-displayed in a table using the @foreach this means that I need to be able to use both the @Html.EditorFor and the @Foreach within the same view but not within the same table.

            ...

            ANSWER

            Answered 2019-Aug-27 at 18:36

            What I understand is that you are trying to add header inputs in front of the table, the datepicker might be used for filtering data.

            In this case, you cannot use any properties from Model, because it's a list and the list is not your search options but your search result. so you should have a SearchViewModel to pass search options to server, you can directly use LansingMileage as SearchViewModel. So if client input something, which data will be saved in this SearchViewModel.

            Here is example, mostly ASP.NET Core code, but it's very similar pattern with MVC5.

            At the beginning of cshtml:

            Source https://stackoverflow.com/questions/57679597

            QUESTION

            csvread function loads 0 values
            Asked 2019-Aug-18 at 20:23

            I am trying to load data from a csv file using the function csvread in Matlab as follows:

            ...

            ANSWER

            Answered 2019-Jul-29 at 19:24

            An actual excerpt of the file (rather than a screenshot) would help as this is likely some sort of encoding issue (CSVs have a number of ways they can be encoded, depending on localization for example).

            However, for all data creation purposes, I always find that the Matlab GUI does a great job, and if you want the code corresponding to the import it can be generated as well!

            Go to Home / Import data, then just follow the steps until the preview enables you to have the data the way you want it (NB: for each column, you can choose a preferred type). Then, you can either click on "import selection", which puts the variable in the workspace, or choose "generate script" or "generate function" to get the actual code.

            == edit == I hadn't seen the output you were getting, and the comment above (which was posted while I was answering) regarding the long format is likely the correct solution in this case, if the actual values aren't zeros (which is what I had assumed from your question).

            Source https://stackoverflow.com/questions/57260126

            QUESTION

            Stuck in write operation when reading from Socket
            Asked 2018-Dec-18 at 20:33

            I'm sending a file and its name through Socket to a ServerSocket. It works "partially" -- the server gets the file and saves it to disk but it does not exit the loop in the copy() method in the ClientSession class.

            ...

            ANSWER

            Answered 2018-Dec-18 at 20:24

            Think about your procotol:

            • The Client sends the file name, then sends the binary file, then waits for the server response.
            • The Server reads the file name, then the binary file until the stream is closed, then sends the success message.

            But the stream is never closed since the client is waiting for the response, hence you have a deadlock in your protocol.

            This is usually solved by sending the file size first and having the server read exactly that many bytes.

            Alternatively you can use the TCP's one-way shutdown feature to send a signal to the server that the output stream of the socket is closed. That can be done with socc.shutdownOutput();

            And please use try-with-resources to avoid resource leaks (you must close the Socket, too).

            Fixed Client:

            Source https://stackoverflow.com/questions/53836574

            QUESTION

            DUMP command in PIG not working
            Asked 2018-Apr-04 at 08:36

            I wrote a simple PIG program as follows to analyze a small and a modified version of the google n-grams dataset on AWS. The data looks something like this:

            ...

            ANSWER

            Answered 2018-Apr-02 at 16:57

            Not having enough reputations to comment so posting here, are writing the above pig statements in a script or running individually from grunt shell. Also can you give a brief about the logic behind sum_occ relation.

            Source https://stackoverflow.com/questions/49529308

            QUESTION

            QSerialPort cannot find the path specified
            Asked 2018-Feb-16 at 15:52

            I've downloaded the program com0com and created 3 pairs of virtual coms. To test such coms, I downloaded Termite, which open, receive, transmit and close every pair successfully. So far so good.

            However, in my QT application, I can't open any of the virtual coms (I don't have a actual port so I haven't tested).

            My code:

            ...

            ANSWER

            Answered 2018-Feb-16 at 15:52

            For anyone having troubles with this, open the com0com setup, now, select the option Use Port Class on your virtual port pair. Use it for both ports in the pair.

            I still do not know why that's the problem, specially since other terminals opened the port with no trouble. But that solved for me. If anyone get more information, feel free to post it.

            Source https://stackoverflow.com/questions/48829141

            QUESTION

            QFile path changing from build version to deploy version
            Asked 2017-Oct-12 at 10:19

            I have a program that saves data inside it's own folder. This is how I save the data:

            ...

            ANSWER

            Answered 2017-Oct-12 at 10:19

            Due to security features introduced with Windows Vista (UAC) any non-Administrator program that tries to write to protected locations such as "Program Files" will get their writes caught and redirected to an alternative "user friendly" location.

            In you case C:\Users\SOCC\AppData\Local\VirtualStore\Program Files (x86)\SOCC_Pesquisa\config\Values

            You can find out more about UAC here on Wikipedia

            So you can move in these directions:

            1. Run program as administrator each time.
            2. Change directory's security settings: going properties, select the Security tab and then advanced.
            3. Request elevation for your app as you can find on Wikipedia link.
            4. Change the savings location to a more secure and usual, like documents or whatever you want not conflicting with UAC

            Source https://stackoverflow.com/questions/41554807

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            No vulnerabilities reported

            Install SOCC

            The data is divided into two main parts, with each part being, in turn, divided into three portions:. The corpus contains 10,339 opinion articles (editorials, columns, and op-eds) together with their 663,173 comments from 303,665 comment threads, from the main Canadian daily in English, The Globe and Mail, for a five-year period (from January 2012 to December 2016). We organize our corpus into three sub-corpora: the articles corpus, the comments corpus, and the comment-threads corpus, organized into three CSV files: gnm_articles.csv, gnm_comments.csv, and gnm_comment_threads.csv. article_id A unique identifier for the article. We use this identifier in the comments CSV. You'll also see this identifier in the article url. (E.g., 26691065). title The title or the headline of The Globe and Mail opinion article. (E.g., Fifty years in Canada, and now I feel like a second-class citizen). article_url The Globe and Mail url for the article. (E.g., http://www.theglobeandmail.com/opinion/fifty-years-in-canada-and-now-i-feel-like-a-second-class-citizen/article26691065/). author The author of the opinion article. published_date The date when the article was published. (E.g., 2015-10-16 EDT). ncomments The number of comments in the comments corpus for this article. ntop_level_comments The number of top-level comments in the comments corpus for this article. article_text The article text. We have preserved the paragraph structure in the text with paragraph tags. The CSV contains all unique comments (663,173 comments) in response to the articles in articles.csv after removing duplicates and comments with large overlap. The corpus is useful to study individual comments, i.e., without considering their location in the comment thread structure. Below we describe fields in this CSV. article_id A unique identifier for the article. We use this identifier in the comments CSV. You'll also see this identifier in the article url. (E.g., 26691065). comment_counter The comment counter which encodes the position and depth of a comment in a comment thread. Below are some examples. comment_author The username of the author of the comment. timestamp The timestamp indicating the posting time of the comment. The comments from source1 have timestamp. post_time The posting time of the comment. The comments from source2 have post_time. comment_text The comment text. The text is minimally preproessed. We have cleaned the HTML tags and have done preliminary word segmentation to fix missing spaces after punctuation. TotalVotes The total votes (positive votes + negative votes). posVotes The positive votes received by the comment. negVotes The negative votes received by the comment. vote Not sure. A Field from the scraped comments JSON.
            Raw data The articles corpus The comments corpus The comment-threads corpus
            Annotated data SFU constructiveness and toxicity corpus SFU negation corpus SFU Appraisal corpus
            First top-level comment: source1_article-id_0
            First child of the top-level comment: source1_article-id_0_0
            Second child of the top-level comment: source1_article-id_0_1
            Grandchildren. source1_article-id_0_0_0, source1_article-id_0_0_1
            Very toxic (4)
            Toxic (3)
            Mildly toxic (2)
            Not toxic (1)

            Support

            Varada Kolhatkar (vkolhatk@sfu.ca) Maite Taboada (mtaboada@sfu.ca).
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            CLONE
          • HTTPS

            https://github.com/sfu-discourse-lab/SOCC.git

          • CLI

            gh repo clone sfu-discourse-lab/SOCC

          • sshUrl

            git@github.com:sfu-discourse-lab/SOCC.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link

            Consider Popular Natural Language Processing Libraries

            transformers

            by huggingface

            funNLP

            by fighting41love

            bert

            by google-research

            jieba

            by fxsjy

            Python

            by geekcomputers

            Try Top Libraries by sfu-discourse-lab

            SO-CAL

            by sfu-discourse-labPython

            GenderGapTracker

            by sfu-discourse-labPython

            Misinformation_detection

            by sfu-discourse-labPython

            SFU_Comment_Extractor

            by sfu-discourse-labHTML

            adverbly_adjectives

            by sfu-discourse-labJupyter Notebook