giraph | Mirror of Apache Giraph

 by   apache Java Version: rel/1.2.0-RC1 License: Apache-2.0

kandi X-RAY | giraph Summary

kandi X-RAY | giraph Summary

giraph is a Java library typically used in Big Data, Maven, Hadoop applications. giraph has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has high support. You can download it from GitHub.

giraph : large-scale graph processing on hadoop. web and online social graphs have been rapidly growing in size and scale during the past decade. in 2008, google estimated that the number of web pages reached over a trillion. online social networking and email sites, including yahoo!, google, microsoft, facebook, linkedin, and twitter, have hundreds of millions of users and are expected to grow much more in the future. processing these graphs plays a big role in relevant and personalized information for users, such as results from a search engine or news in an online social networking site. graph processing platforms to run large-scale algorithms (such as page rank, shared connections, personalization-based popularity, etc.) have become quite popular. some recent examples include pregel and haloop. for general-purpose big data computation, the map-reduce computing
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              giraph has a highly active ecosystem.
              It has 610 star(s) with 302 fork(s). There are 68 watchers for this library.
              OutlinedDot
              It had no major release in the last 6 months.
              giraph has no issues reported. There are 32 open pull requests and 0 closed requests.
              OutlinedDot
              It has a negative sentiment in the developer community.
              The latest version of giraph is rel/1.2.0-RC1

            kandi-Quality Quality

              giraph has no bugs reported.

            kandi-Security Security

              giraph has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

            kandi-License License

              giraph is licensed under the Apache-2.0 License. This license is Permissive.
              Permissive licenses have the least restrictions, and you can use them in most projects.

            kandi-Reuse Reuse

              giraph releases are not available. You will need to build from source code and install.
              Build file is available. You can build the component from source.

            Top functions reviewed by kandi - BETA

            kandi has reviewed giraph and discovered the below as its top functions. This is intended to give you an instant insight into giraph implemented functionality, and help decide if they suit your requirements.
            • Populates the giraph configuration .
            • Start the server .
            • Runs a given test graph with vertex output .
            • coordinate a superstep
            • Instruments the sandbox .
            • Performs a single input split .
            • Starts the ZooKeeper server .
            • Gets the next IOCommand .
            • Saves the vertices .
            • Connects all the tasks to their addresses .
            Get all kandi verified functions for this library.

            giraph Key Features

            No Key Features are available at this moment for giraph.

            giraph Examples and Code Snippets

            No Code Snippets are available at this moment for giraph.

            Community Discussions

            QUESTION

            Failure to start Hadoop after having stopped a running (and working) instance before, because Datanode says that the directory is locked
            Asked 2020-Jun-29 at 08:23

            I have a cluster running Hadoop 1.2.1 with Giraph on top. The server runs ok, but when I stop it, I am unable to make it run again. In the datanode log I get the following error: ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Cannot lock storage /pathToFolder/data/datanode. The directory is already locked.

            I have tried many solutions that I found online:

            1. Checking permissions of folders.
            2. Checking equal versions of VERSION file for namenode and datanode.
            3. Checking configuration files (core-site, hdfs-site, mapred-site, master, slaves, ...)
            4. Deleting / Changing the namenode and datanode data folders
            5. Removing hadoop temporary files

            Bottomline is, everything seems fine, but it is still failing to start the datanode. A complete log file for the datanode is the following:

            ...

            ANSWER

            Answered 2020-Jun-29 at 08:23

            Still haven't managed to get rid of the problem (Datanode not shutting down correctly), but I found a workaround to the situation. I used lsof +D /pathto detect active processes and killed them. The weird part is that this process was invisible to top and jps commands.

            Source https://stackoverflow.com/questions/62554490

            QUESTION

            Java SparkContext error: java.lang.NoSuchMethodError: io.netty.buffer.PooledByteBufAllocator
            Asked 2018-Apr-27 at 06:31

            This is my first foray into java on Spark. The following error is happening when using either Spark 1.X (tried 1.5.0) or 2.X (tried 2.2.0), java 1.8 and with scala 2.10:

            ...

            ANSWER

            Answered 2018-Apr-27 at 06:31

            Both spark-core and giraph-core have dependency for netty-all. You need to exclude it from giraph-core.

            Source https://stackoverflow.com/questions/50055656

            QUESTION

            Running my own job on Giraph
            Asked 2018-Mar-14 at 03:32

            So, I've successfully executed the SimpleShortestPathComputation on my computer via the script shown here:

            ...

            ANSWER

            Answered 2018-Mar-14 at 03:32

            You need to have the jar which contains the class GiraphAlgs.GiraphPBFS in the hadoop classpath.

            Also, verify that your classpath is correct set by running $bin/hadoop classpath.

            Once in hadoop 2.7 setting HADOOP_CLASSPATH variable didn't work, I had to copy the jar in the hadoop share lib directory: HADOOP_HOME/share/hadoop/mapreduce/lib.

            Source https://stackoverflow.com/questions/48845639

            QUESTION

            Apache Giraph on Cloudera VM - POM for org.apache.hadoop:hadoop-core:jar:2.6.0 missing, no dependency info
            Asked 2017-Dec-09 at 00:56

            I am new to Hadoop/Giraph and Java. As part of a task, I downloaded Cloudera Quickstart VM and Giraph on top of it. I am using this book named "Practical Graph Analytics with Apache Giraph; Authors: Shaposhnik, Roman, Martella, Claudio, Logothetis, Dionysios" from which I tried to run the first example on Page 111 (Twitter Followership Graph).

            Please find the below error while trying to run the changed pom.xml file with the hadoop version on the cluster 2.6.0-mr1-cdh5.12.0

            ...

            ANSWER

            Answered 2017-Dec-08 at 23:38

            The pom.xml in your book's copy is outdated. Use this one instead. Source: book examples repository on Github.

            Edit:

            You want to use a recent version of hadoop-core, but the most recent one Maven Central Repository (the default respository) offers is the 1.2.1. You will need to use the Cloudera Repository to get the most recent version of the library. To do that, simply add the repository to your pom.xml:

            Source https://stackoverflow.com/questions/47721179

            QUESTION

            Hadoop 1.2.1 is running in local mode despite set mapred.job.tracker value
            Asked 2017-Apr-20 at 17:21

            I am trying to submit a giraph job to a hadoop 1.2.1 cluster. The cluster has a name node master, a map reduce master, and four slaves. The job is failing with the following exception:

            java.util.concurrent.ExecutionException: java.lang.IllegalStateException: checkLocalJobRunnerConfiguration: When using LocalJobRunner, must have only one worker since only 1 task at a time!

            However, here is my mapred-site.xml file:

            ...

            ANSWER

            Answered 2017-Apr-20 at 17:21

            The problem wasn't that hadoop was running in local job mode, the problem is that giraph, configured on another machine, assumed that hadoop was running in local job mode.

            I was submitting the job via gremlin, I needed to add the following line to the its configuration file:

            Source https://stackoverflow.com/questions/43403685

            QUESTION

            Giraph, Hadoop, Spark and Cassandra
            Asked 2017-Apr-02 at 21:44

            Is it possible for me to use Giraph if I have Spark clusters and Cassandra but no Hadoop clusters?

            Currently, I am using GraphX and would like to use Giraph instead. Is this possible considering that I have Spark clusters and am using Cassandra?

            ...

            ANSWER

            Answered 2017-Apr-02 at 04:23

            I have only limited experience with Giraph from years ago, and I never tried using it outside of a Hadoop cluster. But it looks like what you want is at least technically possible if not necessarily easy.

            This code is the companion to Practical Graph Analytics with Apache Giraph. As you can see, it requires Hadoop in the classpath for DoubleWritable and Text, for example, but it does nothing with a Hadoop cluster. Instead, it works with in-memory arrays. It looks like all you need to do is implement compute in the BasicComputation class to do whatever you need with Cassandra as long as you keep Hadoop around as a dependency to help satisfy the type boundaries for BasicComputation.

            I never found Giraph terribly intuitive, but hopefully you can make this unconventional setup work.

            Source https://stackoverflow.com/questions/43164082

            QUESTION

            How to set classpath for a maven project
            Asked 2017-Feb-13 at 02:16

            I created a maven project with eclipse and made jar file from it with this below comand

            mvn package

            when i try to know my mvn project config is true or not with this command

            mvn exec:java -D exec.mainClass="giraph.helloworld.App"

            i get this error :

            failed to execute goal
            org.codehaus.mojo:exec-maven-plugin:1.2.1:java(default-cli) on project helloworld: An exception occured while executing the java class. null: InvocationTargetException: No arguments were provided

            POM.xml setting of project is as follows. I will be so grateful if anyone can help me and specifies the reasons of this error?

            ...

            ANSWER

            Answered 2017-Feb-13 at 02:16

            I think the problem is that your manifest file does not contain information about the entry point (the main class) of your jar. See Setting an Application entry point.

            There are many ways to rectify this problem. You can use maven assembly plugin. For more details, check here

            Source https://stackoverflow.com/questions/42195116

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            No vulnerabilities reported

            Install giraph

            You can download it from GitHub.
            You can use giraph like any standard Java library. Please include the the jar files in your classpath. You can also use any IDE and you can run and debug the giraph component as you would do with any other Java program. Best practice is to use a build tool that supports dependency management such as Maven or Gradle. For Maven installation, please refer maven.apache.org. For Gradle installation, please refer gradle.org .

            Support

            For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            CLONE
          • HTTPS

            https://github.com/apache/giraph.git

          • CLI

            gh repo clone apache/giraph

          • sshUrl

            git@github.com:apache/giraph.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link