• Big data books to start a career

    I apologize if this isn't the right area to ask but I'm looking to get into the big data field (I would like to work in the industry) so would anyone happen to know of some great books on big data? I'm looking for anything on Hadoop or HBase. Thanks so much!

    ITKE (1,119,375 points)
  • How to start a big data project

    I want to do a project on big data, but I don't know how to start. Can anyone guide me?

    Jamvant (5 points)
  • How to know which region server to write to in HBase

    Is there a way in HBase for operations to know which region server a row should be written to? Also, when several rows need to be read, how are multiple region servers contacted and the results retrieved? Thanks so much.

    ITKE (1,119,375 points)
  • Is HBase a better choice than Cassandra for big data?

    We're trying to decide which software would be best for us when it comes to our big data. We're currently choosing between HBase and Cassandra (with Hadoop) and we're leaning more toward HBase. Do you think HBase is a better choice for us? Is there really any difference between the two?

    ITKE (1,119,375 points)
  • Fetch data from HBase table in Spark

    We have this huge table in HBase that's named UserAction. It has three different column families. We're trying to fetch all of the data from one column family as a JavaRDD object. We've tried using the code below but it's not working. What else can we do? static SparkConf sparkConf = new...

    ITKE (1,119,375 points)
  • Document database for big data

    My department has around 100 million records in a database. Roughly 65% of the records are deleted on a daily basis and roughly the same number of records are added. We feel like a big data document database like HBase, Cassandra or Hadoop could do this for us but we're not sure...

    ITKE (1,119,375 points)
  • Process range of HBase rows using Spark

    We've been using HBase as a data source for Spark. We've already created an RDD from an HBase table but we can't figure out a way to create an RDD for a range scan. Does anyone know how to do it?

    ITKE (1,119,375 points)
  • Apache HBase: How to count number of rows quickly

    Currently in Apache HBase, I've been implementing row count over ResultScanner, like this: for (Result rs = scanner.next(); rs != null; rs = scanner.next()) { number++; } But my data is starting to reach the millions of rows, so the computation is getting expensive. I'm trying to compute it in real time but I would like to...

    ITKE (1,119,375 points)
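
On the region server question above: HBase splits a table into regions, each covering a contiguous range of row keys, and the client finds the region whose range contains a given row key. The sketch below is a simplified in-memory model of that lookup, not the HBase client API. The class RegionDirectory, its methods, and the server names rs1 to rs3 are all hypothetical and exist only for illustration. It also shows how a multi-row read could be grouped into one batch per server.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Simplified model of region lookup. This is NOT the HBase client API;
// every name here is hypothetical.
final class RegionDirectory {
    // Maps each region's start key to the server assumed to host it.
    // HBase orders keys as bytes; String ordering matches that for ASCII keys.
    private final TreeMap<String, String> startKeyToServer = new TreeMap<>();

    void addRegion(String startKey, String server) {
        startKeyToServer.put(startKey, server);
    }

    // The owning region is the one with the greatest start key <= the row key.
    String serverFor(String rowKey) {
        Map.Entry<String, String> entry = startKeyToServer.floorEntry(rowKey);
        if (entry == null) {
            throw new IllegalStateException("no region covers row " + rowKey);
        }
        return entry.getValue();
    }

    // Groups row keys by server so each server receives a single batch.
    Map<String, List<String>> groupByServer(List<String> rowKeys) {
        Map<String, List<String>> batches = new TreeMap<>();
        for (String row : rowKeys) {
            batches.computeIfAbsent(serverFor(row), s -> new ArrayList<>()).add(row);
        }
        return batches;
    }

    // Three made-up regions: ["", "g"), ["g", "p"), ["p", end of table).
    static RegionDirectory demo() {
        RegionDirectory directory = new RegionDirectory();
        directory.addRegion("", "rs1"); // the first region starts at the empty key
        directory.addRegion("g", "rs2");
        directory.addRegion("p", "rs3");
        return directory;
    }

    public static void main(String[] args) {
        RegionDirectory directory = demo();
        System.out.println(directory.serverFor("hbase"));
        System.out.println(directory.groupByServer(List.of("apple", "kiwi", "zebra", "banana")));
    }
}
```

A sorted map keyed by region start key is used because the lookup is a "floor" search. The same ordering is what makes a range scan touch only the regions whose key ranges overlap the requested range.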
