  • Writing map-only Hadoop jobs

    I'm pretty new to the Hadoop scene and I've run into a problem. Every once in a while, a job only needs the map phase, so I want the map result written directly as output (i.e., the reduce phase isn't needed). Is there a way to do that?

    ITKE376,290 pointsBadges:
  • Should I move my SQL database to NoSQL?

    I have a very large table (roughly 100 million rows and 35 columns) that is currently stored in a SQL database. However, my queries are running very slowly at the moment, so I'm wondering if I should move to NoSQL. But I have a few questions: Which NoSQL database should I use? Is there a way to move...

    ITKE376,290 pointsBadges:
  • How to use Python to parse a 12 GB CSV file

    We currently have a 12 GB CSV file. We're now trying to extract some columns from this data and then write a new CSV file that would load into R for data analysis. But we keep getting this error when we're loading the list before writing the new file. Is there a way we can parse the data row by row...

    ITKE376,290 pointsBadges:
  • What’s the difference between CLI and CQL in Cassandra?

    I'm pretty new to Cassandra and I'm looking to find out the difference between CLI and CQL. Which one is better to use? Also, are there any APIs I can use to query Cassandra using .NET? Thanks so much.

    ITKE376,290 pointsBadges:
  • Putting username and password in MongoDB

    I've been trying to set up the user name and password for my MongoDB so any remote access will ask for it. Does anyone know if there's a way to do it? I've tried this but it still doesn't ask me for a password or user name. use admin db.addUser('theadmin', '12345'); db.auth('theadmin','12345');...

    ITKE376,290 pointsBadges:
  • Where do I download large data for Hadoop?

    I apologize if this is a 'newbie' question but I'm looking for large data (more than 10 GB) to run a Hadoop demo. Does anyone know if/where I can find it?

    ITKE376,290 pointsBadges:
  • Available Scala projects to use Hadoop and MapReduce

    My team has recently begun a big data analytics project and we're considering using Scala. Are there any Scala projects available for writing Hadoop and MapReduce programs?

    ITKE376,290 pointsBadges:
  • How to reduce the file size of a MongoDB database

    My friend and I have a MongoDB database which was pretty large (over 3 GB). We recently deleted a bunch of documents, files, etc., and we thought that would cause the size of the database files to go down. But since MongoDB keeps allotted space, our files are still big. We read that mongod --...

    ITKE376,290 pointsBadges:
  • Apache HBase: How to count number of rows quickly

    Currently in Apache HBase, I've been implementing row count over ResultScanner, like this: for (Result rs = scanner.next(); rs != null; rs = scanner.next()) { number++; } But my data is starting to reach the millions of rows, so the computation is expensive. I'm trying to compute it in real time but I would like to...

    ITKE376,290 pointsBadges:
  • How to use large datasets in Hadoop

    Would anyone happen to know of any large datasets to experiment with in Hadoop at low cost? I need at least 1 GB of data, ideally production log data from a web server. I would appreciate any help available. Thank you.

    ITKE376,290 pointsBadges:
  • List all collections in a MongoDB shell

    I'm currently in a MongoDB shell and I was wondering if anyone knows how to list all of the collections for the current database I'm using. I can't find it anywhere at all. Thanks!

    ITKE376,290 pointsBadges:
  • Is there a performance difference between Java and Python on Hadoop?

    I've been working on a project in Hadoop for quite some time and now I'm trying to incorporate Java and provide support for Python. Does anyone know if there's any performance impact when it comes to choosing between the two? Any help would be appreciated.

    ITKE376,290 pointsBadges:
  • Running a simple Java program on AWS instances

    Hi, I am a little confused about the steps to run a simple Java program on AWS instances. I have launched four EC2 instances. Can you please guide me through the steps to run a simple Java program on four AWS instances, and also to run a map-reduce word_count.Java program on AWS instances...

    preeti0110 pointsBadges:
  • Tool for migrating DTS to SSIS

    What is the tool for migrating DTS to SSIS, and how effectively does it work? I want to migrate 500 DTS packages to SSIS. How much time will it take?

    Lingala5 pointsBadges:
  • Tibco Training

    How much training time is necessary for an experienced IT package consultant to provide useful Tibco support to a project team, and in what capacity? What can be accomplished in two to three weeks of SELF training?

    Masks5555 pointsBadges:
  • Sunsetting an analytic database

    Are there templates to follow for sunsetting a major analytic database within the healthcare sector?

    vooloo225 pointsBadges:
  • Does the latest stable Hive (1.0.0) support faster queries and ACID transactions?

    According to this doc (http://hortonworks.com/wp-content/uploads/2013/12/StingerTechnicalPreviewInstall.pdf), Stinger is coming to work with Hive (0.13) for full support of SQL queries. Another advantage is faster performance, and in order to check out the performance advantage of...

    nicknicknick5 pointsBadges:
  • How can one use HDFS as backend storage for Squid-proxy’s cache?

    HDFS is a critical part of the Hadoop landscape and provides distributed filesystem capabilities. How can one utilize HDFS for storing Squid-proxy's cache data? For example: the first time a YouTube video is downloaded, it's stored in Squid-proxy's cache residing on HDFS. The next time, the same request is catered...

    dbaannaeh5 pointsBadges:
  • Best way to migrate big data from Oracle to DB2 for i

    Hi. I've tried some methods (SQL, plain text files...), but I'd like to be sure what the best method is for migrating big data files.

    iredin5 pointsBadges:
  • SAP Basis workload analysis

    This request for help was originally submitted to the Research Assistant on WhatIs.com. I’m a beginner SAP Basis technology consultant, and I'd like to find more information about workload analysis, and maybe about performance and possible issues related to it.

    ResearchAssistant855 pointsBadges:
