Big Data Tag Directory

Browse Alphabetically:

Featured Big Data Questions

  • SQL Server – SSIS

    what is SSIs in Sql Server?

    90497265505 pointsBadges:
  • Get the name of the column to identify an error in SSIS

    Hello, please I need your precious help. I need a script task using C# identify the name of the column that had a data error. Total thanks!

    felalliap5 pointsBadges:
  • Ratios and Percentages in SPSS

    How to capture variables values of a ratio like 5:1 and a percentage value like 50% in SPSS. I have a questionnaire that I am analyzing that involves capturing percentages and ratios but I cant enter them in SPSS. I have a question like how do you rate your performance? Ratio of computers to...

    esekimpi5 pointsBadges:
  • Career path to becoming a big data analyst

    Hello everyone, By way of introduction, I am Sean and I am looking for a way out of my confusion as to which path to apply to become an analytics professional in the Big data field. I don't belong to the IT background but have a very deep interest in technical knowhow. I have done Business...

    kirktt2005 pointsBadges:
  • Should I try a web analytics career?

    I am a System Admin with 6 Years experience. I happen to come across Web Analytic and thinking of having this as a second career. Should I go for it?

    SameerAshfaq5 pointsBadges:
  • Multiple Regression SPSS

    Hi all, Can someone with knowledge of SPSS tell me how to test Homogeneity in Multiple Regression by using Levenes Test?

    shav4life5 pointsBadges:
  • Repetition of code for a different customers in SPSS

    Suppose I have a table with all the clients in an enterprise order by descending order. I want a code that repeats a series of steps every time a new customer number is read. Example: When I read the client 1 certain steps are execute when the program reaches the customer two this same steps are...

    latristain5 pointsBadges:
  • Bounce rate in analytics

    Are there any free tools for calculate bounce rate in analytics?

    robertnus1235 pointsBadges:
  • Is it an option to extract data from SAP MM using ETL tools such talend, SSIS or PowerCenter in order to load data into Oracle DW?

    Instead of using BW or Hanna, is it a good alternative to design and populate an Oracle Data Warehouse, using ETL tools such Talend, SSIS or PowerCenter to extract data from SAP Modules? Any suggestion about an outstanding open source solution of query & BI tools to access this Oracle DW?

    MarceloMSP5 pointsBadges:
  • NoSQL Database(s)

    I am knew to this tech. I understand NoSQL is not relational. Could old COBOL file structures or such systems be considered as NoSQL type since we could put different record structures with variable lengths (dependent on array values) and we could design a UI as a record (incorporating arrays as...

    Chegutu465 pointsBadges:
  • Error when configuring Hadoop on CentOS

    I've been configuring Hadoop on one of our servers that's running CentOS. When I run, I keep getting this error: WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable So I'm not sure what to do here. Has anyone...

    ITKE371,720 pointsBadges:
  • ElasticSearch for real-time statistics

    We have to log millions and millions of small log documents on a weekly basis. This includes: Ad hoc queries for data mining Joining and filtering values Full-text search with Python We thought about using HBase and running Hadoop jobs to generate stat results. But the problem is that the results...

    ITKE371,720 pointsBadges:
  • Small business KPI’s in startup

    What are the classic KPI's for small business in each phase of the business life cycle, specifically in the start-up phase?

    Cmichaud5 pointsBadges:
  • Hadoop error when accessing application through HTML

    We have installed Hadoop on our cluster and now we're installing HTTPFS to access the HDFS content using HTTP protocol. We're able to access the normal page but when we tried to access HDFS, we're getting an error: {"RemoteException":{"message":"User: ubantu is not allowed to impersonate ubantu",...

    ITKE371,720 pointsBadges:
  • Out of memory error when installing Hadoop

    I recently tried to install Hadoop following a document my friend gave me. When I tried to execute this: bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs[a-z.]+' I got this exception: java.lang.OutOfMemoryError: Java heap space Has anyone seen this before? Like I said, I'm pretty new to...

    ITKE371,720 pointsBadges:
  • JAVA_HOME is not set correctly when installing Hadoop on Ubuntu

    I've been trying to install Hadoop on Ubuntu 11.10. I just set the JAVA_HOME variable in the file conf/ to: # export JAVA_HOME=/usr/lib/jvm/java-1.6.0-openjdk Then I tried to execute these commands: $ mkdir input $ cp conf/*.xml input $ bin/hadoop jar hadoop-examples-*.jar grep input...

    ITKE371,720 pointsBadges:
  • Hadoop: What’s the difference between Pig and Hive?

    I'm pretty new to the Hadoop world (been using it for about a month) and I've started to get into Hive, Pig and Hadoop using Cloudera's Hadoop VM. Is there a difference between Pig and Hive? I understand they have similar commands so I'm trying to figure out the big differences.

    ITKE371,720 pointsBadges:
  • What’s the difference between S3 and S3N in Hadoop?

    When we recently connected our Hadoop cluster to our Amazon storage and downloaded a file to HDFS, we noticed that s3:// didn't work but when we tried out S3N, it worked. Why didn't it work with S3? Is there a difference between the two?

    ITKE371,720 pointsBadges:
  • Hadoop: Safemode recovery is taking too long

    We have a Hadoop cluster with 18 data nodes. We recently restarted the name node about three hours ago and it's still in safe mode! We're not sure if we should try to restart it. We looked online and found this to try: dfs.namenode.handler.count 3 true Should we try this? If not, has anyone seen...

    ITKE371,720 pointsBadges:
  • Hadoop: How to handle data streams in real-time

    I've recently been working with Hadoop and now I'm using it to handle data streams in real-time. For this, I would like to build a meaningful POC around it so I could showcase it. I'm pretty limited in resources so any help would be appreciated.

    ITKE371,720 pointsBadges:

Big Data Tags - SSA to Uns

Browse Alphabetically:

Forgot Password

No problem! Submit your e-mail address below. We'll send you an e-mail containing your password.

Your password has been sent to:

To follow this tag...

There was an error processing your information. Please try again later.

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Thanks! We'll email you when relevant content is added and updated.