Monday, April 4, 2016

Which freaking big data programming language should I use?

Ian Pointer, senior consultant at Mammoth Data, wrote this article. He basically talks about the variety of programing language such as R, Python, Scala, and Java. These are all well-known programing language that programmers use at their school or work. He is very experience at his field and in this article he give advice on what program is based for doing specific kind of task.

Furthermore, he goes on to explain what does it program do and who are its likely user. First i didn't know this program before i read this article. R is programing language and is often called "a language for statisticians built by statisticians." This means there are other program that are only used by certain group of people or certain group of people who works in same field. 

Next, he talks about Python. I have used this program before but I can’t say that I know python inside and out. "Python has been very popular in academic for more than a decade." Hence I had to learn it in school. According to Mr. Pointer, “Python tends to be supported in big data processing frameworks, but at the same time, it tends not to be a first-class citizen.”

Next, Scala is the programing language that I haven’t heard of before. I am not sure what are its strength and weakness. But he does points out that he is fond of this program. “Scala is a mostly successful marriage of the functional and object-oriented paradigms, and it's currently making huge strides in the financial world and companies that need to operate on very large amounts of data, often in a massively distributed fashion (such as Twitter and LinkedIn).”

Lastly, he talks about Java. He criticizes java, and I am not sure what I feel about it because I used Java before but I never pay attention to other things concerning Java. Even though he criticizes Java, he goes on to say that it is a good tool for Big Data projects.

I think it was good article to learn more about programming language and what are it ups and down. I have just summarized what he says on article, but if you readers go there, I am sure you will find it very interesting.

At the end he goes on saying, “if you're doing heavy data analysis with obscure statistical calculations, then you'd be crazy not to favor R. If you're doing NLP or intensive neural network processing across GPUs, then Python is a good bet. And for a hardened, production streaming solution with all the important operational tooling, Java or Scala are definitely great choices.” I think it was good advice because there are many new programmers who may not have the experience and looking for a quick answer, and this article gives them the answers and advice they need.


Citation

Pointer, Ian. "Which Freaking Big Data Programming Language Should I Use?" InfoWorld. InfoWorld, 1 Apr. 2016. Web. 04 Apr. 2016.

http://www.infoworld.com/article/3049672/application-development/which-freaking-big-data-programming-language-should-i-use.html

No comments:

Post a Comment