Growing Python projects from small to large scale
You need significantly different principles for developing small, medium and large scale software system.When my project started to become big I searched the Internet for some guidelines or best...
View ArticleNLTK under Python 2.7 and SciPy 0.9.0
Python 2.7 has been out for months, but I have been stuck using Python 2.6 since SciPy was not working for Python 2.7. SciPy 0.9 Beta 1 binary distribution has just been released.Normally I try to stay...
View ArticleIs IBM Watson Beginning An AI Boom?
Artificial intelligence fell out of favor in the 1970s, the start of first artificial intelligence winter, and has mainly been out of favor since. In April 2010 I wrote a post about how you can now get...
View ArticleBedtime Science Stories My Science Education Blog
I started a science education blog called: Bedtime Science Stories. Here is a little excerpt from my first post: Can and should a 3 year old girl be into science?I have a 3 year old daughter that has...
View ArticlePractical Probabilistic Topic Models for NLP
Latent Dirichlet Allocation, LDA is a new and very powerful technique for finding the topics in a collection of texts, using unsupervised learning. LDA is a probabilistic topic models. LDA was...
View ArticleCloud Computing For Data Mining Part 1
The first half of this blog post is about selecting a cloud provider for a data mining and natural language processing system. I will compare 3 leading cloud computing providers Amazon Web Services,...
View ArticleNatural language processing in F# and Scala
I do natural language processing in C# 3.5 and Python. My work includes classification, named entity recognition, sentiment analysis and information extraction. Both C# and Python are great languages,...
View ArticleScala, Eclipse and Maven integration tutorial
I have evaluated Scala as a language for cloud computing and Hadoop. One requirement was a robust development environment, with a real build system, a good IDE with code completion and debugging.The...
View ArticleHive, Pig, Scalding, Scoobi, Scrunch and Spark
Comparison of Hadoop FrameworksI had to do simple processing of log files in a Hadoop cluster. Writing Hadoop MapReduce classes in Java is the assembly code of Big Data. There are several high level...
View ArticleScala vs. Haskell vs. Python
Functional programming is on the upswing, but should you bet your career on it, or is it a short-lived technology fad?I have long wanted to use functional programming professionally and for the last...
View ArticleAkka vs. Finagle vs. Storm
Akka, Finagle and Storm are 3 new open source frameworks for distributed parallel and concurrent programming. They all run on the JVM and work well with Java and Scala.They are very useful for many...
View ArticleCollision with the Zeitgeist
It has been 5 years since I started my blog. Back then I was alone with my obscure computer interests: functional programming languages, machine learning and AI.I felt lucky when I met a Python...
View ArticleLISP Prolog and Evolution
I just saw David Nolen give a talk at a LispNYC Meetup called:LISP is Too PowerfulIt was a provocative and humorous talk. David showed all the powerful features of LISP and said that the reason why...
View ArticleBig Data: What Worked?
"Big data" created an explosion of new technologies and hype: NoSQL, Hadoop, cloud computing, highly parallel systems and analytics.I have worked with big data technologies for several years. It has...
View ArticlePractical Scala, Haskell and Category Theory
Functional programming has moved from academia to industry in the last few years. It is theoretical with a steep learning curve. I have worked with strongly typed functional programming for 4 years. I...
View ArticleStatic vs. Dynamic Functional Languages
You can divide functional programming languages into 2 groups: Static and dynamic. Dynamic functional languages: Clojure, Common Lisp, Racket and Scheme. They have few types often only known at run...
View ArticleVM, Lambda, Kubernetes & Terraform Best Practice
I work with these popular cloud technologies.VMs, virtual machines like EC2 or GCEDockerKubernetesTerraformLambda / serverless functionsThis post contains a short introduction to these technologies and...
View ArticleML and Data in AWS, Azure and GCP
Machine learning and data technology are changing fast and the big cloud providers compete with new offerings. This blog is a short introduction to what this looks like in 2019. It is focused on cloud...
View ArticleBenefits of Different Python Distributions on Mac
There are at least 5 popular ways to install Python on OS X / Mac.OS X default Python installation, currently Python 2.7.10Use brew install pythonUse brew install pyenvAnacondaPython pkg installer from...
View ArticleTyped Functional Languages 2019
This post is a brief status of the state of typed functional languages in late 2019.Typed functional languages like Clean, Haskell and OCaml were developed within academia in the 1990s. Around 2010,...
View ArticleF# vs Scala
F# and Scala are both hybrid functional object-oriented languages created for popular virtual machines.F# for CLR / .NETScala for JVM / JavaF# and Scala are now in more direct competition after...
View ArticleHaskell and Hadoop the Aftermath
In 2012 Haskell and Hadoop were the hottest technologies. They had a lot of hype and I loved them. Both were based on functional programming and built on towering abstractions.Elite functional...
View ArticleHaskell IDE 2020
I spent some time getting a working Haskell environment on OS X. Generally the Haskell tooling has improved, but getting an IDE like setup is still tricky and it took me some trial and error.I tried 2...
View ArticleHow Many Languages Should You Program In
I love programming languages. Some would say that I am a language addict. I have programmed in a lot of languages, and written blog posts promoting their use.What is a reasonable number of languages to...
View ArticleDeep Learning From Java and Scala
Deep learning has been dominated by Python for years. It has been much harder to do deep learning on the JVM, but recently there has been some improvements. Here is a brief comparison of popular...
View Article