About
My name is Juan Manuel Caicedo Carvajal, and I am a software engineer interested in text mining, information retrieval, and information extraction. I'm currently a graduate student at the Language Technologies Institute at Carnegie Mellon University.
Before coming to Carnegie Mellon, I spent five years working in several areas, including:
- Distributed systems
- I worked at CERN as a member of the team responsible for the data acquisition of the LHCb experiment, where I helped administer the computing infrastructure. My responsibilities included managing the distributed file system and a 4400-core computing farm.
- Text mining and vertical crawling
- I have worked on research projects that analyze the content of large volumes of documents. The most recent project is an n-gram search engine that lets you explore trends in Colombian news articles from the past 20 years.
- Web-related technologies
- I have a strong background in web-related technologies, from front-end to server-side development, including load time optimization, load balancing, database replication, and HTTP cache configuration.
In addition to those areas, I have worked on personal projects involving data analysis and data visualization (1, 2, 3) and crowd-sourcing applications. I am also interested in programming languages and social network analysis.
Regarding programming languages, I'm fluent in Python, Scala, Java, and JavaScript. I prefer Python for simple data analysis projects and for web applications (using the Django framework). I use Scala and Java for more complex projects that require libraries like Lucene and Hadoop. I've been using JavaScript for data visualization, although I'm also learning R and its excellent ggplot2 library for data analysis and visualization.
I have been fortunate enough to live in different cities and countries. I was born in Popayán (Colombia) and lived there for 18 years. Then I moved to Cali, Lyon (France), Bogotá, Geneva, and now I live in Pittsburgh.
The best way to contact me is by email. My address is juan@cavorite.com.
- Resume (includes publications list)
- LinkedIn Profile
You can also find my other identities on the Web:
- Twitter : What am I doing right now?
- del.icio.us : My bookmarks.
- Flickr : Some photos.
- Google Reader Shared Items : Feeds I read.
- My Amazon wish list : Be kind, buy me something :-P.
Illustration by Laura Acosta. See the full version.
