My name is Juan Manuel Caicedo Carvajal, a software engineer interested in text mining, information retrieval and information extraction. I'm currently a graduate student at the Language Technologies Institute at Carnegie Mellon University.
Before coming to Carnegie Mellon, I worked during the last five years on different areas, for instance:
- Distributed systems
- I worked at CERN as a member of the team responsible for the data acquisition of the LHCb experiment, where I participated in the administration of the computing infrastructure. My functions included managing the distributed file system and a 4400-core computing farm.
- Text mining and vertical crawling
- I have worked on research projects to analyze the content of a large volume of documents. The most recent project is an Ngram search engine that allows you to explore the trends on Colombian news articles in the past 20 years.
- Web related technologies
- I have a strong background on web related technologies, from front-end to server-side development, including load time optimization, load balancing, database replication, and HTTP cache configuration.
In addition to those areas, I have worked in personal projects involving data analysis and data visualization (1, 2, 3) and crowd-sourcing applications. I am also interested in programming languages and social network analysis.
I have been fortunate enough to live in different cities and countries. I was born in Popayán (Colombia) and lived there for 18 years. Then I moved to Cali, Lyon (France), Bogotá, Geneva, and now I live in Pittsburgh.
The best way of contacting me is by email. My address is firstname.lastname@example.org.
You also can see my other identities on the Web:
- Twitter : What am I doing right now?
- del.icio.us : My bookmarks.
- Flickr : Some photos.
- Google Reader Shared Items : Feeds I read.
- My Amazon wish list : Be kind, buy me something :-P.