I am a Research Scientist at the National Institutes of Health (NIH), USA. I am part of the Machine Learning Team at the National Institute of Mental Health (NIMH-NIH).
I have expertise and research experience in the areas of (1) data science, (2) natural language processing (NLP), (3) machine learning, and (4) knowledge bases applied to health and biomedicine.
I am also committed to the development of computer science in South America, especially in Peru. To that end, in 2014 I co-founded SIMBig (the International Conference on Information Management and Big Data). SIMBig has positioned itself as one of the most important conferences in South America on topics related to data science, artificial intelligence, and machine learning.
|National Institutes of Health (NIH) (Current position)|
|University of Florida|
|Xerox Research Centre Europe (now Naver Labs)|
|Alcatel Lucent – Bell Labs (now Nokia Bell Labs)|
|Orange - France Télécom|
|Touring y Automóvil Club del Perú|
I received my PhD in Computer Science from the University of Montpellier, France, working at LIRMM (Laboratory of Informatics, Robotics, and Microelectronics of Montpellier). My advisors were Clement Jonquet, Mathieu Roche, and Maguelonne Teisseire.
I have a Master's degree in Artificial Intelligence and Decision Support Systems from INSA Lyon and the University of Lyon 1, France. I also have a Master's degree in Business Intelligence and Statistics from the University of Lyon 2 and an Engineering degree in Computer Science from the Universidad Nacional Mayor de San Marcos, Lima, Peru.
- SIFR Project
The objective of this project was to investigate the scientific and technical challenges in building ontology-based services that leverage biomedical knowledge for indexing, mining, and retrieval of biomedical data. The project specifically addressed issues related to the processing of French biomedical data, in collaboration with NCBO (Stanford University). The fields related to this work are Natural Language Processing, Text Mining, and the Semantic Web.
- BIrthDAY Consortium
This project aims to provide new, efficient decision-making tools that support agricultural development and biodiversity protection in Peru. More precisely, it aims to develop a new platform for acquiring and sharing data, extracting knowledge, and sharing useful information and knowledge among the different actors involved in agriculture or biodiversity in Peru.
- OC-2-KB - Obesity and Cancer to Knowledge Base system (2017).
- BioTex - A system that extracts biomedical terms from free text. The extracted terms are ranked using different measures (2014-2015).
- Targeted Summarization of Medical Data Based on Implicit Queries: A system and method for targeted summarization of a patient’s electronic medical records. A component transforms the health records of the patient into representations in a multidimensional search space (2013-2014).
- University of Florida, Informatics Institute, Post-Doctoral Fellowship Program 2017/2018.
- IEEE BIBM 2016 student travel award.
- Award for professional excellence in Computer Science from the National University of San Marcos, Peru.
- Best Mini-Project (Hospital Finder) - 10th Summer School on Ontology Engineering and the Semantic Web (SSSW 2013).
- Scholarships for research studies from FINCyT programs in Peru, the most recent obtained in 2013.
- Various chess trophies and awards (2000-2004).
- Co-founder of SIMBig conference: International Conference on Information Management and Big Data.
- Head of the BIrthDAY consortium: BIg Data for Agriculture and biodiversitY.
- General co-chair of the 7th International Conference on Information Management and Big Data - SIMBig 2020, Lima, Peru.
- General co-chair of SIMBig 2019, SIMBig 2018, SIMBig 2017, SIMBig 2016, SIMBig 2015, and SIMBig 2014.
- General co-chair of the Special Session "Learning from heterogeneous data analytics (EAST'2017)" - FLAIRS 2017 - AAAI.
- Organizing committee of the 19th International Conference on Application of Natural Language to Information Systems - NLDB 2014, Montpellier, France.
- Local helper for chairs of the 10th European Semantic Web Conference - ESWC 2013, Montpellier, France.
Invited Seminar Presentations
- 2015: Invited speaker at the Seminar on Big Data, Data Mining - 2015, Universidad Nacional Mayor de San Marcos, Lima, Peru.
- 2015: Invited speaker at the Seminar on Data Mining - 2015, Universidad del Pacífico, Lima, Peru.
- 2015: Invited speaker at the International Seminar on Data Mining - 2015, Universidad Tecnológica del Perú, Lima, Peru.
- 2014: Invited speaker at the 1st Workshop on Pattern Recognition and Applied Artificial Intelligence - WRPIAA 2014, Lima, Peru.
Data Science - Python
Period: 2016, 2017, 2018, and 2019 (Lecture and Tutorial)
PhD and Master's students, at the University of Florida, USA
Advanced Web Programming - PHP
Period: 2014-2015 (Tutorial)
Bachelor (second year), at the University Institute of Technology in Béziers (IUT Béziers)
Web Development - PHP
Period: 2014-2015 (Tutorial)
Bachelor (first year), at the University Institute of Technology in Béziers (IUT Béziers)
Data Mining
Period: 2013-2014 (Projects)
Master (first year), at the Faculty of Science - University of Montpellier
Databases and Information Systems
Period: 2013-2014 (Tutorial)
Bachelor (third year), at the Faculty of Science - University of Montpellier
NoSQL Databases
Period: 2013-2014 (Lecture and Tutorial)
Master (second year), at the École Privée des Sciences Informatiques (EPSI)
Subjects: Introduction, The NoSQL Movement, Basic Concepts, Techniques and Patterns, Key-Value Stores, Document Databases (Apache CouchDB, MongoDB), Column-Oriented Databases (Cassandra).
Algorithms and Programming
Period: 2013-2014 (Tutorial)
Bachelor (first year), at the Faculty of Science - University of Montpellier
Database Administration, ORACLE 11g
Period: 2012-2013 (Lecture and Tutorial)
Master (first year), at the École Privée des Sciences Informatiques (EPSI)
Subjects: Data Dictionary, User administration, Privileges, Physical and logical structures, Disk files, System Global Area, Library cache, Data dictionary cache, Program Global Area, User processes, Connections and sessions, Concurrency and locking.