In October 2013, I received my Ph.D. in Computer Science from the University of Montpellier 2, France, in collaboration with the University of New Caledonia. My thesis was carried out under the supervision of Maguelonne Teisseire, Nazha Selmaoui-Folcher, Sandra Bringay, and Frédéric Flouvat. This work addressed the problem of spatiotemporal pattern mining in geo-referenced data, focusing more specifically on health and environmental issues.
Previously, I received my Master's degree from the University of Montpellier 2 (France) in Computability, Algorithms and Network Management and Security in June 2010, and I obtained my degree in Systems Engineering at Andina University in 2004 (Cusco, Peru).
Currently, I am a professor in the Faculty of Engineering at the Universidad del Pacífico in Lima, Peru (Bitmap team). I am also part of the Artificial Intelligence team at the Pontificia Universidad Católica del Perú (IA-PUCP team).
Further, I am co-founder of SIMBig, which is one of the first conferences in Latin America grouping related areas such as Data Science, Big Data, and Natural Language Processing, among others. Currently, SIMBig is celebrating its 7th edition.
My research focuses on knowledge extraction from heterogeneous data. I concentrate mainly on techniques for extracting complex patterns from data involving spatiotemporal dynamics. These studies aim to tackle real problems associated with biodiversity, the environment, and health, among others. Some of my contributions are available on DBLP and Google Scholar.
I am also interested in the dissemination of areas such as Data Mining, Machine Learning, and Big Data. This is reflected in several conferences where I served as General Chair or as a member of the Organizing Committee, including Edunine 2019, CIARP 2016, WRPIAA 2014, and NLDB 2014. In the same context, my team and I proposed the WinCOPPER algorithm, a new Python implementation for extracting sequential patterns under constraints.
Funded research projects
Over the last four years, I have participated, as the lead or as a member of the research team, in several projects financed by local and/or international organizations. Some of these projects are:
Computational Approaches for the Analysis of Urban Mobility Data
The goal of this project is to analyze urban mobility problems and phenomena present in cities around the world, and to design data modeling algorithms that help identify patterns in mobility data from big cities. This SticAmSud project starts in 2021 and is funded by Concytec.
COViD: COntrol de Virus Dinámico
The aim of the COViD project is the Peruvian implementation of a digital contact tracer. In this context, we aim to estimate the probability of contagion based on data about infected people and to visualize this information for policymakers. This project, funded by Concytec, ended in February 2021.
Big data analysis over cellphone usage and consumption to estimate unbanked people indebtedness capacity for financial inclusion
This project aims to extract financial health indicators for unbanked inhabitants from data related to the use and location reported by cell phones. To achieve this aim, we apply big data analytics techniques to support the development of evidence-based public policies for traditional and/or digital financial inclusion. This project is still under development.
Feasibility of using geo-referenced photographs in the design of agricultural insurance
This project aims to evaluate the feasibility of using geo-referenced photographs periodically captured by farmers from their cell phones to design low-cost agricultural insurance aimed at small and medium-sized commercial farmers. This project is still under development.
Development of a decision support system, using drones, for adaptation to climate change in high-Andean agriculture
This project aims to build a decision support system based on the results of the analysis of heterogeneous data associated with agriculture. The data at our disposal include surveys, meteorological data, hydrological data, and images from multispectral cameras, among others. This project is funded by the local government, specifically by the National Institute of Agrarian Innovation INIA-PNIA.
Low cost microscope for smartphones: Technological solution for the detection of diseases in cattle
This project aims to propose a new technology for the early detection of subclinical mastitis in cattle. Our proposal includes the adaptation of a device for capturing images and a microscope to a micro-computer. Also, an algorithm for counting somatic cells was proposed. This project ended in April 2018.
Smart shopping: Smart purchase tool
This project aims to construct an algorithm that smartly compares products from different stores in order to propose the best price to clients. The idea behind our proposal is to compare features of the products based on a distance measure. This project ended in September 2017, and our algorithm is protected by intellectual property rights in Peru.
PEDESTAl: Prediction models for Energy consumption based on big data analytics of population DEnsity and SpaTio-social Activities
The proposed project aims to design and implement prediction models for energy (electricity) consumption, relying on the classification and mapping of human activities over time as well as the dynamic estimation of population density. Big Data Analytics, Data and Text Mining, Machine Learning, and Social Network tools will be investigated and applied to achieve our goal. This project was funded by Stic-AmSud - Fondecyt.
ANIMITEX project: image analysis based on textual information
The amount of textual data available on the web opens up new knowledge extraction domains. Some original methods allow users to combine different types of data in order to extract relevant information. In this context, the main objective of the ANIMITEX project is to combine spatial and textual data. This project was funded by CNRS - France.
The BirthDAY project aims to provide new, efficient decision-making tools to support agricultural development and biodiversity protection in Peru. More precisely, it aims to develop a new platform for acquiring new data, extracting knowledge, and sharing useful information and knowledge among the different actors involved in the agriculture and biodiversity domains in Peru.
My teaching trajectory started in 2004 with some courses in technical schools. The most recent ones are:
- Universidad del Pacífico, Peru
- Data engineering (undergraduate degree, 2016 - now)
- Data mining (undergraduate degree, 2016 - now)
- Web mining (undergraduate degree, 2016 - 2017)
- Pontificia Universidad Católica del Perú, Peru
- Aplicaciones de las Ciencias de la Computación (undergraduate degree, 2014 - 2016)
- Web analytics (master's degree, 2017 - now)
- Information visualization (master's degree, 2015 - now)
- Text mining (master's degree, 2015 - now)
- Ecole d'informatique EPSI, Montpellier, France
- Data Warehouse and OLAP (undergraduate degree, 2013 - 2014)
- Université de Montpellier 2, France
- Extraction des connaissances à partir des données (master's degree, 2012 - 2014)
- Introduction à la fouille de données (master's degree, 2015 - now)
- Introduction à l’Algorithmique et la programmation (undergraduate degree, 2010 - 2014)
- Université Paul Valéry, Montpellier, France
- Méthodologie et traitement de données, analyse de données (undergraduate degree, 2011 - 2012)
- Université Paul Valéry, Montpellier, France. MIASH Master. Jan-Feb 2020
- Pontificia Universidad Católica del Ecuador - Sede Ambato, Ambato, Ecuador. EIS. Nov, 2016
- Ana Paula Galarreta Asian, Ph.D. candidate, PUCP. Spatiotemporal predictions from frequent substructures.
- Luis Jara-Obregón, Ph.D. candidate, PUCP. Identification of socio-economic levels in cities based on data mining techniques and digital image processing.
- Guillermo Rodríguez-López, Ph.D. candidate, PUCP. Measuring the resilience of social structures based on heterogeneous data.
- Hilda Samame Jimenez, Master student, PUCP. Recommender Systems using Temporal Restricted Sequential Patterns.
- Romer Vargas Otiniano, Master student, PUCP. Analysis of the Influence of Social Classes on the Shops Attractiveness.
- Peter Montalvo García, Master student, PUCP. Generation of differentially private synthetic data for data mining.
- Ariana Quispe Porras, Undergraduate student, PUCP. Análisis del turismo en el Perú usando técnicas de graph mining.
- Pavel Rojas Bustamante, Master student, PUCP.
- Julianna Milagros Apumayta Lopez, Master student, PUCP.
- Kevin Alvarez Mouravskaia, Master student, PUCP.
- Iván Darío Peñaranda Arenas, Master student, PUCP.
- Ian Paul Brossard Núñez, Master student, PUCP.
- Mauro Antonio Leon Payano, Master student, PUCP.
- José Luis Barturén-Larrea, Master student, PUCP.
- Oscar Antonio Díaz-Barriga, Master student, PUCP.
- Natali Flores-Lafosse, Master student, PUCP.
- Rodrigo Ricardo Maldonado-Cadenillas, Undergraduate student, PUCP.
- Wissame Laddada, "Discovering new spatial relationships using text mining methods". Supervision of Master Research Internship, Université de Montpellier 2.
- Pierre Accorsi, "Spatiotemporal pattern visualization: Three different approaches". Supervision of Master Research Internship, Université de Montpellier 2.
- Mickaël Fabrègue, "Efficient spatiotemporal sequential pattern mining". Supervision of Master Research Internship, Université de Montpellier 2.