4 Research Activities

This section presents my main research activities that may help with understanding the impact of my research. Assessing scientific quality is not an easy task and it is still source of debate among many scholars. One of the key issues on this assessment is typically the publication record and associated bibliometric numbers. However, there are many disciplines where other aspects of research activity can be regarded as significant in terms of the impact on the community, and that is frequently the case of Computer Science, the field where I belong. For instance, developing a software tool that is widely used by the community and allows for further advances may well be more important than many publications and yet it is seldom considered as a criterion as it is not easy to assess without being involved in the concrete field of expertise. In my personal case I have developed many software programs that are widely used by the community, like for instance several R packages that have hundreds of monthly downloads. I have also created and still maintain a free regression data set repository that is used in most papers that test new regression algorithms within the machine learning community. This type of contributions, although relevant and important in my personal opinion, are hardly reflected in any type of research bibliometrics (with the exception of the data set repository whose citations are counted by Google Scholar although not all papers that use these data sets, cite the repository).

In spite of the previously mentioned limitations I have collected some data concerning bibliometric information related with my publications with the goal of helping in assessing the impact of this aspect of my research activities. The inclusion of this information requires some clarifications. Bibliometric data involves two main decisions: (i) whether to use it or not; and (ii) if yes, from which source. The first decision is hard to escape - it is the norm nowadays to evaluate CVs based on these numbers. Unfortunately this process often leads to unwanted bad decisions. There is a growing awareness of the research community that these numbers may be seriously misleading, particularly in some disciplines (like Computer Science) where the publication culture is rather different from other disciplines, for instance in the case of international conferences. While in most disciplines works published in conferences are not subject to peer reviewing, that is not the case in CS where international conferences are always peer reviewed and some are far more competitive than most journals. This leads to a clear bias on some of the frequently used bibliometric indices that frequently disregard conferences. Regarding this issue of the bias and risk of misjudgment of bibliometrics it may be interesting to point out several very interesting recent papers on this topic:

  • Diana Hicks, Paul Wouters, Ludo Waltman, Sarah de Rijcke & Ismael Rafols.Bibliometrics: The Leiden Manifesto for research metrics. Nature 520, 429–431 (23 April 2015) doi:10.1038/520429a
  • Paula Stephan, Reinhilde Veugelers & Jian Wang. Reviewers are blinkered by bibliometrics. Nature 544, 411–412 (27 April 2017) doi:10.1038/544411a
  • Rinze Benedictus, Frank Miedema & Mark W. J. Ferguson. Fewer numbers, better science. Nature 538, 453–455 (27 October 2016) doi:10.1038/538453a

These and many other works have been trying to raise the awareness of the community for the danger of these bibliometric-based decisions. Still, as this is still the norm I will present some of these numbers with the list of publications given below.

The second decision concerns the source of the bibliometric numbers. I have considered 3 sources: Thomson ISI, Elsevier Scopus and Google Scholar. All of them have potential drawbacks. Still, the decision was to select Google Scholar (GS). This index overcomes some limitations the other indices have, particularly for Computer Science, as it is clearly much more inclusive, although with an increased risk of inflating numbers. A few examples of my personal CV provide illustrations of the problems. My top cited publication is my book Data Mining with R published by a major publishing house (CRC Press from Taylor and Francis). For some reason that I cannot explain but most probably related with financial issues between the companies involved, this book is not indexed by Scopus. So, a peer reviewed book by a major publisher is non-existent for this widely used bibliometric index; (ii) one of my recent journal publications is on ACM Computing Surveys, one of the top journals of Computer Science according to the 2017 data from Thomson ISI impact factor. At some point in time (12/Dec/2017), according to Google Scholar (GS) this publication from 2016 had 42 citations. According to Scopus this citation number was 20! If we check the concrete citations which is possible on GS we can observe that 6 of these were self-citations, so we can reduce this to 36, but this is still too far from 20. These and other similar effects lead to divergent values like the fact that my h-index according to Scopus is significantly lower than that reported in GS, a very common phenomenon for computer science researchers as mentioned in [^1]. Still, GS is not without problems either. For instance, the same book I’ve mentioned above has two quite different editions currently, the second being more than 150 pages longer, with the other pages having been significantly revised. In spite of this GS insists in not considering these two books which obviously has an impact on the bibliometrics (I’ve inserted it manually but citations are not being counted by GS).

[^1] - Judit Bar-Ilan. Which h-index? — A comparison of WoS, Scopus and Google Scholar. Scientometrics, Volume 74, Issue 2, pp. 257–271.

Summarizing, I have decided to include bibliometric data from Google Scholar, even-though I’m well aware that this is a source of debate but my decision was essentially guided by completeness criteria both in terms of computer science in general and in my particular case. Moreover, in order to try to provide more information on the quality of the peer-reviewed conference papers, I have added the CORE rank of the respective conferences. CORE is a widely used source of information on the computer science conferences reputation. It is used wordwide by many funding agencies and universities where the specificities of the computer science field have been recognised. This austrolasian association provides a rank for international conferences using a scale from A* till Unranked. Further details on the criteria and meaning of this classification schema can be obtained here.

4.1 Publications

Global bibliometrics

Important Note: All citation numbers were obtained from my Google Scholar profile on 2021-05-14.

Global scores:

Total Nr. of Citations h-index i10-index
4702 29 66

Yearly evolution of the number of citations:

Books

## Loading required namespace: bibtex

[1] L. Torgo. Data Mining with R: Learning with Case Studies, Second Edition (chinese edition). China Machine Press (CMP), 2018. ISBN: 9787111596660.
(extra information)

[2] L. Torgo. Data Mining with R: Learning with Case Studies, Second Edition. Chapman and Hall/CRC, 2017.
(11 citations) (document) (extra information)

[3] L. Torgo. Data Mining with R: Learning with Case Studies (Chinese Edition). China Machine Press, 2012.
(document)

[4] L. Torgo. Data Mining with R: Learning with Case Studies. Chapman and Hall/CRC Data Mining and Knowledge Discovery Series. CRC Press, 2010.
(681 citations) (document) (extra information)

[5] L. Torgo. A Linguagem R, programação para a análise de dados. Escolar Editora, 2009.
(document)

Edition of Books

[1] A. Jorge, L. Torgo, P. Brazdil, et al., ed. Knowledge Discovery in Databases: PKDD 2005: 9th European Conference on Principles and Practice of Knowledge Discovery in Databases. LNAI 3721. Springer, 2005.
(3 citations)

[2] J. Gama, R. Camacho, P. Brazdil, et al., ed. Machine Learning: ECML 2005: 16th European Conference on Machine Learning. LNAI 3720. Springer, 2005.
(1 citations)

Chapters in Books

[1] N. Guimarães, L. Torgo, and A. Figueira. “Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons”. In: Social Network Based Big Data Analysis and Applications. 2018, pp. 1-19.
(3 citations) (document)

[2] L. Torgo. “Regression Trees”. In: Encyclopedia of Machine Learning and Data Mining. Ed. by C. Sammut and G. I. Webb. Springer, 2016, pp. 1080-1083.
(document)

[3] L. Torgo. “Model Trees”. In: Encyclopedia of Machine Learning and Data Mining. Ed. by C. Sammut and G. I. Webb. Springer, 2016, pp. 845-843.
(document)

[4] L. Torgo. “Regression Trees”. In: Encyclopedia of Machine Learning. Ed. by C. Sammut and G. I. Webb. Springer, 2011, pp. 842-845.
(5 citations)

[5] L. Torgo. “Model Trees”. In: Encyclopedia of Machine Learning. Ed. by C. Sammut and G. I. Webb. Springer, 2011, pp. 684-686.

[6] L. Torgo and C. Soares. “Resource-bounded Outlier Detection Using Clustering Methods”. In: Data Mining for Business Applications. Ed. by C. Soares and R. Ghani. Frontiers in Artificial Intelligence and Applications. IOS Press, 2010, pp. 84-98.
(10 citations)

[7] P. Flach, H. Blockeel, T. Gartner, et al. “Data Mining and Decision Support, Integration and Collaboration”. In: On the road to knowledge: mining 21 years of UK traffic accident reports. Ed. by D. Mladenic, N. Lavrac, M. Bohanec and S. Moyle. Morgan Kaufmann, 2003, pp. 143-156.
(14 citations)

[8] T. Hellström and L.Torgo. “Post processing trading signals for improved trading performance”. In: Data Mining III. WIT Press, 2002, pp. 437-447.
(document)

[9] P. Brazdil and L. Torgo. “Knowledge Acquisition via Knowledge Integration”. In: Current Trends in Knowledge Acquisition. Ed. by B. e. a. Wielinga. IOS Press, 1990, pp. 90-104.
(87 citations) (document)

Journals

[1] M. Oliveira, L. Torgo, and V. S. Costa. “Evaluation procedures for forecasting with spatiotemporal data”. In: Mathematics 9 (6 2021), pp. 691-718.
(11 citations) (extra information)

[2] M. Etemad, A. Soares, E. Etemad, et al. “SWS: an unsupervised trajectory segmentation algorithm based on change detection with interpolation kernels”. In: GeoInformatica 25 (2021), pp. 269-289.
(2 citations)

[3] V. Cerqueira, L. Torgo, C. Soares, et al. “Model Compression for Dynamic Forecast Combination”. In: arXiv arXiv:2104.01830 (2021).
(extra information)

[4] V. Cerqueira, L. Torgo, and C. Soares. “Model Selection for Time Series Forecasting: Empirical Analysis of Different Estimators”. In: arXiv arXiv:2104.00584 (2021).
(extra information)

[5] M. Alam, L. Torgo, and A. Bifet. “A Survey on Spatio-temporal Data Analytics Systems”. In: arXiv arXiv:2103.09883 (2021).
(extra information)

[6] V. Cerqueira, H. Gomes, A. Bifet, et al. “STUDD: A Student-Teacher Method for Unsupervised Concept Drift Detection”. In: arXiv arXiv:2103.00903 (2021).
(extra information)

[7] L. T. N Guimarães Á Figueira. “An organized review of key factors for fake news detection”. In: arXiv arXiv:2102.13433 (2021).
(extra information)

[8] V. Cerqueira, L. Torgo, and I. Mozetic. “Evaluating time series forecasting models: An empirical study on performance estimation methods”. In: Machine Learning (2020), pp. 1-32.
(32 citations)

[9] M. Monteiro, M. S. Baptista, J. Séneca, et al. “Understanding the Response of Nitrifying Communities to Disturbance in the McMurdo Dry Valleys, Antarctica”. In: Microorganisms 8.3 (2020). ISSN: 2076-2607. DOI: 10.3390/microorganisms8030404.
(4 citations) (document)

[10] I. Areosa and L. Torgo. “Visual interpretation of regression error”. In: Expert Systems 36.6 (2020). DOI: https://doi.org/10.1111/exsy.12621.
(1 citations) (document)

[11] V. Cerqueira, L. Torgo, and C. Soares. “Early Anomaly Detection in Time Series: A Hierarchical Approach for Predicting Critical Health Episodes”. In: arXiv arXiv:2010.11595 (2020).
(document)

[12] V. Cerqueira, L. Torgo, and C. Soares. “Machine Learning vs Statistical Methods for Time Series Forecasting: Size Matters”. In: arXiv arXiv:1909.13316 (2019).
(19 citations) (document)

[13] A. G. G. de Sousa, M. P. Tomasino, P. Duarte, et al. “Diversity and Composition of Pelagic Prokaryotic and Protist Communities in a Thin Arctic Sea-Ice Regime”. In: Microbial ecology ? (2019), pp. 1-21. DOI: https://doi.org/10.1007/s00248-018-01314-2.
(13 citations) (document)

[14] N. Moniz and L. Torgo. “A review on web content popularity prediction: Issues and open challenges”. In: Online Social Networks and Media 12 (2019), pp. 1-20. ISSN: 2468-6964. DOI: https://doi.org/10.1016/j.osnem.2019.05.002.
(4 citations) (document)

[15] A. Figueira, N. Guimaraes, and L. Torgo. “A Brief Overview on the Strategies to Fight Back the Spread of False Information”. In: Journal of Web Engineering 18.4 (2019), pp. 319-352. DOI: https://doi.org/10.13052/jwe1540-9589.18463.
(1 citations) (document)

[16] V. Cerqueira, L. Torgo, F. Pinto, et al. “Arbitrage of Forecasting Experts”. In: Machine Learning 108 (2019), pp. 913-944.
(15 citations) (document)

[17] P. Branco, L. Torgo, and R. P. Ribeiro. “Pre-processing approaches for imbalanced distributions in regression”. In: Neurocomputing 343 (2019), pp. 76-99. DOI: 10.1016/j.neucom.2018.11.100.
(8 citations) (document)

[18] P. Branco, L. Torgo, and R. P. Ribeiro. “REBAGG: REsampled BAGGing for Imbalanced Regression”. In: Proceedings of Machine Learning Research (PMLR) 94 (2018), pp. 1-15.
(7 citations) (extra information)

[19] L. Torgo, S. Matwin, G. Weiss, et al. “Cost-Sensitive Learning: Preface”. In: Proceedings of Machine Learning Research (PMLR) 88 (2018), pp. 1-3.
(document)

[20] P. Branco, L. Torgo, and R. P. Ribeiro. “Resampling with neighbourhood bias on imbalanced domains”. In: Expert Systems 35.4 (2018). DOI: 10.1111/exsy.12311.
(3 citations) (document)

[21] I. Mozetic, L. Torgo, V. Cerqueira, et al. “How to evaluate sentiment classifiers for Twitter time-ordered data?” In: PLOS ONE 13.3 (2018), p. e0194317.
(20 citations) (document) (extra information)

[22] N. Moniz and L. Torgo. “Multi-Source Social Feedback of Online News Feeds”. In: arXiv arXiv:1801.07055 (2018).
(20 citations) (document)

[23] H. Ribeiro, T. de Sousa, J. Santos, et al. “Potential of dissimilatory nitrate reduction pathways in polycyclic aromatic hydrocarbon degradation”. In: Chemosphere 199 (2018), pp. 54-67.
(23 citations) (document)

[24] M. Monteiro, J. Séneca, L. Torgo, et al. “Environmental controls on estuarine nitrifying communities along a salinity gradient”. In: Aquatic Microbial Ecology 80 (2) (2017), pp. 167-180.
(4 citations) (document)

[25] L. Torgo, B. Krawczyk, P. Branco, et al. “Learning with Imbalanced Domains: preface”. In: Proceedings of Machine Learning Research (PMLR) 74 (2017), pp. 1-6.
(document)

[26] P. Branco, L. Torgo, and R. P. Ribeiro. “SMOGN: a Pre-processing Approach for Imbalanced Regression”. In: Proceedings of Machine Learning Research (PMLR) 74 (2017), pp. 36-50.
(23 citations) (document)

[27] N. Moniz, P. Branco, and L. Torgo. “Evaluation of Ensemble Methods in Imbalanced Regression Tasks”. In: Proceedings of Machine Learning Research (PMLR) 74 (2017), pp. 129-140.
(11 citations) (document)

[28] N. Moniz, L. Torgo, M. Eirinaki, et al. “A Framework for Recommendation of Highly Popular News Lacking Social Feedback”. In: New Generation Computing 35 (4) (2017), pp. 417-450.
(6 citations) (document)

[29] N. Moniz, P. Branco, and L. Torgo. “Resampling Strategies for Imbalanced Time Series Forecasting”. In: International Journal of Data Science and Analytics 3.3 (2017), pp. 161-181.
(18 citations) (document)

[30] N. Moniz, L. Torgo, and J. Vinagre. “Data-driven relevance judgments for ranking evaluation”. In: CoRR abs/1612.06136 (2016).
(document)

[31] L. Baía and L. Torgo. “A comparative study of approaches to forecast the correct trading actions”. In: Expert Systems 34.1 (2016), pp. e12169-n/a.
(4 citations) (document)

[32] P. Branco, L. Torgo, and R. Ribeiro. “A Survey of Predictive Modeling on Imbalanced Domains”. In: ACM Comput. Surv. 49.2-31 (2016).
(396 citations) (document)

[33] P. Branco, R. Ribeiro, and L. Torgo. “A UBL: an R package for Utility-based Learning”. In: CoRR abs/1604.08079 (2016).
(36 citations) (document)

[34] N. Moniz and L. Torgo. “Socially Driven News Recommendation”. In: CoRR abs/1506.01743 (2015).
(1 citations) (document)

[35] P. Branco, L. Torgo, and R. Ribeiro. “A Survey of Predictive Modelling under Imbalanced Distributions”. In: CoRR abs/1505.01658 (2015).
(document)

[36] L. Torgo, P. Branco, R. P. Ribeiro, et al. “Re-sampling Strategies for Regression”. In: Expert Systems 32.3 (2015), pp. 465-476.
(77 citations) (document)

[37] L. Torgo. “An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R”. In: CoRR abs/1412.0436 (2014).
(32 citations) (document) (extra information)

[38] J. Vanschoren, J. N. van Rijn, B. Bischl, et al. “OpenML: networked science in machine learning”. In: SIGKDD Explorations Newsletter 15.2 (2013), pp. 49-60.
(617 citations) (document)

[39] B. Drury, L. Torgo, and J. J. Almeida. “Classifying News Stories with a Constrained Learning Strategy to Estimate the Direction of a Market Index”. In: IJCSA 9.1 (2012), pp. 1-22.
(14 citations)

[40] M. Herrera, L. Torgo, J. Izquierdo, et al. “Predictive models for forecasting hourly urban water demand”. In: Journal of Hydrology 387.1-2 (Jun. 2010), pp. 141-150.
(387 citations) (document)

[41] L. Torgo and R. P. Ribeiro. “Modelos de Previsão de Valores Extremos e Raros”. In: Boletim da Sociedade Portuguesa de Estat'istica Primavera 2010 (2010), pp. 15-22.

[42] L. Torgo. “Deteção de fraude usando o R: um caso de estudo”. In: Boletim da Sociedade Portuguesa de Estatística (2009).

[43] R. Ribeiro and L. Torgo. “A Comparative Study on Predicting Algae Blooms in Douro River, Portugal”. In: Ecological Modelling - Selected Papers from the 5th European Conference on Ecological Modelling 212.1-2 (2008), pp. 86-91.
(22 citations) (document)

[44] A. Silva, A. Jorge, and L. Torgo. “Design of an end-to-end method to extract information from tables”. In: International Journal on Document Analysis and Recognition 8.2-3 (2006), pp. 144-171.
(94 citations)

[45] L. Torgo and J. P. Costa. “Clustered Partial Linear Regression”. In: Machine Learning 50.3 (2003), pp. 303-319.
(15 citations) (document)

[46] L. Torgo. “Thesis: Inductive learning to tree-based regression models.”. In: AI Commun. 13.2 (2000), pp. 137-138.

[47] L. Torgo and J. Gama. “Regression using Classification Algorithms”. In: Intelligent Data Analysis 1.4 (1997).
(63 citations) (document)

Full Papers at International Conferences with Peer Reviewing

[1] M. Etemad, Z. Etemad, A. Soares, et al. “Wise Sliding Window Segmentation: A classification-aided approach for trajectory segmentation”. (2020), pp. 208-219.
(2 citations)

[2] C. Bellinger, P. Branco, and L. Torgo. “The CURE for Class Imbalance”. In: Discovery Science. Ed. by P. Kralj Novak, T. Šmuc and S. Džeroski. Springer International Publishing, 2019, pp. 3-17. ISBN: 978-3-030-33778-0.
CORE rank: None
(1 citations) (document)

[3] V. Cerqueira, L. Torgo, and C. Soares. “Layered Learning for Early Anomaly Detection: Predicting Critical Health Episodes”. In: Discovery Science. Ed. by P. Kralj Novak, T. Šmuc and S. Džeroski. Springer International Publishing, 2019, pp. 445-459. ISBN: 978-3-030-33778-0.
CORE rank: None
(1 citations) (document)

[4] I. Areosa and L. Torgo. “Explaining the Performance of Black Box Regression Models”. In: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA). 2019, pp. 110-118.
CORE rank: None
(1 citations) (document)

[5] P. Branco and L. Torgo. “A Study on the Impact of Data Characteristics in Imbalanced Regression Tasks”. In: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA). 2019, pp. 193-202.
CORE rank: None
(document)

[6] M. Etemad, A. Soares, S. Matwin, et al. “On Feature Selection and Evaluation of Transportation Mode Prediction Strategies”. In: Proceedings of the Workshops of the EDBT/ICDT 2019 Joint Conference, EDBT/ICDT 2019, Lisbon, Portugal, March 26, 2019. Ed. by P. Papotti. Vol. 2322. CEUR Workshop Proceedings. CEUR-WS.org, 2019.
CORE rank: None
(document)

[7] N. Guimarães, Á. Figueira, and L. Torgo. “Knowledge-based Reliability Metrics for Social Media Accounts”. In: Proceedings of the 16th International Conference on Web Information Systems and Technologies (WEBIST 2020),. 2019, pp. 339-350. DOI: 10.5220/0010140403390350.
CORE rank: None
(document)

[8] I. Areosa and L. Torgo. “Visual Interpretation of Regression Error”. In: Progress in Artificial Intelligence. Ed. by P. Moura Oliveira, P. Novais and L. P. Reis. Springer International Publishing, 2019, pp. 473-485. ISBN: 978-3-030-30244-3.
(1 citations) (document)

[9] M. Oliveira, N. Moniz, L. Torgo, et al. “Biased Resampling Strategies for Imbalanced Spatio-Temporal Forecasting”. In: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA). 2019, pp. 100-109.
CORE rank: None
(1 citations) (document)

[10] N. Guimarães, Á. Figueira, and L. Torgo. “Contributions to the Detection of Unreliable Twitter Accounts through Analysis of Content and Behaviour”. In: Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2018, Volume 1: KDIR, Seville, Spain, September 18-20, 2018. 2018, pp. 90-99. DOI: 10.5220/0006932800900099.
CORE rank: None
(document)

[11] P. Branco, L. Torgo, and R. Ribeiro. “MetaUtil: Meta Learning for Utility Maximization in Regression”. In: Proceedings of the International Conference on Discovery Science, DS’18. Springer, 2018, pp. 129-143.
CORE rank: None
(3 citations) (extra information)

[12] M. Oliveira, L. Torgo, and V. S. Costa. “Evaluation procedures for forecasting with spatio-temporal data”. In: Proceedings of the ECML/PKDD’2018 Conference. Springer, 2018, pp. ??-??
CORE rank: A
(extra information)

[13] V. Cerqueira, F. Pinto, L. Torgo, et al. “Constructive Aggregation and its Application to Forecasting with Dynamic Ensembles”. In: Proceedings of the ECML/PKDD’2018 Conference. Springer, 2018, pp. ??-??
CORE rank: A
(1 citations) (extra information)

[14] N. Moniz and L. Torgo. “The Utility Problem of Web Content Popularity Prediction”. In: Proceedings of the 29th ACM Conference on Hypertext and Social Media, HT 2018. 2018, pp. 82-86.
CORE rank: A
(document)

[15] Á. Figueira, N. Guimarães, and L. Torgo. “Current State of the Art to Detect Fake News in Social Media: Global Trendings and Next Challenges”. In: Proceedings of the 14th International Conference on Web Information Systems and Technologies, WEBIST 2018, Seville, Spain, September 18-20, 2018. 2018, pp. 332-339. DOI: 10.5220/0007188503320339.
CORE rank: None
(document)

[16] V. Cerqueira, L. Torgo, M. Oliveira, et al. “Dynamic and Heterogeneous Ensembles for Time Series Forecasting”. In: IEEE International Conference on Data Science and Advanced Analytics (DSAA’2017). 2017, pp. 242-251.
CORE rank: None
(document)

[17] P. Branco, L. Torgo, R. P. Ribeiro, et al. “Learning Through Utility Optimization in Regression Tasks”. In: IEEE International Conference on Data Science and Advanced Analytics (DSAA’2017). 2017, pp. 30-39.
CORE rank: None
(document)

[18] V. Cerqueira, L. Torgo, J. Smailović, et al. “A Comparative Study of Performance Estimation Methods for Time Series Forecasting”. In: IEEE International Conference on Data Science and Advanced Analytics (DSAA’2017). 2017, pp. 529-538.
CORE rank: None
(document)

[19] V. Cerqueira, L. Torgo, F. Pinto, et al. “Arbitrated Ensemble for Time Series Forecasting”. In: Proceedings of the ECML/PKDD’2017 Conference. Ed. by M. Ceci, J. Hollmén, L. Todorovski and C. Vens. Lecture Notes in Artificial Intelligence. Springer, 2017, pp. 478-494.
CORE rank: A (BEST STUDENT MACHINE LEARNING PAPER AWARD)
(43 citations)

[20] P. Branco, L. Torgo, and R. P. Ribeiro. “Exploring Resampling with Neighborhood Bias on Imbalanced Regression Problems”. In: Proceedings of 18th EPIA Conference on Artificial Intelligence (EPIA 2017). Ed. by E. Oliveira, J. Gama, Z. Vale and H. L. Cardoso. LNCS 10423. Springer, 2017, pp. 513-524.
CORE rank: B
(3 citations) (document)

[21] V. Cerqueira, L. Torgo, and C. Soares. “Arbitrated Ensemble for Solar Radiation Forecasting”. In: Proceedings of IWANN’2017. Vol. 10305. LNCS. Springer, 2017, pp. 720-732.
CORE rank: B
(6 citations) (document)

[22] P. Branco, L. Torgo, and R. P. Ribeiro. “Relevance-based Evaluation Metrics for Multi-class Imbalanced Domains”. In: Advances in Knowledge Discovery and Data Mining - 21th Pacific-Asia Conference, PAKDD 2017, Jeju, South Korea, May 23-26, 2017, Proceedings. Ed. by J. Kim, K. Shim, L. Cao, J. Lee, X. Lin and Y. Moon. Lecture Notes in Computer Science, vol 10234. Springer. 2017, pp. 698-710.
CORE rank: A
(29 citations) (document)

[23] N. Guimarães, L. Torgo, and A. Figueira. “Lexicon Expansion System for Domain and Time Oriented Sentiment Analysis”. In: Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. Vol. 1: KDIR. 2016.
CORE rank: C
(document)

[24] N. Moniz, L. Torgo, and M. Eirinaki. “Time-Based Ensembles for Prediction of Rare Events in News Stream”. In: IEEE International Conference on Data Mining Workshops, ICDM Workshops 2016, December 12-15, 2016, Barcelona, Spain. 2016, pp. 1066-1073. DOI: 10.1109/ICDMW.2016.0154.
CORE rank: A*
(2 citations) (document)

[25] M. Oliveira, L. Torgo, and V. S. Costa. “Predicting Wildfires: Propositional and Relational Spatio-Temporal Pre-Processing Approaches”. In: Proceedings of Discovery Science 2016. LNAI. Springer, 2016.
CORE rank: None
(3 citations) (document)

[26] N. Moniz, P. Branco, and L. Torgo. “Resampling Strategies for Imbalanced Time Series”. In: Proceedings of DSAA 2016. 2016.
CORE rank: None
(16 citations) (document)

[27] A. Martins, A. Dias, E. Silva, et al. “MarinEye - A tool for marine monitoring”. In: OCEANS 2016 - Shanghai. 2016, pp. 1-7.
CORE rank: None
(document)

[28] L. Nezvaloví, L. Popelínsky, L. Torgo, et al. “Class-Based Outlier Detection: Staying Zombies or Awaiting for Resurrection?” In: Proceeedings of IDA’2015. Vol. 9385. Lecture Notes in Computer Science . Springer, 2015, pp. 193-204.
CORE rank: A
(4 citations) (document)

[29] L. Baia and L. Torgo. “Forecasting the Correct Trading Actions”. In: Proceeedings of 17th Portuguese Conference on Artificial Intelligence, EPIA 2015. LNAI. Springer, 2015, pp. 560-571.
CORE rank: B
(2 citations) (document)

[30] M. Oliveira and L. Torgo. “Ensembles for Time Series Forecasting”. In: Proceedings of Asian Conference on Machine Learning (ACML’2014). Vol. 39. JMLR: Workshop and Conference Proceedings. 2014, pp. 360-370.
CORE rank: None
(57 citations) (document) (extra information)

[31] N. Moniz, L. Torgo, and F. Rodrigues. “Resampling approaches to improve news importance prediction”. In: Advances in Intelligent Data Analysis XIII (IDA’2014). Ed. by B. H.. Vol. 8819. LNCS. Springer, 2014, pp. 215-226.
CORE rank: A
(7 citations) (document)

[32] N. Moniz and L. Torgo. “Improvement of News Ranking through Importance Prediction”. In: Proceeding of KDD’2014 workshop NewsKDD - Data Science for News Publishing. 2014.
CORE rank: C
(3 citations) (document) (extra information)

[33] L. Torgo, R. P. Ribeiro, B. Pfahringer, et al. “SMOTE for Regression”. In: Proceedings of EPIA’2013. Springer, 2013.
CORE rank: B
(88 citations) (document) (extra information)

[34] J. van Rijn, B. Bischl, L. Torgo, et al. “OpenML: A collaborative Science Platform”. In: Proceedings of ECML/PKDD’2013. Springer, 2013, pp. 645-649.
CORE rank: A
(60 citations)

[35] O. Ohashi and L. Torgo. “Spatial Interpolation Using Multiple Regression”. In: Data Mining (ICDM), 2012 IEEE 12th International Conference on. 2012, pp. 1044-1049.
CORE rank: A*
(15 citations) (document) (extra information)

[36] O. Ohashi and L. Torgo. “Wind speed forecasting using spatio-temporal indicators”. In: ECAI 2012 - 20th European Conference on Artificial Intelligence. Ed. by L. D. Raedt, C. Bessière, D. Dubois, P. Doherty, P. Frasconi, F. Heintz and P. J. F. Lucas. IOS Press, 2012, pp. 975-980.
CORE rank: A
(36 citations) (document)

[37] B. Drury, G. Dias, and L. Torgo. “A Contextual Classification Strategy for Polarity Analysis of Direct Quotations from Financial News”. In: RANLP. Ed. by G. Angelova, K. Bontcheva, R. Mitkov and N. Nicolov. RANLP 2011 Organising Committee, 2011, pp. 434-440.
CORE rank: C
(12 citations)

[38] B. Drury, L. Torgo, and J. Almeida. “Classifying news stories to estimate the direction of a stock market index”. In: Information Systems and Technologies (CISTI), 2011 6th Iberian Conference on. 2011, pp. 1-4.
CORE rank: None
(14 citations) (document)

[39] B. Drury, L. Torgo, and J. J. Almeida. “Guided Self Training for Sentiment Classification”. In: Proceedings of International Conference On Recent Advances in Natural Language Processing (RANLP 2011) - ROBUS workshop. RANLP 2011 Organising Committee, 2011.
CORE rank: C
(document)

[40] L. Torgo and E. Lopes. “Utility-Based Fraud Detection”. In: IJCAI 2011, Proceedings of the 22nd International Joint Conference on Artificial Intelligence. Ed. by T. Walsh. AAAI Press. IJCAI/AAAI, 2011, pp. 1517-1522.
CORE rank: A*
(17 citations) (document) (extra information)

[41] L. Torgo and O. Ohashi. “2D-interval predictions for time series”. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’2011. Ed. by C. A. é, J. Ghosh and P. Smyth. ACM, 2011, pp. 787-794.
CORE rank: A*
(7 citations) (document) (extra information)

[42] O. Ohashi, L. Torgo, and R. P. Ribeiro. “Interval Forecast of Water Quality Parameters”. In: ECAI 2010 - 19th European Conference on Artificial Intelligence. Ed. by H. Coelho, R. Studer and M. Wooldridge. IOS Press, 2010, pp. 283-288.
CORE rank: A
(3 citations) (document)

[43] L. Torgo, W. Pereira, and C. Soares. “Detecting Errors in Foreign Trade Transactions: Dealing with Insufficient Data”. In: 14th Portuguese Conference on Artificial Intelligence, EPIA 2009. Ed. by L. e. al Lopes. LNAI - 5816. Springer, 2009.
CORE rank: B
(3 citations) (document)

[44] L. Torgo and R. P. Ribeiro. “Precision and Recall for Regression.”. In: Discovery Science. Ed. by J. Gama, V. S. Costa, A. M. Jorge and P. Brazdil. Vol. 5808. Lecture Notes in Computer Science. Springer, 2009, pp. 332-346.
CORE rank: None
(56 citations) (document)

[45] R. Ribeiro and L. Torgo. “Utility-based performance measures for regression”. In: Proceedings of the 3rd Workshop on Evaluation Methods for Machine Learning, in conjunction with the 25th International Conference on Machine Learning (ICML 2008). 2008.
CORE rank: A*
(2 citations) (document)

[46] L. Torgo. “Resource-bounded Fraud Detection”. In: Progress In Artificial Intelligence, Proceedings of the 13th Portuguese Conference on Artificial Intelligence Workshops (EPIA 2007). Ed. by J. Neves, M. F. Santos and J. Machado. Vol. 4874. Lecture Notes in Artifical Intelligence. Springer, Dec. 2007, pp. 449-460.
CORE rank: None
(20 citations) (document)

[47] L. Torgo and R. Ribeiro. “Utility-based Regression”. In: Proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 2007). Ed. by K. JN, K. J. de Mántaras RL, S. Matwin, D. Mladenic and A. Skowron. Vol. 4702. Lecture Notes in Artificial Intelligence. Springer, 2007, pp. 597-604.
CORE rank: A
(50 citations) (document)

[48] R. Ribeiro and L. Torgo. “Rule-based Prediction of Rare Extreme Values”. In: Proceedings of the 9th International Conference on Discovery Science (DS’2006). LNAI. Springer, 2006.
CORE rank: None
(5 citations) (document)

[49] L. Torgo and R. Ribeiro. “Predicting Rare Extreme Values”. In: Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’2006). Ed. by W. Ng. Lecture Notes in Artificial Intelligence 3918. Springer, 2006.
CORE rank: A
(5 citations) (document)

[50] R. Ribeiro and L. Torgo. “A Comparative Study on Predicting Algae Blooms in River Douro, Portugal”. In: Proceedings of the V European Conference on Ecological Modelling (ECEM-2005). 2005.
CORE rank: None
(22 citations) (document)

[51] L. Torgo. “Regression Error Characteristic Surfaces”. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2005). Ed. by R. Grossman, R. Bayardo, K. Bennett and J. Vaidya. ACM Press, 2005, pp. 697-702.
CORE rank: A*
(29 citations) (document)

[52] L. Torgo. “The TNT Financial Trading System: a midterm report”. In: Proceedings of the Workshop on Data Mining for Business at ECML/PKDD 2005. 2005.
CORE rank: A
(2 citations) (document)

[53] L. Torgo and J. Marques. “Adapting Peepholing to Regression Trees”. In: Proceedings of the 12th EPIA. LNAI. Springer, 2005.
CORE rank: B
(document)

[54] A. Loureiro, L. Torgo, and C. Soares. “Outlier Detection using Clustering Methods: a data cleaning application”. In: Proceedings of KDNet Symposium on Knowledge-based Systems for the Public Sector. 2004.
CORE rank: None
(185 citations) (document)

[55] R. P. Ribeiro and L. Torgo. “Predicting Harmful Algae Blooms.”. In: EPIA. Ed. by F. Moura-Pires and S. Abreu. Vol. 2902. Lecture Notes in Computer Science. Springer, 2003, pp. 308-312.
CORE rank: B
(11 citations) (document)

[56] A. C. e Silva, A. Jorge, and L. Torgo. “Automatic Selection of Table Areas in Documents for Information Extraction.”. In: EPIA. Ed. by F. Moura-Pires and S. Abreu. Vol. 2902. Lecture Notes in Computer Science. Springer, 2003, pp. 460-465.
CORE rank: B
(7 citations)

[57] A. Silva, A. Jorge, and L. Torgo. “Selection of Table Areas for Information Extraction”. In: Proceedings of the 3rd International Workshop in Document Analysis and its Applications (DLIA 2003). 2003.
CORE rank: None
(2 citations) (document)

[58] L. Torgo and R. Ribeiro. “Predicting Outliers”. In: Proceedings of Principles of Data Mining and Knowledge Discovery (PKDD’03). Ed. by N. Lavrac, D. Gamberger, L. Todorovski and H. Blockeel. 2838 LNAI. Springer, 2003, pp. 447-458.
CORE rank: A
(22 citations) (document)

[59] L. Torgo. “Computationally Efficient Linear Regression Trees”. In: Classification, Clustering and Data Analysis: recent advances and applications (Proc. of IFCS 2002). Ed. by K. Jajuga, A. Sokolowski and H. Bock. Studies in Classification, data analysis, and knowledge organization. Springer, 2002, pp. 409-415.
CORE rank: None
(13 citations) (document)

[60] P. Almeida and L. Torgo. “The Use of Domain Knowledge in Feature Construction for Financial Time Series Prediction”. In: Proceedings of the Portuguese AI Conference (EPIA’01). Ed. by P. Brazdil and A. Jorge. LNAI 2258. Springer, 2001, pp. 116-129.
CORE rank: B
(4 citations)

[61] L. Torgo. “A study on end-cut preference in least squares regression trees”. In: Proceedings of the Portuguese AI Conference (EPIA 2001). Ed. by P. Brazdil and A. Jorge. LNAI 2258. Springer, 2001, pp. 104-115.
CORE rank: B
(7 citations) (document)

[62] L. Torgo. “Efficient and Comprehensible Local Regression”. In: Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2000). Ed. by Terano, Liu and Chen. LNAI 1805. Springer, 2000, pp. 376-379.
CORE rank: A
(3 citations) (document) (extra information)

[63] L. Torgo. “Partial Linear Trees”. In: Proceedings of the 17th International Conference on Machine Learning (ICML 2000). Ed. by P. Langley. Morgan Kaufmann Publishers, 2000, pp. 1007-1014.
CORE rank: A*
(13 citations) (document)

[64] L. Torgo and J. P. Costa. “Clustered Multivariate Regression”. In: Data Analysis, Classification, and Related Methods. Ed. by Kiers, Rasson, Groenen and Schader. Springer, 2000.
CORE rank: None

[65] L. Torgo and J. P. Costa. “Clustered Partial Linear Regression”. In: Proceedings of the Fifth International Workshop on Multistrategy Learning (MSL-2000). Ed. by R. Michalski and P. Brazdil. 2000.
CORE rank: None
(15 citations) (document)

[66] L. Torgo and J. P. Costa. “Clustered Partial Linear Regression”. In: Proceedings of the 11th European Conference on Machine Learning (ECML 2000). Ed. by R. Mantaras and E. Plaza. LNAI 1810. Springer, 2000, pp. 426-436.
CORE rank: A
(8 citations)

[67] L. Torgo. “Predicting the Density of Algae Communities using Local Regression Trees”. In: Proceedings of the European Congress on Intelligent Techniques and Soft Computing (EUFIT’99). 1999.
CORE rank: None
(10 citations) (document)

[68] J. Gama, L. Torgo, and C. Soares. “Dynamic Discretization of Continuous Attributes.”. In: IBERAMIA. Ed. by H. Coelho. Vol. 1484. Lecture Notes in Computer Science. Springer, 1998, pp. 160-169.
CORE rank: None
(43 citations)

[69] L. Torgo. “A Comparative Study of Reliable Error Estimators for Pruning Regression Trees”. In: Proceedings of the Iberoamericam Conference on AI (IBERAMIA-98). Ed. by H. Coelho. 1998.
CORE rank: None
(18 citations) (document)

[70] L. Torgo. “Error Estimates for Pruning Regression Trees”. In: Proceedings of the 10th European Conference on Machine Learning. Ed. by C. Nedellec and C. Rouveirol. LNAI 1398. Springer Verlag, 1998.
CORE rank: A
(14 citations) (document)

[71] L. Torgo. “Functional Models for Regression Tree Leaves”. In: Proceedings of the 14th International Conference on Machine Learning. Ed. by D. Fisher. Morgan Kaufmann Publishers, 1997.
CORE rank: A*
(159 citations) (document)

[72] L. Torgo. “Kernel Regression Trees”. In: Poster papers of the European Conference on Machine Learning (ECML-97). 1997.
CORE rank: A
(40 citations) (document)

[73] L. Torgo and J. Gama. “Search-based Class Discretization”. In: Proceedings of the European Conference on Machine Learning (ECML-97). LNAI 1224. Springer, 1997.
CORE rank: A
(35 citations) (document)

[74] L. Torgo and J. Gama. “Regression by Classification”. In: Advances In Artificial Intelligence, Proceedings of the 13th Brazilian Symposium on Artificial Intelligence (SBIA 1996). Ed. by D. L. Borges and C. A. A. Kaestner. Vol. 1159. Lecture Notes in Artificial Intelligence. Springer, Oct. 1996, pp. 51-60.
CORE rank: None
(62 citations) (document)

[75] L. Torgo. “Applying Propositional Learning to Time Series Prediction”. In: Workshop on Statistics, Machine Learning and Knowledge Discovery in Databases. Ed. by Y. et all Kodratoff. 1995.
CORE rank: None
(5 citations) (document)

[76] L. Torgo. “Data Fitting with Rule-based Regression”. In: Proceedings of the 2nd International Workshop on Artificial Intelligence Techniques (AIT 1995). Ed. by J. Zizka and P. Brazdil. 1995.
CORE rank: None
(26 citations) (document)

[77] L. Torgo. “Controlled Redundancy in incremental Rule Learning”. In: European Conference on Machine Learning (ECML’93). Ed. by P. Brazdil. LNAI 667. Springer-Verlag, 1993, pp. 185-195.
CORE rank: A
(53 citations) (document)

[78] L. Torgo. “Rule Combination in Inductive Learning”. In: European Conference on Machine Learning (ECML’93). Ed. by P. Brazdil. LNAI 667. Springer-Verlag, 1993, pp. 384-389.
CORE rank: A
(16 citations) (document)

[79] P. Brazdil, M. Gams, S. Sian, et al. “Learning in Distributed Systems and Multi-Agent Environments”. In: Machine Learning: EWSL-91 (European Working Session on Learning). Ed. by Y. Kodratoff. Vol. 482. Lecture Notes in Artificial Intelligence. Springer, 1991, pp. 412-423.
CORE rank: None
(72 citations) (document)

[80] L. Torgo and M. Kubat. “Knowledge Integration and Forgetting”. In: Proceedings of the Czechoslovak Conference on Artificial Intelligence. 1991.
CORE rank: None
(4 citations) (document)

Other Publications

[1] L. Torgo. “Uma Breve Introdução à Data Science (in portuguese)”. In: Boletim da APDIO 57 (2017), pp. 9-11.
(document)

[2] C. Magalhães, A. Mucha, F. Carvalho, et al. “Development of an autonomous system for integrated marine monitoring”. In: Proceedings of 41st CIESM Congress. 2016.
(document)

[3] C. Magalhães, J. Séneca, C. Leite, et al. “Distribution and Environmental Controls on Marine Nitrogen Biogeochemical Functions”. In: Proceedings of 41st CIESM Congress. 2016.
(document)

[4] C. Magalhães, C. Lee, M. Monteiro, et al. “Everything is not everywhere: Antarctica Dry Valleys as an extreme counter example”. In: Proceedings of XXXIV SCAR. 2016.

[5] J. Séneca, C. Magalhães, M. Monteiro, et al. “Distribution of prokaryotic communities and NifH gene diversity in the extrem Darwin Mountains, Abtarctica”. In: Proceedings of XXXIV SCAR. 2016.

[6] M. Monteiro, J. Séneca, L. Torgo, et al. “The impact of environmental changes on nitrifyng communities from the Dry Valleys of Antarctica”. In: Proceedings of XXXIV SCAR. 2016.

[7] C. Bartilotti, A. Santos, R. Marques, et al. “Presenting the MarinEye project – Development and validation of a prototype for multitrophic oceanic monitoring”. In: Proceedings of ASC 2016- ICES Anual Science Meeting. 2016.

[8] C. Magalhães, A. Mucha, F. Carvalho, et al. “Development of an autonomous system for integrated marine monitoring”. In: Proceedings of XIX Iberian Symposium on Marine Biology Studies. 2016.
(document)

[9] J. N. van Rijn, V. Umaashankar, S. Fischer, et al. “A RapidMiner extension for Open Machine Learning”. In: RapidMiner Community Meeting and Conference, 2013. 2013.
(8 citations)

[10] L. Torgo. “Inductive Learning of Tree-based Regression Models”. PhD thesis. Faculty of Sciences, University of Porto, 1999.

4.2 Projects

Project Coordination

  • [2020 – 2025]
    Anomaly Detection for Trajectory Data
    NSERC Discovery grants, Principal investigator (PI)
    Budget: 175 kCad

  • [2019 – 2026]
    Spatiotemporal Ocean Data Anlytics
    Tier 1 Canada Research Chair, Principal investigator (PI)
    Budget: 1.4 MCad

  • [2018 – 2021]
    Nitrolimit - Life at the Edge: Define the Boundaries of the Nitrogen Cycle in the Extreme Antarctic Environments
    Portuguese Science Foundation (FCT), 02/SAICT/2017, Co-Principal investigator (PI) (PI: Catarina Magalhaes)
    Budget: 238.5 kEur

  • [2018 – 2019]
    Online Observatory of Economic Activity through Digital Means
    Private funding: ASAE, Ministry of Economy, Portugal - Principal investigator (PI)
    Budget: 68 kEur

  • [2015 – 2018]
    Parfois Product/Shop Sales Forecasting for Supporting Logistics Decisions
    Private funding: Parfois - Principal investigator (PI)
    Budget: 100 kEur

  • [2015 – 2016]
    News Summarizer
    Private funding: SkimIT - Principal investigator (PI)
    Budget: 35 kEur

  • [2011 – 2014]
    e-Policy, Engineering the POlicy-making LIfe CYcle
    EC 7th Framework Programme Theme ICT-2011-7 (EC), Pr. Nr. 288147 - Local coordinator (PI of Portuguese Partner)
    Budget (local partner): 295 kEur; Global budget: 3 MEur

  • [2008 – 2011]
    MORWAQ, Monitoring and Predicting Water Quality Parameters
    Portuguese Science Foundation (FCT), PTDC/EIA/68489/2006 - Principal investigator (PI)
    Budget: 47 kEur

  • [2008 – 2011]
    oRANKI, Resource-bounded Outlier Detection
    Portuguese Science Foundation (FCT), PTDC/EIA/68322/2006 - Principal investigator (PI)
    Budget: 48,5 kEur

  • [2008]
    Exploratory Analysis of Sonae Distribuição Employees Survey (2008)
    MBA Consultores, private funding - Principal investigator (PI)

  • [2007 – 2008]
    Monitoring water quality parameters
    Águas do Douro e Paiva, SA, private funding - Principal investigator (PI)

  • [2004–2006]
    MODAL, Models for Predicting Algae Blooms in River Douro
    Portuguese Science Foundation (FCT), POSI/2000/SRI/40949 - Principal investigator (PI)
    Budget: 33 kEur

  • [2005]
    Exploratory Analysis of Sonae Distribuição Employees Survey (2005)
    MBA Consultores, private funding - Principal investigator (PI)

  • [2003 – 2004]
    Development of a System for Automatic Intraday Trading in Stock Markets
    private funding - Principal investigator (PI)

Project Participation

4.3 Prizes

  • [2017]
    Co-author (author was Vitor Cerqueira) of the paper that won the Best Student Machine Learning Paper Award given by the Machine Learning Journal at the European Conference on Machine Learnig (ECML/PKDD’2017)

  • [2017]
    Supervisor of the PhD thesis of Nuno Moniz entitled “Prediction and Ranking of Highly Popular Web Content” that was awarded the 2nd place in the Fraunhofer Portugal Challenge 2017 competition in the category of PhD theses

  • [2006]
    Co-author (author was Rita Ribeiro) of the paper that won the Best Student Paper Award given by Yahoo! Research Labs at the Discovery Science (DS’06) international conference

  • [1999]
    Runner-up winner at the 3rd International Competition ``Protecting rivers and streams by monitoring chemical concentrations and algae communities’’, organized by ERUDIT in conjunction with COIL, the cluster of four European Research Networks (ERUDIT, EvoNet, MLNet and NeuroNet)

4.4 Supervision

Post-Doctoral Fellows

Ongoing

  • Vitor Cerqueira
    • Title: Mining Time-dependent Data
    • Start: 2020

Finished

  • Paula Branco
    • Title: Utility-based Predictive Analytics
    • 2019
    • Current position: Assistant Professor at University of Ottawa, Canada
  • Colin Bellinger
    • Title: Methods for Handling Adverse Data Properties
    • 2018
    • Current position: Research Officer at National Research Council (NRC), Canada

Ph.D.’s

Ongoing

  • Marvin da Silva
    • Title: TBD
    • Supervisor: Luis Torgo ; co-supervisor: Sageev Oore (Dalhousie University, Canada)
    • PhD on Computer Science, Dalhousie University, Canada
    • Start: 2021
  • Francisco Pascoal
    • Title: Deciphering the roles of the prokaryotic rare biosphere in the Arctic Ocean under climate change
    • Supervisor: Catarina Magalhaes (University of Porto, Portugal); co-supervisors: Luis Torgo and Rodrigo Costa (IST, Portugal)
    • PhD on Biology, Faculty of Sciences, University of Porto, Portugal
    • Start: 2021
  • Md Mahbub Alam
    • Title: Spatiotemporal data analysis with big data
    • Supervisor: Luis Torgo
    • PhD on Computer Science, Faculty of Computer Science, Dalhousie University
    • Start: 2020
  • Luis Roque
    • Title: Deep Probabilistic Models in Hierarchical Time Series
    • Supervisor: Luis Torgo; co-supervisor: Carlos Soares
    • PhD Program Faculty of Engineering/UPorto
    • Start: 2020
  • Mariana Oliveira
    • Title: Predictive Analytics for Dependent Data
    • Supervisor: Luis Torgo; Co-Supervisor: Vitor Santos Costa
    • MAPi PhD Program, Universities of Aveiro, Minho and Porto
    • Start: 2017
  • Nuno Guimarães
    • Title: Analyzing and Developing Veracity Indicators for Building an Automatic Detector of False Online News
    • Supervisor: Álvaro Figueira; co-supervisor: Luis Torgo
    • PhD on Computer Science, Faculty of Sciences, University of Porto
    • Start: 2017

Finished

  • Mohammad Etemad
    • Title: Segmentation Algorithms for Trajectory Data
    • Supervisor: Stan Matwin; co-supervisor: Luis Torgo
    • PhD on Computer Science, Faculty of Computer Science, Dalhousie University
    • October/2020
  • Vitor Cerqueira
    • Title: Ensembles for Time Series Forecasting
    • Supervisor: Luis Torgo; Co-Supervisor: Carlos Soares
    • PhD Program Faculty of Engineering/UPorto
    • December/2019
    • Current position: Post Doctoral Fellow at Dalhousie University, Canada
  • Paula Branco
    • Title: Utility-based Predictive Analytics
    • Supervisor: Luis Torgo; Co-Supervisor: Rita Ribeiro
    • MAPi PhD Program, Universities of Aveiro, Minho and Porto
    • September/2018
    • Current position: Assistant Professor at University of Ottawa, Canada
  • Nuno Moniz
    • Title: Prediction and Ranking of Highly Popular Web Content
    • Supervisor: Luis Torgo
    • PhD on Computer Science, Faculty of Sciences, University of Porto
    • July/2017
    • Awarded the 2nd place in the Fraunhofer Portugal Challange
    • Current position: Post Doctoral Fellow at the Laboratory of Artificial Intelligence and Decision Support (LIAAD - INESC Tec), and an Invited Professor at the Faculty of Sciences of the University of Porto, Portugal
  • Brett Drury
  • Orlando Ohashi
    • Title: Spatio-Temporal Prediction Methods
    • Supervisor: Luis Torgo
    • MAPi PhD Program, Universities of Aveiro, Minho and Porto
    • December/2012
    • Current position: Adjunct Professor at UFRA, Brazil
  • Rita Ribeiro
    • Title: Utility-based Regression
    • Supervisor: Luis Torgo
    • PhD on Computer Science, Faculty of Sciences, University of Porto
    • September/2011
    • Current position: Assistant Professor at the Department of Computer Science of the Faculty of Sciences of the University of Porto, Portugal
  • Pedro Almeida
    • Title: Previsão do Comportamento de Séries Temporais Financeiras com Apoio de Conhecimento Sobre o Domínio
    • Supervisor: Luis Torgo
    • Doutoramento em Engenharia Informática, Universidade da Beira Interior
    • April/2003
    • Current position: Assistant Professor at Beira Interior University, Portugal

M.Sc.’s

Ongoing

  • Vishnu Kandimalla
    • Title: Integrating Multiple Deep Learning Models to Track and Classify at Risk-Fish Species
    • Supervisor: Luis Torgo ; co-supervisor: Cristopher Widen
    • MSc on Computer Science, Faculty of Computer Science, Dalhousie University
    • Start: 2020
  • Amruth Kuppili
    • Title: Forecasting meteorological variables and anticipating climatic aberrations of an oceanic Buoy using a neighbour buoy
    • Supervisor: Luis Torgo
    • MSc on Computer Science, Faculty of Computer Science, Dalhousie University
    • Start: 2020

Finished

  • Deepan Shankar
    • Title: Forecasting Algae Blooms using Data from Mussels’ Openings
    • Supervisor: Luis Torgo
    • MSc on Computer Science, Faculty of Computer Science, Dalhousie University
    • Apr/2021
  • Inês Areosa
    • Title: Visual Tools for Understanding Regression Performance
    • Supervisor: Catarina Magalhães; Co-Supervisors: Pedro Duarte and Luis Torgo
    • Master of Science Degree in Aerospace Engineering, IST, Portugal
    • Nov/2019
    • Current position: Research and Data Analyst Specialist at GVC Italia, Lebanon
  • Antonio Gaspar Goncalves de Sousa
  • Carlos Leite
    • Title: Domain Oriented Biclustering Validation
    • Supervisor: Luis Torgo; Co-Supervisor: Catarina Magalhães
    • Masters on Computer Science, Faculty of Sciences, University of Porto
    • 30/Nov/2016
    • Grade: 19 out of 20
    • Current position: Data Scientist at Facebook, England
  • Nuno Guimarães
  • Mariana Oliveira
  • Luís Baía
  • Paula Branco
  • Fernando Correia
  • João Cepêda
  • Pedro Coelho
  • Hélia Costa
  • Raquel Santos
    • Title: Modelos de Regressão para a Previsão de Vendas e de Clientes
    • Supervisor: Luis Torgo; Co-Supervisor: Luis Marques (SONAE)
    • Masters in Engineering Mathematics, Faculty of Sciences, University of Porto
    • 2010
  • Pedro Duarte
    • Title: Service-Oriented Architectures
    • Supervisor: Paulo Martins (Critical); Co-Supervisor: Luis Torgo
    • Masters in Network and Information Systems Engineering, Faculty of Sciences, University of Porto
    • 2010
  • Clara Gonçalves
    • Title: Modelos de Regressão com Análise Classificatória
    • Supervisor: Joaquim Pinto da Costa ; Co-Supervisor: Luis Torgo
    • Masters in Engineering Mathematics, Faculty of Sciences, University of Porto
    • 2005
  • Jorge Barbosa
    • Title: Métodos para lidar com Mudanças de Regime em Séries Temporais Financeiras
    • Supervisor: Luis Torgo
    • Master in Data Analysis and Decision Support Systems, Faculty of Economics, University of Porto
    • 2005
  • Joana Marques
    • Title: Um estudo sobre a eficiência computacional da construção de árvores de regressão
    • Supervisor: Luis Torgo
    • Masters in Artificial Intelligence and Computation, Faculty of Economics, University of Porto
    • 2004
  • Rita Ribeiro
    • Title: Modelos de Previsão de Fenómenos Raros
    • Supervisor: Luis Torgo
    • Masters in Artificial Intelligence and Computation, Faculty of Economics, University of Porto
    • 2003
  • Ana Silva
  • Mário Oldemiro
    • Title: Técnicas de Inteligência Artificial Aplicadas à Previsão de Séries Temporais Financeiras
    • Supervisor: Luis Torgo; Co-supervisor: Pavel Brazdil
    • Master in Data Analysis and Decision Support Systems, Faculty of Economics, University of Porto
    • 2002
  • César Rocha
  • Sílvia Amorim

4.5 Organization of Events

4.6 Scientific Reviewing

Academic Juris

  • [2021]
    MSc. Thesis of Fatemeh Rahimi - MTLV: A Library for building deep multi-task learning architectures, Masters of Computer Science, Faculty of Computer Science, Dalhousie University, Canada

  • [2021]
    PhD Thesis of Xiang Jiang - Learning Adaptive Deep Representations for Few-to-Medium Shot Image Classification, PhD of Computer Science, Faculty of Computer Science, Dalhousie University, Canada

  • [2020]
    MSc. Thesis of Tamir Elmasri - Prediction Based Web Application for Customer Opinion Product Analysis using Twitter’s API, MEC, Faculty of Computer Science, Dalhousie University, Canada

  • [2020]
    MSc. Thesis of Olashile S. Adebimpe - Time-series forecasting using feature based hybrid approach, Masters of Computer Science, Faculty of Computer Science, Dalhousie University, Canada

  • [2017]
    Ph.D. Thesis of Pedro Saleiro - Entity-Specific Text Mining for Online Reputation Monitoring, University of Porto, Portugal

  • [2017]
    Ph.D. Thesis of Davi D’Andréa Baccan - Contributions of Computational Cognitive Modeling to the Understanding of the Financial Markets, University of Coimbra, Portugal

  • [2015]
    Ph.D. Thesis of Vinay Uday Prabhu - Network Aided Classification and Detection of Data, Carnegie Mellon University / MAP-I Doctoral Program, Pittsburgh, USA

  • [2013]
    Ph.D. Thesis of Ricardo Nuno Taborda Campos - Disambiguating Implicit Temporal Queries for Temporal Information Retrieval Applications, PhD on Computer Science, Faculty of Sciences, University of Porto, Portugal

  • [2012]
    Ph.D. Thesis of Nuno Constantino Castro - Time Series Motif Discovery, MAP-I Doctoral Program, University of Minho, Portugal

  • [2012]
    MSc. Thesis of Nuno Moniz - Bridging the gap between closed and open data, System proposal for the Portuguese Legislation, Masters on Computer Engineering, specialization in Networks, Architectures and Systems , ISEP, Portugal

  • [2011]
    Ph.D. Thesis of Rui Barbosa - Agents in the Market Place, University of Minho, Portugal

  • [2008]
    Ph.D. Thesis of Pedro Rafael de Ruiz Graça - Aprendizagem Interactiva em Sistemas Multi-Agente, University of Lisbon, Portugal

  • [2008]
    Ph.D. Thesis of Anneleen Van Assche - Improving the Applicability of Ensemble methods in Data Mining, Katholieke Universiteit Leuven, Belgium

  • [2007]
    Ph.D. Thesis of Pedro Gabriel Dias Ferreira - Sequence Pattern Mining in Biochemical Data, University of Minho, Portugal

  • [2005]
    Ph.D. Thesis of Kwok Pan Pang - Statistics for Structural Break Detection and Their Application to Forecasting and Statistical Process Control, Monash University, Australia

  • [2004]
    MSc. Thesis of Susana Pereira - Análise de Séries Temporais no Domínio das Telecomunicações Móveis, Masters on Statistics and Information Managment, ISEGI, New University of Lisbon, Portugal

  • [2003]
    Ph.D. Thesis of Vitor Lobo - Ship Noise Classification, a contribution to prototype based classifier design, New University of Lisbon, Portugal

  • [2003]
    MSc. Thesis of Raul Moisão - Modelo Predictivo, Baseado em Redes Neuronais, para Previsão em Séries Temporais com Origem em Sistemas Financeiros,New University of Lisbon, Portugal

Research projects

  • [2017]
    European Comission, Expert Reviewer (EX2017D300375), Review of 4 proposals for the H2020-MSCA-IF-2017 (Horizon 2020 Marie Sklodowska-Curie Actions - Individual Fellowships)

  • [2016]
    KU Leuven, Belgium. Review of one project proposal.

  • [2011 – 2014]
    FIRST - Large scale information extraction and integration infrastructure for supporting financial decision making. EC Seventh Framework Programme, project nr. 257928. Member of the Advisory Board.

  • [2013]
    Czech Science Foundation – GACR. Review of one project proposal.

  • [2011]
    Czech Science Foundation – GACR. Review of two projects proposals.

  • [2010]
    Czech Science Foundation – GACR. Review of one project proposal.

Editorial Boards of Journals

  • Data Mining and Knowledge Discovery, Springer.
    Member of the editorial board.

  • Mathematics, MDPI.
    Member of the editorial board.

  • Intelligent Data Analysis, IOS Press.
    Member of the editorial board.

Journals

  • Data Mining and Knowledge Discovery, Springer.
    Reviewing of 17 submissions.

  • Journal of Machine Learning Research.
    Reviewing of 5 submissions.

  • Machine Learning Journal, Kluwer Academic Publishers.
    Reviewing of 17 submissions.

  • IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
    Reviewing of 5 submissions.

  • IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Computer Society.
    Reviewing of 1 submission.

  • Journal of Artificial Intelligence Research, Morgan Kaufmann.
    Reviewing of 1 submission.

  • Decision Support Systems, Elsevier.
    Reviewing of 2 submissions.

  • Neural Computing and Applications, Springer.
    Reviewing of 1 submission.

  • Neural Networks, Elsevier.
    Reviewing of 1 submission.

  • Neurocomputing, Elsevier.
    Reviewing of 3 submission.

  • Intelligent Data Analysis, Elsevier Science.
    Reviewing of 3 submission.

  • Expert Systems, Wiley.
    Reviewing of 1 submission.

  • International Journal of Human-Computer Studies, Elsevier Science.
    Reviewing of 1 submission.

  • AI Communications, IOS Press.
    Reviewing of 1 submission.

  • International Journal on Artificial Intelligence Tools, World Scientific.
    Reviewing of 2 submission.

International Conferences

  • KDD, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
    2021 (PC member), 2009 (PC member), 2007 (PC member)

  • ICML, International Conference on Machine Learning
    2021 (PC member), 2020 (PC member), 2016 (PC member), 2011 (PC member), 2010 (PC member), 2009 (Area Chair), 2008 (Area Chair), 2004 (PC member)

  • ECML, European Conference on Machine Learning
    2018 (Area Chair), 2017 (Area Chair), 2016 (PC member), 2015 (Area Chair), 2014 (PC member), 2013 (PC member), 2012 (Area Chair), 2011 (Area Chair), 2010 (PC member), 2009 (PC member), 2008 (PC member), 2007 (Area Chair), 2006 (Area Chair), 2005 (Area Chair), 2004 (PC member), 2003 (PC member)

  • ICDM, IEEE International Conference on Data Mining
    2007 (PC member), 2006 (PC member), 2005 (PC member), 2004 (PC member)

  • PKDD, European Conference on Principles and Practice of Knowledge Discovery in Databases
    2018 (Area Chair), 2017 (Area Chair), 2016 (PC member), 2015 (Area Chair), 2014 (PC member), 2013 (PC member), 2012 (Area Chair), 2011 (Area Chair), 2010 (PC member), 2009 (PC member), 2008 (PC member), 2007 (Area Chair), 2006 (Area Chair), 2005 (Program Chair), 2004 (PC member), 2003 (PC member)

  • PAKDD, Pacific-Asia Conference on Knowledge Discovery and Data Mining
    2009 (PC member), 2008 (PC member), 2007 (PC member)

  • DS, International Conference on Discovery Science
    2019 (PC member), 2010 (PC member), 2009 (PC member), 2008 (PC member), 2007 (PC member)

  • AAAI, Conference on Artificial Intelligence
    2018 (PC member), 2017 (PC member), 2015 (PC member), 2014 (PC member)

  • IJCAI, International Joint Conference on Artificial Intelligence
    2021 (Area Chair), 2020 (PC member), 2019 (PC member), 2017 (PC member), 2013 (Senior PC member), 2011 (Senior PC member)

  • ECAI, European Conference on Artificial Intelligence
    2014 (Senior PC member), 2012 (PC member), 2010 (PC member)

  • SDM21, SIAM International Conference on Data Mining
    2021 (PC member)

  • NIPS, Annual Conference on Neural Information Processing Systems
    2014 (PC member)

  • ACML, Asian Conference on Machine Learning
    2012 (PC member)

  • UseR, The R User Conference
    2013 (PC member)

  • SAC, ACM Symposium on Applied Computing
    2005 (PC member)

  • ADMA, International Conference on Advanced Data Mining and Applications
    2008 (PC member), 2007 (PC member), 2006 (PC member), 2005 (PC member)

  • IBERAMIA, Iberoamerican Conference on Artificial Intelligence
    2002 (PC member), 2000 (PC member), 1998 (PC member)

  • EPIA, Portuguese AI Conference
    2005 (PC member), 2003 (PC member), 2001 (PC member)

  • SBIA, Brazilian Symposium on Artificial Intelligence
    2008 (PC member), 2004 (PC member)

  • ENIA, Brazilian Meeting on Artificial Intelligence
    2011 (PC member), 2007 (PC member)

4.7 Software

  • [2016]
    DMwR2 - an R package with functions and data for the 2nd edition of “Data Mining with R”
    GitHub project page

  • [2016, co-author]
    UBL - an R package for utility-based predictive analytics
    GitHub project page

  • [2014]
    performanceEstimation - an R package for estimating the performance of predictive models
    GitHub project page

  • [2010]
    DMwR - an R package with functions and data for the 1st edition of “Data Mining with R”

  • [2005]
    TNT - an autonomous trading system for financial markets

  • [2001]
    CLRT - Clustered regression models

  • [1999]
    RT - Tree-based regression models

  • [1997]
    C library for propositional learning (in conjuntion with João Gama)

  • [1996]
    RECLA - Regression through classification

  • [1996]
    KERTI - Kernel regression trees

  • [1996]
    EcoTerme - Calculus of thermic behaviour of buildings

  • [1995]
    R2 - regression rules learner

  • [1995]
    YAP-Prolog library for propositional learning

  • [1993]
    YAILS - Incremental learning of classification rules

  • [1991]
    INTEG - Knowledge integration system

4.8 Visits to Research Labs

  • [Jan/2018] One week visit to the Jozef Stefan Institute, Ljubljana, Slovenia. Host: Prof. Igor Mozetic

  • [Jan/2017] One week visit to the Jozef Stefan Institute, Ljubljana, Slovenia. Host: Prof. Igor Mozetic

  • [Sep-Dec/2015]
    Four months visit to the Weka research lab at University of Waikato, New Zealand. Host: Prof. Bernhard Pfahringer.

  • [July/2012]
    Two weeks visit to the Text Analysis and Machine Learning (TAMALE) research lab at University of Ottawa, Canada. Host: Prof. Stan Matwin.

  • [May/2012]
    Two weeks visit to the Weka research lab at University of Waikato, New Zealand. Host: Prof. Bernhard Pfahringer.

  • [Jun/2010]
    One month visit to the Department of Informatics, University degli Studi di Bari, Italy. Host: Prof. Donato Malerba

  • [Fev-Jul/2008]
    Six months visit to the Weka research lab at University of Waikato, New Zealand. Host: Prof. Bernhard Pfahringer.

  • [Mar-Aug/2004]
    Five months visit to the Stern Business School of the University of New York. Hosts: Profs. Foster Provost and Vasant Dhar

  • [1994]
    Three months visit to the University of São Paulo, campus São Carlos, Brazil. Host: Prof. Carolina Monard

4.9 Invited Seminars

  • [Mar/2021] Time Series Forecasting: some challenges and possible solutions, DaSSWeb, Data Science and Statistics Webinar, Porto, Portugal.

  • [Sep/2019] Adressing the Data Revolution, Engineers Nova Scotia Annual Conference, Halifax, Canada.

  • [Jun/2018] Predictive Analytics and the Ocean, H2O Conference, Oceans Week, Halifax, Canada.

  • [May/2018] Utility-based Regression, COST’2018, San Diego, USA.

  • [Jan/2018] Arbitrage of Forecasting Experts, Jozef Stefan Institute, Slovenia.

  • [July/2017] Data Pre-processing Methods for Forecasting with Spatio-Temporal Data, invited talk at the international conference Data Science, Statistics and Visualization, Lisbon, Portugal

  • [June/2017] Handling Imbalanced Regression Tasks through Utility- based Regression, invited seminar at Université de Fribourg, Fribourg, Switzerland

  • [May/2017] An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R, invited talk at SER, Niteroi, Rio de Janeiro, Brazil

  • [Jan/2017] Resampling Approaches for Handling Imbalanced Regression Tasks, Jozef Stefan Institute, Slovenia.

  • [Dec/2015]
    An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R, University of Waikato, New Zealand

  • [Sep/2015]
    Feature Engineering for Handling Spatial and Spatio-Temporal Forecasting, University of Waikato, New Zealand

  • [Jun/2015]
    The R Language - programming for data analysis, Join 2015, Braga, Portugal

  • [May/2015]
    Data Mining aplicado à Previsão de Blooms de Algas, Workshop Aquacultura 2015, Porto, Portugal

  • [May/2015]
    Data Science - what, why and how?, Porto Tech Hub, Porto, Portugal

  • [Mar/2015]
    An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R, Porto R Users Group (PRUG), Porto, Portugal

  • [Jan/2015]
    An Infra-Structure for Performance Estimation and Experimental Comparison of Predictive Models in R, LIAAD Seminars, INESC Tec, Portugal

  • [Nov/2014]
    Monitoring and Forecasting Rare Events, Workshop INESC/CIIMAR, INESC Tec, Portugal

  • [Jun/2014]
    Dynamic Documents in R, DCC talks, FCUP/UPorto, Portugal

  • [Jan/2014]
    Spatio-temporal data mining and extreme behavior data mining, LIAAD Open-Day, INESC Tec, Portugal

  • [Jul/2013]
    Spatial Interpolation using Multiple Regression, University of Konstanz, Germany

  • [Feb/2013]
    Data Mining para a Deteção de Fraude, INESC Tec, Portugal

  • [May/2012]
    Modeling Deviations from Expected Behavior - two case studies, University of Waikato, New Zealand

  • [Out/2011]
    Modeling Deviations from Expected Behavior - two case studies, Back2Basics Seminar Series, Faculty of Engineering, University of Porto, Portugal

  • [Out/2011]
    Modeling Deviations from Expected Behavior - two case studies, Thought Leader Speaker Series, eBay Research Labs, San Jose, USA

  • [Jul/2011]
    Modelos de Previsão para Sistemas Dinâmicos Complexos, Seminários em Engenharia de Sistemas, University of Minho, Portugal

  • [Jun/2010]
    Resource-bounded Outlier Detection using Clustering Methods, Department of Informatics, University degli Studi di Bari, Italy.

  • [Jan/2009]
    Using Data Mining for Resource-aware Fraud Detection, Workshop on Data Mining for the Banking System, Faculty of Economics, University of Porto, Portugal

  • [Mar/2008]
    Utility-based Regression - recent developments, University of Waikato, New Zealand

  • [Jan/2008]
    Utility-based Regression - recent developments, Katholieke Universiteit Leuven, Belgium

  • [Jan/2007]
    Predicting Rare Extreme Values - recent developments, Solomon Seminars, Josef Stefan Institute, Slovenia

  • [Jan/2006]
    Non-Uniform Cost Surfaces for Predicting Rare Extreme Values, Solomon Seminars, Josef Stefan Institute, Slovenia

  • [Jan/2006]
    Regression Error Characteristic Surfaces, Solomon Seminars, Josef Stefan Institute, Slovenia

  • [Sep/2004]
    An autonomous trading system, International Summer School on Data Analysis, Instituto Superior de Gestão, Portugal

  • [Jan/2004]
    An intraday Autonomous Trading System, Faculty of Economics, University of Porto, Portugal

  • [Nov/2003]
    Mining DNA microarray data: techniques and applications, Instituto de Biologia Molecular e Celular, Porto, Portugal

  • [Jun/2003]
    Models for Predicting Water Quality, Jornadas em Informática (JOIN’03), University of Minho, Portugal

  • [Out/2002]
    Artificial Intelligence: from fiction to reality, 2 Ciclo de Conferências em Cibercultura, Guarda, Portugal

4.10 Service to the Community