Course Description and Goals:

Statistical learning is a field that teaches students how to analyze and interpret data by applying statistical methods and machine learning algorithms to uncover patterns, make predictions, and gain insights from data. The syllabus includes:

• Statistical and machine learning methods, including linear and polynomial regression, logistic regression, and linear discriminant analysis.

• Model validation techniques such as cross-validation and bootstrap, model selection, and regularization methods (ridge and lasso).

• Nonlinear models, splines, and generalized additive models.

• Tree-based methods, including random forests and boosting.

• Support-vector machines and an introduction to causal inference.

• Unsupervised learning methods such as principal components analysis and clustering (k-means and hierarchical).

Examination Format: Report and Presentation (obligatory to receive participaation certificate)

General Information University: University of Duisburg-Essen

• Learning Platform: Moodle (Link: https://lehre.moodle.uni-due.de/course/view.php?id=1986)

• Hybrid Format: Yes, the course includes both in-person and online sessions via Zoom

• Study Program and Level: Master’s/PhD students

• Weekly Hours: The event consists of three block sessions (lecture with tutorial sessions) with two additional dates for tutorials:

• First session: 02.04. (9:00–14:00)

• Following sessions: 03.04. & 04.04. (9:00–17:00)

• Online session: 07.04.

• Final lecture: 25.04. (9:00–14:00)

• Language: German or English (depending on student preference)

Further information, including locations, and Zoom links, can be found on our homepage: Statistical Learning SS 25

Related Posts

How To: Open Science

How To: Open Science

Tired of Recreating someone else’s work? - How Open Science can accelerate research and overcome reinvention

Have you ever found papers on algorithms but their implementation is missing? Found an interesting analysis but there is no way to check the results, as you don’t have access to the data they were derived from? Ever thought you had a great idea for a project, just to find out a year later that you are not the only research group following that specific idea? Not having access to other people’s code, data, metrics or even their plans for research projects often leads to unnecessary delays and scientific redundancies. There is an easy solution to overcome (almost) all of these issues. It’s called Open Science! What is Open Science? The UNESCO defines Open Science as a construct of “movements and practices aiming to make multilingual scientific knowledge openly available, accessible and reusable for everyone, to increase scientific collaborations and sharing of information for the benefits of science and society, and to open the processes of scientific knowledge creation, evaluation and communication to societal actors […]”. To ensure that everyone has access to scientific knowledge and infrastructure, Open Science focuses on four main concepts.

Read More
A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

There are a lot of reasons why collaborating with other researchers on scientific projects is great! It provides new perspectives and gives you the chance to benefit from other people’s knowledge and input. When it comes to sharing and exchanging data across multiple locations and devices however, researchers are often disoriented and don’t know which tools, cloud services and so on are safe to share data in a secure and ethical way.

Read More
How To: Good Scientific Practice

How To: Good Scientific Practice

“Scientific integrity forms the basis for trustworthy research”, so it says in the Guidelines for Safeguarding Good Research Practice of the DFG, the German Research Foundation. As a major funder of research in Germany the DFG, as well as many other funders of research in Germany and the European Union, requires researchers to follow a certain set of rules conducting their research. These rules are called “good scientific practice” and have to be followed by researchers to be viable for funding. According to the guidelines researchers are required to “document all information relevant to the production of a research result as clearly as is required by and is appropriate for the relevant subject area to allow the result to be reviewed and assessed”. But good scientific practice is not done by documenting your research. It also includes i.a. protecting the personality rights of your subjects and handling research data in an appropriate manner by e.g. “back(-ing) up research data and results made publicly available, as well as the central materials on which they are based and the research software used, by adequate means according to the standards of the relevant subject area, and retain them for an appropriate period of time.” This is where Research Data Management (RDM) comes in. Of course RDM is much more than just creating a backup of your data on a USB-Stick and handing it over to anyone asking for it. “Good scientific practice” in RDM follows the FAIR principles:

Read More