Speakers • STAT of ML 2021

Non-fungible tokens and VizTech

Language: en

Authors: Wolfgang Karl Härdle

Abstract: TBA

Opening

Language: en

Closing

Language: en

Wolfgang Karl Härdle Professor, Humboldt-Universität zu Berlin

Probabilistic Forecasting with Machine Learning and Big Data

Language: en

Authors: Lubos Hanus and Jozef Barunik

Abstract: e propose a distributional deep learning approach to probabilistic forecasting of economic time series. Being able to learn complex patterns from large amount of data, deep learning methods are useful for a decision making that depends on uncertainty of possibly large number of economic outcomes. Such predictions are also informative to decision makers facing asymmetric dependence of their loss on outcomes from possibly non-Gaussian and non-linear variables. We show the usefulness of the approach on the three distinct problems. First, we use deep learning to construct data-driven macroeconomic fan charts that reflect information contained by large number of variables. Second, we obtain uncertainty forecasts of irregular traffic data. Third, we illustrate gains in prediction of stock return distributions which are heavy tailed and suffer from low signal-to-noise ratio.

Opening

Language: en

Closing

Language: en

Jozef Barunik Associate Professor, Academy of Sciences and Charles University

Jozef Baruník is an Associate Professor at the Institute of Economic Studies, Charles University in Prague. He also serves as a head of the Econometrics department at the Czech Academy of Sciences. In his research, he develops mathematical models for understanding financial problems (such as measuring and managing financial risk), develops statistical methods and analyzes financial data. Especially, he is interested in asset pricing, high-frequency data, financial econometrics, machine learning, high-dimensional financial data sets (big data), and frequency domain econometrics (cyclical properties and behavior of economic variables).

Portfolio optimization using distortion risk measures via linear programming

Language: en

Authors: Milos Kopa

Abstract: TBA

Opening

Language: en

Closing

Language: en

Milos Kopa Associate Professor, Charles University

Tree Models for Data Driven Decision Making and Optimization

Language: en

Authors: Dolores Romero Morales

Abstract: TBA

Dolores Romero Morales Professor, Copenhagen Business School

Monitoring network changes in social media

Language: en

Authors: Cathy Yi-Hsuan Chen, Yarema Okhrin, Tengyao Wang

Abstract:

Cathy Yi-Hsuan Chen Professor, Adam Smith Business School, University of Glasgow

Bankruptcy prediction for privately held SME's with implications on banks profitability

Language: en

Authors: Florentina Paraschiv

Abstract:

Florentina Paraschiv Professor, Zeppelin University and Norwegian University of Science and Technology

Empirical Risk Minimization for Time Series: Nonparametric Performance Bounds for Prediction

Language: en

Authors: Christian Brownlees and Jordi Llorens-Terrazas

Abstract: Empirical risk minimization is a standard principle for choosing algorithms in learning theory. In this paper we study the properties of empirical risk minimization for time series. The analysis is carried out in a general framework that covers different types of forecasting applications encountered in the literature. We are concerned with 1-step-ahead prediction of a univariate time series generated by a parameter-driven process. A class of recursive algorithms is available to forecast the time series. The algorithms are recursive in the sense that the forecast produced in a given period is a function of the lagged values of the forecast and of the time series. The relationship between the generating mechanism of the time series and the class of algorithms is unspecified. Our main result establishes that the algorithm chosen by empirical risk minimization achieves asymptotically the optimal predictive performance that is attainable within the class of algorithms.

Christian Brownlees Professor, Universitat Pompeu Fabra

Recursive multivariate volatility forecasts for large portfolios

Language: en

Authors: Cipra, T., Hendrych, R.

Abstract: TBA

Tomáš Cipra Professor, Charles University

Testing structural breaks in large dynamic models

Language: en

Authors: Z.Praskova

Abstract: TBA

Zuzana Prášková Professor, Charles University

Value at Risk Approach to Producer’s Best Response in Electricity Market with Uncertain Demand

Language: en

Authors: M.Branda

Abstract: TBA

Martin Branda Assistant Professor, Charles University

Marine Fuel Hedging Under the Sulphur Cap Regulations

Language: en

Authors: Frantisek Cech and Michal Zitek

Abstract: We argue that marine fuels consumers and producers can reduce the uncertainty of their portfolios under the environmental regulations aimed at air pollution re- duction. Our results show that uncertainty can be reduced up to 72%. We also identify Gasoil futures as the universal hedging instrument to manage uncertainty.

Frantisek Cech Assistant Professor, Academy of Sciences and Charles University

Monitoring a developing pandemic with available data

Language: en

Authors: Jens Perch Nielsen

Abstract: TBA

Jens Perch Nielsen Professor, Bayes Business School

Actuary from Copenhagen and statistician from UC-Berkeley. Worked as appointed actuary in his young days and led various product development departments before specialising in research and development. He became research director of RSA with responsibilities in life as well as non-life in 1999. From 2006 until 2012 he worked as an entreprenuer and he is still co-owner and board member of Copenhagen based ScienceFirst, London based Operational Science and Cyprus based Emergent. He is co-author of more than 100 scientific papers in reviewed journals of actuarial science, economics, econometrics and statistics and also one book on quantitative operational risk modelling and associate editor of a number of journals.

A Machine Learning Approach to Quantifying the Quality of Real Estate Descriptions

Language: en

Authors: Joakim Olsen, Arild Brandrud Næss, Pierre Lison

Abstract: This paper explores how to automatically measure the quality of human-generated summaries, based on a Norwegian corpus of real estate condition reports and their corresponding summaries. The proposed approach proceeds in two steps. First, the real estate reports and their associated summaries are automatically labelled using a set of heuristic rules gathered from human experts and aggregated using weak supervision. The aggregated labels are then employed to learn a neural model that takes a document and its summary as inputs and outputs a score reflecting the predicted quality of the summary. The neural model maps the document and its summary to a shared summary content space and computes the cosine similarity between the two document embeddings to predict the final summary quality score. The best performance is achieved by a CNN-based model with an accuracy (measured against the aggregated labels obtained via weak supervision) of 89.5%, compared to 72.6% for the best unsupervised model. Manual inspection of examples indicate that the weak supervision labels do capture important indicators of summary quality, but the correlation of those labels with human judgements remains to be validated. Our models of summary quality predict that approximately 30% of the real estate reports in the corpus have a summary of poor quality.

Arild Brandrud Næss Associate professor of statistics, NTNU Business School

Implied volatility smoothing at COVID-19 times

Language: en

Authors: Sebastiano Vitali

Abstract: This work aims at studying the impact of the SARS-CoV-2 pandemic on the global financial markets. In particular, such impact is analysed through the changes of the shape of the implied volatility smile of the options written on several equity indexes and on several stocks. The implied volatility function is estimated using the market-based information of liquid options and applying a semi-parametric smoothing technique that exploits a kernel function and no-arbitrage conditions. Such approach is applied to an extensive set of data to study the evolution of the implied volatility functions through the months of the pandemic. We show, in several cases, a sudden and massive change in the shape of the implied volatility functions.

Sebastiano Vitali Assitant Professor, Charles University

Uniform confidence bands for generalised random forests estimates

Language: en

Authors: Kainat Khowaja

Abstract: In this paper, we show that the generalised random forest estimate, as function of x, is uniformly converging to true function. Athey et al (2020) show in their paper on Generalised Random Forests(GRF) that an estimate obtained from GRF at the point is asymptotically normal. Given a data set at hand, we find a critical value such for the uniform confidence bands using multiplier bootstrap, since it is well known that the standard approach via extreme value theory have a very slow asymptotic. We also demonstrate the construction of UCB on the same example given in the study of Athey et al (2020).

Kainat Khowaja PhD student, Humboldt-Universität zu Berlin

Sentiment-driven cryptocurrency market analysis

Language: en

Authors: Anna Shchekina

Abstract:

Anna Shchekina PhD student, Humboldt-Universität zu Berlin

Quantinar Data - Science - Education

Language: en

Authors: Julian Winkel

Abstract: Living in the Information Age, the power of data and statistics has never been more prevalent. Academics, architects, medical doctors, journalists, lawyers, programmers and many other professionals nowadays require an accurate application of statistical methods. Instead many branches are subject to a crisis of integrity, which is shown in improper use of statistical models, p-hacking, HARKing or failure to replicate results. We propose the use of a peer-to-peer education network, Quantinar, to spread statistical knowledge embedded with code in the form of Quantlets.

Julian Winkel PhD student, Humboldt-Universität zu Berlin

FRM@Asia an ML Approach

Language: en

Authors: Ruting Wang

Abstract:

Ruting Wang PhD student, Humboldt-Universität zu Berlin

Quantile Portfolio Optimization

Language: en

Authors: Martin Hronec and Jozef Barunik

Abstract: TBA

Martin Hronec PhD student, Academy of Sciences and Charles University

Distributional Assymetries and Currency Risk

Language: en

Authors: Josef Kurka

Abstract: TBA

Josef Kurka PhD student, Academy of Sciences and Charles University

Tail Risks, Investment Horizons, and Asset Prices

Language: en

Authors: Matej Nevrla

Abstract: TBA

Matej Nevrla PhD student, Academy of Sciences and Charles University

Deep Reinforcement Learning in Asset Pricing

Language: en

Authors: Lenka Nechvatalova

Abstract: TBA

Lenka Nechvatalova PhD student, Academy of Sciences and Charles University

A Data-driven Explainable Case-based Reasoning for Bankruptcy Prediction

Language: en

Authors: Wei Li

Abstract: TBA

Wei Li PhD student, Norwegian University of Science and Technology

Crypto volatility and blockchain mechanism

Language: en

Authors: Min-Bin Lin

Abstract: TBA

Min-Bin Lin PhD student, Humboldt-Universität zu Berlin

Halfspace depth for general measures

Language: en

Authors: Petra Laketa, Stanislav Nagy and Dusan Pokorny

Abstract: Let $R^d$ be the $d$-dimensional Euclidean space and $mu$ a finite Borel measure on $R^d$. A halfspace depth of a given point $x$ with respect to $mu$ is defined as the infimum of $mu$-masses of all the closed halfspaces that contain $x$. As such, it measures the centrality of $x$ with respect to $mu$ and is used as a multivariate quantile. The notion of the halfspace depth found also application in machine learning. The existing literature on this interesting topic usually imposes restrictive assumptions on measure $mu$. We consider halfspace depth in general setting, for all finite Borel measures, with intention to collect partial results from the literature and give more general theoretical results. We specially focus on 1) when and how is it possible to reconstruct the underlying measure based on its halfspace depth function and 2) extending the so-called ray basis theorem, which gives an interesting characterization of the point with the maximal halfspace depth, called the halfspace median.

Petra Laketa PhD student, Charles University

Compressed time series text regression

Language: en

Authors: Daniel Mittendorf

Abstract: We propose a procedure to model the time-varying relationship between a univariate response and a high-dimensional predictor set derived from textual data. In such settings, standard shrinkage approaches such as lasso can encounter computational and numerical issues. While text-specific regression approaches such as multinomial inverse regression and hurdle regression have been developed, these methods allow for neither time-varying parameters nor heteroskedasticity. Our proposed method first performs several random projections of the predictor matrix to a much lower-dimensional linear subspace. For each of these compressed predictor sets, a time-varying parameter state space model is estimated using fast Kalman filter recursions, allowing for heteroskedasticity in the response. These models' predictions are then averaged dynamically using Bayesian posterior model probabilities. The resulting procedure remains stable with hundreds of thousands of predictors and is trivially parallelisable. Intuitive variable importance measures can be computed naturally with little additional computational cost.

Daniel Mittendorf PhD student, Adam Smith Business School, University of Glasgow

Multivariate crypto-portfolio optimization

Language: en

Authors: Karel Kozmik

Abstract: TBA

Karel Kozmik PhD student, Charles University

State-space moedlling of claims reserves

Language: en

Authors: Petr Vejmelka

Abstract: TBA

Petr Vejmelka PhD student, Charles University