News page

The most recent news from our NCC.

Interview: Preparing Students for the Future - Prof. Dirk Valkenborg's Approach to Supercomputing Education

Prof. Dirk Valkenborg

Prof. Dirk Valkenborg is the Programme director of the data science trajectory in the Master of Statistics & Data Science at UHasselt. He teaches a course on Machine Learning in this programme, and during this course, his students use the supercomputing infrastructure of The Flemish Computer Center. We sat down with Prof. Valkenborg to ask him why he is so keen on getting his students enthusiastic about working with a supercomputer.

Why did you introduce supercomputing to your students?


In the Master of Statistics & Data Science, particularly within the Data Science track, students must handle large datasets. So, we needed a solution to provide our students with more computational power. We also wanted to teach them some basic skills in supercomputing as preparation for their later professional life. Several options were available for consideration.
One possibility was extending our local infrastructure and granting students access to these facilities. But this extension would be very costly and require updating hardware and continuous maintenance, so this was not feasible. Alternatively, we could use a platform like Google Colab or Kaggle, which also has limitations. When training is required on a larger dataset, you get thrown out of the service because it takes too much time.


We contacted the VSC to check whether there was any possibility of giving our students access to the VSC computing facilities, and this arrangement was feasible. Of course, the entry-level here is higher. The other two options had a more accessible entry: run a Jupiter notebook on a GPU via Colab or Kaggle. When opting for local infrastructure facilities, we would give the students a bit more computing power. Still, their mode of operation would remain the as using their regular laptop. Students would not think about parallelisation, making data available, job submission etc. Using VSC infrastructure, forces them to do so. 
Also local infrastructure does not scale, and the VSC infrastructure does.

You decided to introduce the students to supercomputing in the first semester of the second year of the Master's. Why?

In our master's path, the focus on the master thesis project is only in the last three months of the academic year. Getting students on board with supercomputing during that short period is challenging. Therefore, giving them a taste of the VSC capacity earlier in their educational journey would be much more convenient.
I teach a Machine Learning course in the second year of the Master of Statistics & Data Science. In this course, a relatively large dataset needs to be analysed, and this also can perfectly be done using supercomputing infrastructure. So, we added this as an extra competence for this course. 

Prof. Bex and I organise two sessions in the Machine Learning course to gently introduce students to remote computing. One of the final objectives of the Machine Learning course is that their final model runs on a supercomputer.

 

What is the added value for the students?

Companies like VITO or Janssen Pharmaceutics have their own supercomputing infrastructure with a slightly different interface, but it is all terminal-based. Once they get on board with the philosophy, students can work on other infrastructures too.

So, it gives them additional skills in their first steps in the job market.
Or if they want to do a Ph.D., they are one step ahead. Many of our Ph.D. students get the same training as the one we provide in our Machine learning course.

If we ask Ph.D. students who have worked on their Ph.D. for several years to work with a supercomputer, they are reluctant to make that switch. They think: ‘I will just leave my laptop on for a few nights'. You can compare it with learning a new language. The sooner you start, the better. The earlier students learn their way around a supercomputer, the better. 

Which students are currently involved in this? 

We currently focus on students of the Master of Statistics & Data Science students. Still, these are elective courses and can also be taken by others (students entering from biology, biostatistics or bioinformatics). This is the student's choice, but we are considering how to make this compulsory for all our students because supercomputing is also relevant in statistics, bioinformatics, epidemiology etc. ….

How did you convince the students to step in? How did you take away the threshold with your students?

I first explained the project's added value and told them it was necessary. It is a competence evaluated through a pass/fail system. The student fails if one cannot run the script on the supercomputer.

I test two competencies in this way: first, are students technically proficient enough to submit a job on a supercomputer? Second: are they able to write a script? Because I still notice a lot when practically writing scripting languages (especially R), students often have a text file full of R code, which they run block by block. They select a few lines of code, run that code, scroll to another piece in their document, and then select the code they wish to run. This is incompatible with the philosophy of submitting a job to a supercomputer. There, students have to program in a logical line.

Did you have any lessons learned from this experience? How would you address these?

Because I am the only professor requiring this, it’s a bit of a forced exercise: you see students checking this off and struggling for a while but quickly falling back into old habits. To better consolidate this, I recommend that every professor works with the pass/fail method if possible.

I also noticed that students don’t start working on the supercomputer immediately after the introduction. Because the assignment is to run the final model on the supercomputer, which is sometime in December, they postponed it until then. Luckily the introduction session was recorded so they could look at it again. So, during that period, we received many questions about how to work because it was too long ago.
For the future, we are thinking about ways to activate students to work with it immediately and consolidate their knowledge. We could explain how to do it and let them practice through exercises so they do it repeatedly. After three or four times, they will master the skill.

Why would you convince other professors to use supercomputing?

For me, it was a need that became more pressing in the master thesis. If students want to deal with data and start to apply to compute there, they have to switch to a more extensive system.  Introducing them to supercomputing in early in their career is an excellent way to prepare them for this.
In the course where I apply it, it is not strictly necessary, but this way, they can already learn those skills on a small case and see the added value of such a system.

The more we require students to use this, the more natural this becomes, and that’s a plus.

Thank you, prof. Valkenborg for this inspirational talk. We wish you good luck with the course! 


EuroCC Belgium encourages the use of supercomputing in education and wants to facilitate others to teach how to utilise a supercomputer effectively. Therefore, we provide a professional teaching kit, including a comprehensive slide deck. More material (videos etc.) will follow soon.

Check out our teaching kit
via this link. Unlock the potential of others with our training tools!

 

 

?Attention: 3-6 October 2023 Comprehensive General LUMI course - Warsaw/Online

LUMI Supercomputer

This four-day on-site (Warsaw, Poland) and online course serve as a general comprehensive introduction to the LUMI architecture and programming environment. It will include lessons about compiling and using software, programming models (HIP and OpenMP offload), porting, executing jobs, and optimizing applications to run on AMD MI250X. After the course, you will be able to work efficiently on both the CPU (LUMI-C) as well as GPU partition (LUMI-G).

Please note, that this is a comprehensive course with an emphasis on code development and analysis tools.

The four-day course is “on-site first”: on-site participants get plenty of time to interact with the LUST, HPE, and AMD staff that is present, but we cannot organize an equivalent interaction with the online participants. For them, it is mostly a broadcast of the lectures with the option to raise questions via a shared document. Anything that would deteriorate the experience for the on-site participants is avoided. The recent course in Tallinn has shown that the direct interaction with the people present in the room gives a lot of additional value to those people that is hard to get in an online setup. 

Target group

The course is intended for users with ongoing projects on LUMI, users with project proposals in one of the national or EuroHPC channels, and support staff of local organizations of the LUMI consortium members.

More information is available at: https://lumi-supercomputer.eu/events/general-lumi-course-oct2023/

Registration - deadline: 25/09/2023 at 16:00 CEST

Register here: https://events.plgrid.pl/e/lumi-general-course

 

?New User Story - How supercomputing helps Atlas Copco to better explore and design filter media

Streamlines flow through a filter medium coloured by velocity magnitude

Read ? our latest user story ? ? https://www.enccb.be/atlascopco

Did you know Atlas Copco extensively uses computing, computer models and digital twins to predict the behaviour of various products in different conditions. Tom Saenen (Atlas Copco, Technology developer Computational Fluid Dynamics): "Supercomputing allows for more extensive exploration and optimisation of a design. It also permits the simulation of more physics and larger problems that were infeasible before, like for instance having a better understanding of microscale air and oil flow behaviour in oil aerosol filter media. ” Want to know how supercomputing helped to get insight into the design of filter media leading to cleaner air delivered at a lower energy cost for the customer? 

06/06/2023 EuroCC Training - Performance-aware C++ programming

C++ code

6 June 2023 - Performance-aware C++ programming 

Description

The C++ programming language is a systems-level language pervasive in many areas including scientific computing. This complex language provides users with a multitude of approaches to solving complex tasks, but not all programming paradigms or approaches are equal in terms of performance.

This 'back-to-basics' training shifts slightly away from over-abstraction or potential overuse of libraries and focuses primarily on performance through the use of the language itself. We show significant speedups are possible through several key principles:

  1. Having some understanding of the hardware (bandwidth, latency, caches)
  2. Data-Oriented Programming versus Object-Oriented Programming
  3. Vectorization and parallelization
  4. Compiler and how to get the most out of it

By the end of the training you will have some intuition on performance and what affects it, as well as pointers for improving performance in your own projects.

Whether your goal is shortening the calculation time of your simulations/workloads or enabling solutions which are otherwise impractical, extracting the most out of the available hardware is a key skill. This training provides an introduction to this topic with a few exploratory trips into the core details. 

Target audience:

Intermediate level, some experience with C++ is recommended
Anyone with an interest in performance-critical applications

Practical information 

Date & time: 6/06/2023 - from 9.00 am to 11.30 am 

Location: online (after registration, a link will be sent to you prior to the training)

Registrations for this training are now closed. 

13/06/2023 - Event: How to Accelerate Your Business with Belgian Supercomputers

Picture event How to Accelerate Your Business with Belgian Supercomputers

Take advantage of this exclusive opportunity to learn about the latest trends and strategies in High-Performance Computing (HPC). This networking event is designed to provide valuable insights into how HPC can drive innovation, boost productivity, and help your business stay ahead of the curve. 

Join us on June 13th at FWO, Brussels, to hear from industry experts and network with like-minded business leaders. Register to secure your spot and take the first step towards unlocking the power of HPC for your business. 

This event is organised by EuroCC Belgium, the HPC National Competence Center Belgium, in partnership with Agoria and the European Digital Innovation Hubs for the manufacturing industry WalHub and DIGITALIS

Program

  • 13:30 check-in 
  • 14:00-14:15. Welcome by Benoît Dompierre (EuroCC Belgium) and Bart Meert (Agoria) 
  • 14:15-15:00 HPC business case by David Waroquiers (Matgenix
  • 15:00-15:45 HPC business case by Ine Vandebeek (Diabatix
  • Closing: 15:45-16:00 
  • Drinks: 16:00-17:00 

Venue 

HOEK 38 Leuvenseweg 38 

1000 Brussel 

Directions: https://www.fwo.be/en/contact/address-and-directions/


Contact  

For more information or questions contact carl [dota] mensch [ati] uantwerpen [dota] be

 

Registration 

The event is free for participation but please register online via the registration form

 

Logos Partners Event Agoria