As soon as possible.
Occasional evening and weekend hours required.
Under the day-to-day direction of the Director of Architecture and Planning, Heart Institute Information Technology and reporting as well to the Senior Vice-President, Digital Health and Cardiac Technology and Chief Information and Technology Officer (CITO), the incumbent is responsible for maintaining the application software stack, hardware technology, libraries, and specialized APIs on a High-Performance Computing environment dedicated to AI/ML/DL research and development. Assist Principal Investigators with algorithm development and tuning for research project specific goals. Provide technical support and training to cluster users as well as monitoring the performance of the HPC environment and secure access. Work with the Back-Office Technologies team members to maintain the HPC environment OS, VMs, hardware, networking and storage.
Provide professional service to internal clients for High Performance Computing systems.
Provide Subject Matter expertise (SME) to the support team regarding HPC AI systems and technologies.
Mentor and assist other support team members in resolving desktop issues related HPC AI systems.
Exercise resourcefulness in determining the cause of problems through acquired knowledge and expertise, assistance from other IT support staff, and in-depth understanding of problem-solving skills.
Understand defined Service Level Agreements (SLAs), the various user populations and the impact of support issues to help determine the prioritization.
Maintain effective lines of communication with other IT support groups and working with these groups to ensure unresolved problems are handled in an expedient manner, problem trends are identified, and root causes eliminated.
Provide innovative ideas and suggestions on ways to improve existing back-office technologies, processes, and procedures.
Adhere to all documented & formalized policies and procedures.
Promote the use of corporate standard hardware and software to ensure legality and information security.
Post-secondary graduate in Computer Science, Computer Systems, Computer Engineering Technology, Applied Mathematics, or related field.
Proficient in Advanced mathematics including:
Familiar with partial differential equations, Gradient decent, Lagrange, convex optimization, etc.
Basic statistics relevant for ML/AI.
Deep and current understanding of machine learning algorithms and how to apply them to real world problems.
Algorithm optimization for target hardware
Conceptual knowledge of CPU, GPU, network, and storage architectures as applied to ML workloads.
Basic understanding and ideally experience with the following technologies:
Python, R, C/C++
Git code version control system
Singularity, Docker, and Kubernetes container frameworks
Cloud computing services (Amazon Web Service, Google Cloud Platform, Microsoft Azure, etc.)
Slurm HPC environment
Nvidia GPUs and associated driver/software stacks (CUDA, Nvidia NGC)
VMWare (ESX 6.5, VSphere)
Knowledge of Linux (or UNIX) systems and preferably experienced in administering SUSE and CentOS distributions
Linux deployment tools such as Ansible
Storage Area Network and Network Attached Storage
Enterprise Backup Solutions (Enterprise backup software, LTO tape libraries, etc.)
Exposure and understanding of basic Data Base principles (MS SQL, My SQL, PostgreSQL, Oracle)
Fundamental understanding of IT Cyber Security (threats, encryption, inspection, remediation, browser security, etc.)
Networking (fundamental knowledge of TCP/IP, DNS, DHCP, Switches, Routers, firewalls, etc.)
Knowledge of the Heart Institute’s constituencies and the community it serves.
Ability to communicate in French will be considered an asset.
Knowledge of IT back-office support practices
Knowledge of Help Desk problem management software.
Above average analytical, organizational, oral, and written communication skills are required in providing effective liaison with the user community.
Very good interpersonal skills.
Excellent customer service skills including patience and diplomacy.
Team player with demonstrated commitment to service excellence.
Strong problem and analysis and solving skills.
A well-disciplined documenter.
Organizational, time management, leadership skills and the ability to work independently.
Predisposed to collaborate with colleagues and share ideas.
Please email cover letter and CV: email@example.com.
Accommodations will be provided in all parts of the hiring process relating to any specialty requirements. Applicants should make their needs known in advance.
The successful candidate will be required, prior to the start of employment, to complete mandatory organizational training available online, provide a satisfactory Criminal Record Check and provide an official piece of photo identification.