HTCondor experience at IRFU and LLR

Motivations
- Torque 2.X: no longer officially maintained (security!)
- Maui: development and maintenance neglected (Moab…)
- Scaling? 4200 job slots / 10000; no further without modifications (especially to Maui)
- General trend (HEPiX): HTCondor

HTCondor
Pros:
- Responsive team
- Extremely complete
- In principle, better scaling
- Well supported for ARC CEs
- Hierarchical accounting
- …
Cons:
- The batch concept is… "different"
- TOO complete?
- Only moderately integrated with CREAM CEs (accounting)

HTCondor generalities
- ClassAds
- Scoping
- Batch? No queues: just priorities/reservations
- Accounting groups, hierarchical
  - Provide the functionality of queues
  - => 120 == 100%
  - => lab1 % == 100*(40/120) = 33%
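The lab1 percentage is a group's quota divided by the sum of all group quotas. A minimal Python sketch of that arithmetic follows. Only lab1 and its quota of 40 out of a total of 120 come from the slide; the other group names and values are hypothetical fillers, and the function is an illustration, not an HTCondor API.

```python
def group_shares(quotas):
    """Return each group's share of the pool, in percent of the summed quotas."""
    total = sum(quotas.values())
    return {name: 100 * quota / total for name, quota in quotas.items()}


# lab1 = 40 comes from the slide; lab2 and lab3 are hypothetical values
# chosen only so that the quotas add up to 120.
quotas = {"lab1": 40, "lab2": 50, "lab3": 30}
shares = group_shares(quotas)
print(round(shares["lab1"]))  # 33
```

The shares always add up to 100%, so changing one group's quota changes every other group's percentage.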

HTCondor generalities
- Resource limits are possible BUT must be defined on the machines themselves (no queues)
- Cgroups
- Process isolation possible (PID namespaces)
- Per-job quotas
- Filesystem isolation
- Cloud integration (VM kickstarting)
- …

Goal: s/CREAM|torque|maui/trash/g
- Configuration still under evaluation
  - 1 ARC CE / Condor scheduler (SCHEDD)
    - 1 single-core queue
    - 1 multicore queue (8 cores or nothing)
  - 1 HTCondor "manager" (COLLECTOR/NEGOTIATOR)
  - X multicore WNs (~8 C6220 / 256 cores)
  - Y single-core WNs
- "Standard" IRFU grid configuration
  - Pool accounts under NIS
- Full Puppet, no YAIM
  - HEP-Puppet modules (ARC, HTCondor) / CERNOps (glexec WN)
  - Local modules for the system configuration

HTCondor
- PID namespaces: nice in theory, easy to enable, BUT ATLAS shoots itself in the foot
- Filesystem isolation: /tmp and /var/tmp are isolated (bind mounts)
- Quotas defined per JOB (tmp included…)
- Dynamic job slots: 1 slot per machine
- Limits: ClassAds

HTCondor
- Initial setup => core dumps
  - Question to the mailing list => bug fix in < 1 week
  - Retry: core dump again, fixed again in < 1 week
- Dynamic job slots: by default, hyperthreading is not taken into account
- Accounting groups:
  - Based on the VO name
  - Separator: "."
  - "vo.irfu.cea.fr, vo.grif.fr"? => MAIN group: "vo"…
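The accounting-group pitfall comes from "." being the level separator in hierarchical group names: a VO name used as-is is read as a chain of nested groups, so both VOs end up under the same top-level group "vo". The Python sketch below only illustrates that parsing. The helper names are hypothetical, and replacing dots with underscores is one possible workaround assumed here for illustration, not something stated in the slides.

```python
def group_levels(accounting_group, separator="."):
    """Split an accounting group name into its hierarchy levels."""
    return accounting_group.split(separator)


def flatten_vo_name(vo_name):
    """Hypothetical workaround: replace dots so the VO name stays a single level."""
    return vo_name.replace(".", "_")


for vo in ("vo.irfu.cea.fr", "vo.grif.fr"):
    # The first level is the "main" group; both VOs collapse onto "vo".
    print(vo, "-> main group:", group_levels(vo)[0])
    # After flattening, each VO keeps its own single-level group name.
    print(vo, "-> flattened:", group_levels(flatten_vo_name(vo)))
```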

HTCondor
- No queue, no graph (ARC problem?)
- Single-core/multicore mix not tested, for lack of jobs
- Job defrag not tested
- Queue configuration by ATLAS / "software install"…?

HTCondor
- RPM conflicts:
  - Condor provides many Globus dependencies
  - It WINS the yum transaction when a WN is installed
  - Single package >> multiple packages
  - yum --exclude condor install emi-wn
  - EMI must be installed BEFORE Condor

Strategy: minimal impact (and, if possible, minimal effort)
- Use CREAM-CE (we are almost a unique case)
- Minimal changes to the Quattor PBS/CREAM config
Status:
- Migrated our PP CE in Oct 2014
- Tested by NGI & CMS SAM tests
- Still missing CMS HC jobs
- CMS promised test multicore jobs. None seen yet
Goal: move to HTCondor in prod by Q
- CMS is not pushing for MC jobs on T2, so we are not in a hurry.

HTCondor
- 1 head machine: CREAM-CE + SCHEDD + NEGOTIATOR/COLLECTOR
- 4 * 16-core workers
- Version (should upgrade to latest)
- Quattor config
  - Minor changes to the CREAM/BDII/BLParser templates
  - Rather general HTCondor templates
  - PR currently ongoing on the QWG git repo

HTCondor
- How do we map users into accounting groups?
- Accounting group, from the Condor manual: "Each job must state which group it belongs to. Currently this is opt-in, and the system trusts each user to put the correct group in the submit description file."
  - => map at submit time
  - We get (VO, FQAN, DN) and match it against regexps
  - Very flexible (…but performance should be checked)
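The submit-time mapping can be sketched as an ordered rule table where the first matching regexp wins. This is a minimal Python illustration, not the site's actual implementation: every pattern, group name, and the default group below is hypothetical, and only the idea of matching (VO, FQAN, DN) against regexps comes from the slide.

```python
import re

# Hypothetical rules: (attribute to test, pattern, accounting group).
# Order matters: the first rule that matches decides the group.
RULES = [
    ("fqan", re.compile(r"^/cms/Role=production"), "group_cms.production"),
    ("vo", re.compile(r"^cms$"), "group_cms.users"),
    ("dn", re.compile(r"CN=Local Admin$"), "group_ops"),
]
DEFAULT_GROUP = "group_other"  # hypothetical fallback


def map_accounting_group(vo, fqan, dn):
    """Return the accounting group of the first rule matching the job's identity."""
    attrs = {"vo": vo, "fqan": fqan, "dn": dn}
    for field, pattern, group in RULES:
        if pattern.search(attrs[field]):
            return group
    return DEFAULT_GROUP


print(map_accounting_group("cms", "/cms/Role=production/Capability=NULL", "/O=Grid/CN=Some User"))
print(map_accounting_group("cms", "/cms/Role=NULL", "/O=Grid/CN=Some User"))
print(map_accounting_group("atlas", "/atlas/Role=NULL", "/O=Grid/CN=Some User"))
```

Each submission walks the table linearly, which is why the cost of a long rule list is worth measuring, as the slide notes.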

HTCondor
- BDII publication is currently very basic
  - Using a very old plugin, "lcg-info-dynamic-condor noarch", that I got from the Milan T2
- APEL accounting: using the "RAL solution"
  - Condor to PBS log translation, then use the PBS APEL parser
  - Waiting for a Condor APEL parser (currently in testing)
  - Currently publishing on the test instance

HTCondor
- Problems with BLUpdater: fixed by using the one in the condor package
- Rewriting the BDII config

HTCondor: to test
- A cluster with separate CREAM/SCHEDD and NEGOTIATOR/COLLECTOR
- Test and implement the multicore setup in Quattor
- Take a closer look at BDII publication
- Run some load tests with both single-core and multicore jobs
- Implement job/user limits (max time, max memory, etc.)

HTCondor: howto
- List machines / job slots: condor_status
- List down/absent machines: condor_status -absent
- List running jobs: condor_q -run
- List the "fairshares": condor_userprio -grouporder

HTCondor: howto
Modify the quotas of a running job:
    condor_qedit -constraint 'RequestCpus == 8' MAX_DISK_KB 'RequestCpus*20*1024*1024'
    condor_qedit -constraint 'RequestCpus == 8' JobMemoryLimit
    condor_qedit -constraint 'RequestCpus == 8' RequestMemory 5120
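To see what the MAX_DISK_KB expression amounts to for an 8-core job, the arithmetic can be worked out in a few lines of Python. This sketch assumes the attribute is expressed in KiB, as its name suggests, so that 20*1024*1024 corresponds to 20 GiB per requested core. The function name is hypothetical.

```python
def max_disk_kb(request_cpus, gib_per_core=20):
    """Evaluate RequestCpus*20*1024*1024, assuming the result is in KiB."""
    return request_cpus * gib_per_core * 1024 * 1024


kb = max_disk_kb(8)
print(kb)                    # 167772160
print(kb // (1024 * 1024))   # 160, i.e. 160 GiB for an 8-core job
```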

Summary: rn2014.html
