LCG-France Tier-1 & AF Réunion mensuelle de coordination

Slides:



Advertisements
Présentations similaires
What’s the weather like?. Look at the verb phrase fait-il above Turn it around and you have il fait The phrase Il fait can be used to describe lots of.
Advertisements

RozoFS KPI’s edition /04/2014. © Fizians Ce document ne peut être reproduit ou communiqué sans autorisation écrite. 2 RozoFS high level architecture.
CAF-11/10/2010Luc1 Squad Report T1 Période 13/09-11/10 Irena, Sabine, Emmanuel.
WINS Windows Internet Name Service. What is WINS?  It does name resolution (?!) DNS resolves IP numbers and FQDN ARP resolves IP numbers and MAC addresses.
The subjunctive mood If I were you, I’d call him It is absolutely necessary that you be there on time May God save the queen! Normally, in English we would.
PERFORMANCE One important issue in networking is the performance of the network—how good is it? We discuss quality of service, an overall measurement.
Alice LCG Task Force Meeting 16 Oct 2008 BARBET Jean-Michel - 1 /20 LCGFR Marseille Juin 2010Jean-Michel BARBET Subatech 1 /22 Support de la VO Alice à.
Yannick Patois 1 Utilisation LCG-France Les Technical Evolution Groups et LCG-France.
PIPE SUPPORTS 1 Pipe supports inside the compression station and pumping stations AUGUST 2014.
1 Case Study 1: UNIX and LINUX Chapter History of unix 10.2 Overview of unix 10.3 Processes in unix 10.4 Memory management in unix 10.5 Input/output.
An Introduction To Two – Port Networks The University of Tennessee Electrical and Computer Engineering Knoxville, TN wlg.
IP Multicast Text available on
Update on Edge BI pricing January ©2011 SAP AG. All rights reserved.2 Confidential What you told us about the new Edge BI pricing Full Web Intelligence.
Overview of SUN’s Unix Campus-Booster ID : **XXXXX Copyright © SUPINFO. All rights reserved Introduction to Solaris 10.
Résumé wLCG/EGEE Operations Workshop
LCG-France Tier-1 & AF Réunion mensuelle de coordination
Passage de Main SYSGRID Réunion 1
IGTMD réunion du 4 Mai 2007 CC IN2P3 Lyon
The Passé Composé Tense
Réunion Opérations France Grilles – 6 juin 2017
Compte rendu HEPIX et CHEP2015 Stockage et gestion des données
(Specialized Support Center)
Vue d'ensemble de l'utilisation du CCIN2P3 par les expériences LHC
Where Can I Buy Permethrin Spray For Scabies
12 mars 2004, Lyon Reunion CAF F.Chollet 1
Projet eXtreme DataCloud XDC
Samples for evaluation from All Charts & Templates Packs for PowerPoint © All-PPT-Templates.comPersonal Use Only – not for distribution. All Rights Reserved.
Fonctionnement de la grille
LCG-France Tier-1 & AF Réunion mensuelle de coordination
LCG-France Tier-1 & AF Réunion mensuelle de coordination
LCG-France Tier-1 & AF Réunion mensuelle de coordination
Etat des lieux des VO Boxes LHC
LCG-France Tier-1 & AF Réunion mensuelle de coordination
2ème coloque LCG-France
Quantum Computer A New Era of Future Computing Ahmed WAFDI ??????
Réunion de Coordination – Bilan des opérations LCG Hélène CORDIER
LCG-France Tier-1 & AF Réunion mensuelle de coordination
LCG –France Tier2 & AF Réunion de Coordination – Problèmes récurrents des VOs 11 Juin- 10 Septembre Hélène CORDIER.
Résumé CB WLCG du 3 février 2005
Conditional Clauses By Mª Mercedes Sánchez Year
Theme One Speaking Questions
Exercices: Système d’Information
LCG –France Tier2 & AF Réunion de Coordination – Problèmes récurrents des VOs 10 Septembre – 21 Octobre Hélène CORDIER.
Tools & Bibliography November 2008
1 ISO/TC 176/SC 2/N1219 ISO 9001:2015 Revision overview - General users July 2014.
Lect12EEE 2021 Differential Equation Solutions of Transient Circuits Dr. Holbert March 3, 2008.
Introduction to Computational Journalism: Thinking Computationally JOUR479V/779V – Computational Journalism University of Maryland, College Park Nick Diakopoulos,
High-Availability Linux Services And Newtork Administration Bourbita Mahdi 2016.
The Passé Composé In the previous lesson we looked at the formation of the passé composé (perfect tense) with Avoir verbs. In this lesson we will further.
Le soir Objectifs: Talking about what you do in the evening
Restoration efforts required for achieving the objectives of the Birds and Habitats Directives Expert Group on Reporting under the Nature Directives –
Data Center Interconnect Ethernet VPN
Forum national sur l’IMT de 2004.
Definition Division of labour (or specialisation) takes place when a worker specialises in producing a good or a part of a good.
Introduction à GENIUS et GILDA
Quelle est la date aujourd’hui?
Standards Certification Education & Training Publishing Conferences & Exhibits Automation Connections ISA EXPO 2006 Wed, 1:00 Oct 18.
1-1 Introduction to ArcGIS Introductions Who are you? Any GIS background? What do you want to get out of the class?
Vulnerability Analysis by : Wail Belhouchet Dr Djouad Tarek 1.
What’s the weather like?
Aymeric Weinbach MVP Microsoft
BRMS Implementation Status Update Template designed by 18-July-2015.
By : HOUSNA hebbaz Computer NetWork. Plane What is Computer Network? Type of Network Protocols Topology.
Programmation de l'égalité des genres dans l'action humanitaire
The Passé Composé Tense
Résumé des Actions Suite aux Réunions CB et MB
Le Passé Composé (Perfect Tense)
University : Ammar Telidji Laghouat Faculty : Technology Department : Electronics 3rd year Telecommunications Professor : S.Benghouini Student: Tadj Souad.
IMPROVING PF’s M&E APPROACH AND LEARNING STRATEGY Sylvain N’CHO M&E Manager IPA-Cote d’Ivoire.
M’SILA University Information Communication Sciences and technology
Transcription de la présentation:

LCG-France Tier-1 & AF Réunion mensuelle de coordination 21/07/2018 12/01/2012 LCG-France Tier-1 & AF Réunion mensuelle de coordination Pierre Girard Pierre.girard@in2p3.fr

Les rats quittent le navire… Poste de responsable du projet LCG@CC.IN2P3.FR vacant au 1er septembre… Poste d’administrateur grille vacant au 1er septembre… Poste de développeur accounting grille vacant au 1er septembre… 12/01/2012

Technical Evolution Groups Disponibilité du site Dossiers en cours Plan Nouvelles de LCG CR du dernier GDB Technical Evolution Groups Disponibilité du site Dossiers en cours Evénements 12/01/2012

Nouvelles de LCG 12/01/2012

GDB du 8 février 2012 CR du dernier GDB http://indico.cern.ch/conferenceDisplay.py?confId=155065 12/01/2012

CR du dernier GDB http://indico.cern.ch/getFile.py/access?sessionId=0&resId=1&materialId=0&confId=155065 Source: 12/01/2012

CR du dernier GDB http://indico.cern.ch/getFile.py/access?sessionId=0&resId=1&materialId=0&confId=155065 Source: 12/01/2012

CR du dernier GDB Need MW to switch to RFC proxies http://indico.cern.ch/getFile.py/access?sessionId=8&resId=3&materialId=0&confId=155065 Source: Need MW to switch to RFC proxies We have summer & autumn to deal with the bugs. Prepare for major production upgrades early 2013 12/01/2012

Technical Evolution Groups 12/01/2012

Mardi 7 février au CERN TEG Reports http://indico.cern.ch/conferenceDisplay.py?confId=158775 12/01/2012

TEG Storage/Data Management (1) Constat Often more status quo / known defect discussions than a strategy one so far Ofter we know better in which direction to leave than where we want to arrive… Most of the strategy proposals originate from s/w providers La suite prévue There is a need for an overall architecture diagram and concrete recommendations. Should document (the architecture of) both what we have now and what we want in the future.” 12/01/2012

TEG Storage/Data Management (2) Data placement responsibilities Site: Storage Management Stable storage for placed data Detection/Repairing of Mismatch between SE disk and SE catalogue (dark data) Experiments: Data Management Geographical dataset distribution Mismatch/Repairing between SE and experiment catalogue (grey data) DM solution are used to overcome storage management problems Catalogs Abandon programmé de LFC dans les 2 prochaines années Solution spécifique par VO 12/01/2012

TEG Storage/Data Management (3) Data placement & Federations Push via FTS/xroot Now complemented with federated access via xroot to reduce impact of (temporarily or permanently) unavailable data. Relevance increasing towards smaller sites. Federation in WLCG: Alice, US Atlas and US CMS Xrootd used as federation infrastructure with mixed storage infrastructures: dCache, DPM, GPFS, HDFS, and native xrootd Protocoles considérés : xrootd, NFS v4 et HTTP WAN Protocols Fonctionnalités attendues third party copy, data integrity (checksum), efficiency (parallel streams), stable/dependable server/client implementations Should be compatible with storage federation Candidats possibles gridFTP, Xrootd, http(s)/WebDav, NFS-4.1/pNFS, S3 https considered as very promising 12/01/2012

TEG Storage/Data Management (4) SRM + Cloud discussions Problèmes de SRM dont son coût de maintenance Discussion “Cloud vs SRM” Unclear if Cloud provides any benefit Should consider USA's push toward virtualisation WebDav considéré aussi It's a standard. Lots of clients, covering all platforms Support in SEs coming with EMI-2. Need to separate SRM into core functionality blocks Transfer management, Interacting with namespace, Aggregated space querying, Storage management Use this breakdown to compare alternatives Analyse Cloud APIs and WebDAV Identify their limitations http://indico.cern.ch/getFile.py/access?contribId=1&resId=4&materialId=slides&confId=158775 Source: 12/01/2012

TEG Workload Management (1) Main areas tracked so far Commonalities with pilots and frameworks Streamed submission Support of jobs requiring whole nodes or multiple cores specifying whole node / multi-core requirements in the JDL/RSL SGE support for these features in CREAM (provided in EMI by CESGA) is currently missing (?) Support of CPU affinity and CPU sets CPU Pinning, LRMS solution, SL6… Tagging I/O vs. CPU intensive jobs We propose to introduce a new parameter to specify whether a job is CPU- or I/Obound. The parameter is a scalar in the range [0,1] (0=CPU-bound; 1=I/O bound). Requirements for a CE service define a “CE alias” with a pool of individual CE’s behind the alias, where the individual CE’s share state data Summary of the use of the IS by WLCG experiments Source: http://indico.cern.ch/getFile.py/access?contribId=2&resId=1&materialId=slides&confId=158775 12/01/2012

TEG Workload Management (2) Use of virtualization technologies https://twiki.cern.ch/twiki/bin/view/LCG/WLCGWlMgmtTEGVirtualization Virtualization penalties must be carefully considered… Static virtualization of batch resources vs dynamic allocation of VMs e.g. of automating deployment of intrusive updates, or of optimizing allocation and encapsulation of multi-core VMs used for multi-core jobs. Scalable mechanism, possibly linked to an LRMS, is required to properly handle dynamic allocations. Early tests suggest that both e.g. Torque and LSF may have troubles scaling to several (tens of) thousands of VMs, at least if not properly configured. Source: http://indico.cern.ch/getFile.py/access?contribId=2&resId=1&materialId=slides&confId=158775 12/01/2012

TEG Workload Management (3) Cloud computing https://twiki.cern.ch/twiki/bin/view/LCG/WLCGWlMgmtTECloudComputing VO’s already expressed interest in accessing IaaS entry points to sites. Batch cloud factory Adopt HEPiX recommendations (e.g. wrt VM capabilities) and consider the EGI policy for what regards running VMs. Define common methods to access resources (e.g. via EC2 or OCCI) and common sizes to ensure consistency and avoid effort duplication; track possible integration into an IS. Explore integration into an LRMS and with CREAM for the provisioning of cloud instances. Track AuthN/AuthZ developments and integration of multiple cloud sites. Early tests on Cloud instantiations at WLCG sites include CERN’s lxcloud and INFN’s WNoDeS. participating to the EGI distributed cloud TF Source: http://indico.cern.ch/getFile.py/access?contribId=2&resId=1&materialId=slides&confId=158775 12/01/2012

TEG DB Management Source: http://indico.cern.ch/getFile.py/access?contribId=3&resId=0&materialId=slides&confId=158775 12/01/2012

Autres TEG TEG Operations TEG Secutity http://indico.cern.ch/materialDisplay.py?contribId=5&materialId=slides&confId=158775 Draft version of report at https://twiki.cern.ch/twiki/bin/view/LCG/WLCGTEGOperations TEG Secutity http://indico.cern.ch/materialDisplay.py?contribId=4&materialId=slides&confId=158775 Only serious option to take for the short-term is ID switching with gLExec GlideinWMS already handles the ID switch transparently via Condor 12/01/2012

Résultats du site 12/01/2012

Résultats Janvier 2012 https://espace.cern.ch/WLCG-document-repository/ReliabilityAvailability/Forms/AllItems.aspx 12/01/2012

Disponibilités des VOs pour janvier 2012 https://espace.cern.ch/WLCG-document-repository/ReliabilityAvailability/Forms/AllItems.aspx?RootFolder=%2FWLCG-document-repository%2FReliabilityAvailability%2FTier-1%2F2011%2Ftier1_reliab_122011 Source: Disponibilités des VOs pour janvier 2012 Fin des LCG-CEs et pas de SE Avail : N/A Avail : 99% ALICE CMS Avail : 92% Avail : 100% ATLAS LHCb 12/01/2012

Dossiers en cours 12/01/2012

Décisionel, portail CESGA et accounting LCG faux depuis juin 2011. Pb d’accounting GE Décisionel, portail CESGA et accounting LCG faux depuis juin 2011. https://grid.in2p3.fr/LCGFrAccounting/CPU/2011/ 12/01/2012

Décisionel, portail CESGA et accounting LCG faux depuis juin 2011. Pb d’accounting GE Décisionel, portail CESGA et accounting LCG faux depuis juin 2011. Préoccupation pour 2012: migration (et reprise) de l’accounting grille vers la nouvelle infrastructure d’accounting de WLCG/EGI ? https://grid.in2p3.fr/LCGFrAccounting/CPU/2011/ 12/01/2012

Chantiers en cours CREAM GEs Cf. talk des cemasters 12/01/2012

Chantiers en cours Problèmes réseaux (update) Avec les sites étrangers qui ne sont pas sur LHCOPN Impact sur ATLAS et CMS https://ggus.eu/ws/ticket_info.php?ticket=75983 https://ggus.eu/ws/ticket_info.php?ticket=74268 Reconfiguration du LAN en janvier Amélioration des transferts T2s->CCIN2P3 Dernier update des tickets le 26/01 par Jérôme We put in production a second 10Gb/s interface provided by RENATER in order to separate LHCONE and generic IP traffics. Let's see the evolution of our transfers on the generic IP network. Des progrés notés ? 12/01/2012

Chantiers en cours Important UPGRADE/UPDATE de dCache Documentés https://cctools.in2p3.fr/wiki/exploitation:arrets:arret20120207 http://cctools.in2p3.fr/elog/dCache/910 Principaux changements effectués Postgres 9.1. 2 core servers chimera sur ccdcamcli04 + SRM sur ccdcamcli05 96GB de RAM, disques SSD 200GB en RAID-10 Passage a la deuxième golden release (aka 1.9.12) Qques problèmes rencontrés Prolongation du downtime jusqu'au 08/02/2011 10:00 Pb de checksum sur le Pool2Pool Pb d'autentification pour certains DNs de LHCb Etc. 12/01/2012

Migration du LFC Atlas prévue 20/22 mars Migration du MW vers EMI-1/2 Chantiers en cours Migration du LFC Atlas prévue 20/22 mars Suivi par Emmanouil et David Migration du MW vers EMI-1/2 “Very urgent” Recensement des service grille à entamer Etat, Hardware, Configuration, Monitoring, Contacts… 12/01/2012

Date et lieu Proposition de sujets Prochaine T1/AF 22 mars en salle 202 Proposition de sujets “Whole node” job Sébastien ? Frontier/Squid Emmanouil ? Etat d’avancement du déploiement d’OTRS ? David ? Autres ? 12/01/2012

A venir (à mettre à jour) Evénements A venir (à mettre à jour) OGF 34, Oxford, 11-14 March 2012 HEPiX Spring Meeting, Prague, 23-27 April 2012 ISGC, Taipei, 26 February - 2 March 2012 EGI User Forum, Munich, 19-23 March WLCG Workshop, NYC, 19-20th May 2012 CHEP2012, NYC, 21-25th May 2012 EGI Technical Forum, Prague, September 2012 HEPiX Fall Meeting, IHEP, October 2012 12/01/2012