Site Reliability Engineer

ION Core Banking, Full-time, Collecchio

About us:

The ION Group is made up of innovators who provide trading and workflow automation solutions, high-value analytics, and strategic consulting to corporations, financial institutions, central banks, and governments.

More than 40% of the world’s largest companies use our solutions. We’ve achieved tremendous growth by bringing together some of the best and most successful financial technology companies in the world.

At ION, we offer careers that provide many opportunities: To invent. To design. To collaborate. To build. To transform businesses and empower people around the world to do more, faster and better than before. Imagine what you can do and experience. This is where you can do your best work.

Learn more at iongroup.com.

We are looking for experienced people who are competent in the cloud and knowledgeable about the SRE (site reliability engineering) domain.

The team

The Core Architecture Team (CAT) produces and manages the core technology, methodologies and frameworks that underpin all new or re-engineered ION products.

We provide our internal and external customers with foundations and an open platform that they can extend and evolve to manage their solutions independently and at a reduced cost of ownership.

The ION Cloud Center of Excellence aims to support the Group's strategy toward “a Cloud native offering” via a cross-functional team of empowered people who are responsible for developing and managing the strategy, governance, and best practices for the entire Group.

Some of the team deliverables:

· Create an ION Cloud Infrastructure that is reusable by all ION Divisions

· Reduce the total cost of ownership

· Provide guidelines and best practices for the entire organization

· Reduce operational complexity via automated platform configuration and deployment

· Provide tools that make it easier for developers to set up the CI environment for ION products

· Govern the development tools to increase operational efficiency

· Standardize technology recommendations, infrastructure and product design across the Group

Who you are

Your background is in software development or operations/infrastructure (or both!), and you enjoy coding and automating your workflows.

You have proven experience in working with cloud providers and dealing with cloud-first applications engineered with a cloud-native mindset.

You are a self-starter and an engineer who is constantly learning, and you enjoy working in a team of peers.

You are open and candid about discussing solutions, problems and improvements within your team and others in the engineering organization.

You have a passion for site reliability engineering (SRE) principles and adoption, and you are keen to start conversations with teams about reliability, performance and security of the applications, services and systems.

You are an advocate of the DevOps or SRE approach, promoting loosely coupled, heavily automated, constantly monitored distributed systems, and you always plan for failure and never take anything for granted.

You are keen to raise the bar of the solutions provided by the whole engineering team (dev and ops).

You possess strong written and verbal communication skills.

You are happy to take part in an on-call rotation when needed.

What you'll be doing

It’s fine to have some of these, the more the merrier!

The Cloud Engineer side

· Maintain our internal tooling and automation to improve the reliability, scalability and observability of our services.

· Proactively identify and solve issues across the whole stack, together with the rest of the infrastructure and engineering teams.

· Help raise awareness of cloud security and protection, understanding how to fit this work into the timelines and backlog of the end team.

· Understand how a distributed application works, including its constraints and limitations.

· Have strong coding and scripting experience and an interest in improving your programming knowledge (Python or Go ideally).

The Site Reliability Engineer side

· Promote and drive the adoption of SRE principles and raise awareness of the importance of reliability and automation.

· Help the team understand concepts like ownership, error budgets and production readiness.

· Help define and implement SLIs and SLOs, and check SLAs, to ensure customer satisfaction.

· Work together with teams to identify and solve issues in platforms and tune services for reliability and performance.

· Aim to reduce toil and manual effort through automation, repeatable and documented tooling, and standard procedures.

· Take an active part in the incident management process to troubleshoot impacting issues in a timely manner and engage with all stakeholders involved.

Your skills, experience, and qualifications

These are must-haves!

· Our working language is English, so it’s very important to be proficient in it.

· Extensive knowledge of and experience in one of the major clouds (AWS, Azure or GCP), with a comprehensive understanding and real-world implementation experience. We currently use AWS and Azure.

· Microservices in a cloud-native world: architecture, deployments and engineering in the Kubernetes and container space. You are familiar with how to protect services and adhere to industry standards and best practices.

· Understanding of network topologies, deployment methods and constraints in the cloud.

· Familiarity with application development methodologies in a cloud-native environment and container-based runtime.

· Understanding of distributed systems is essential. You would benefit from familiarity with architectural concepts such as SOA, object-oriented analysis and design, and/or client/server systems.

· Experience working with diverse, remote, and distributed teams across multiple regions and time zones.

· A proven track record as a site reliability or production engineer, working in a consulting capacity directly with teams to educate them and provide the best solution achievable within the project constraints.

· Cyber security and operations awareness: understanding the basic principles (identity and access management, least privilege, encryption, etc.) and striving to implement best practices and education, to establish a robust set of defences in line with the company requirements.

Contract and locations

· Contract Type: Full-time, permanent contract.

· Locations: London, Milan, Pisa, Parma.

Enjoy a hybrid work culture that offers the best of remote flexibility and in-person collaboration.

Important notes (Italy):

Please note that, in accordance with Italian Law (L.68/99), candidates from the disability list will be given priority.

Due to the high volume of applications, only candidates who meet the required selection criteria will be contacted.

If you’re from a non-EU country, you must have a valid EU visa or work permit.
