Why machine learning projects fail and how to make them succeed

Name	Provider	Purpose	Expiry	Type
JSESSIONID	NewRelic	General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.	Session	First party
__cfruid	HubSpot	This cookie is set by HubSpot’s CDN provider because of their rate limiting policies.	Session	First party
hs-membership-csrf	HubSpot	This cookie is used to ensure that content membership logins cannot be forged. It contains a random string of letters and numbers used to verify that a membership login is authentic.	Session	First party
__cf_bm	CloudFlare	This cookie is used to distinguish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.	30 Minutes	First party
cookie-agreed	www.cgi.com	Stores the user's cookie consent state for the current domain	23 Days	First party
AKA_A2	Akamai	This cookie is generally provided by Akamai and is used for the Advanced Acceleration feature, which enables DNS Prefetch and HTTP2 Push.	1 Hour	First party

Name	Provider	Purpose	Expiry	Type
hs-messages-is-open	HubSpot	This cookie is used to determine and save whether the chat widget is open for future visits. It is set in your visitor's browser when they start a new chat, and resets to re-close the widget after 30 minutes of inactivity. If your visitor manually closes the chat widget, it will prevent the widget from re-opening on subsequent page loads in that browser session for 30 minutes. It contains a boolean value of True if present.	30 Mins	Third party
__hsmem	HubSpot	This cookie is set when visitors log in to a HubSpot-hosted site. It contains encrypted data that identifies the membership user when they are currently logged in.	7 Days	Third party
starlight	www.cgi.com	In specific parts of our website, we have disclaimer flags with buttons to accept or reject the disclaimer. If user accepts the disclaimer, then this cookie stores the information about the fact that a visitor has accepted the disclaimer	365 Days	First party
player	Vimeo	This cookie saves your settings before you play an embedded Vimeo video. This means that the next time you watch a Vimeo video, you will get your preferred settings back.	365 Days	Third party
hs_ab_test	HubSpot	This cookie is used to consistently serve visitors the same version of an A/B test page they’ve seen before. It contains the id of the A/B test page and the id of the variation that was chosen for the visitor.	Session	Third party
_cfuvid	CloudFlare	This cookie is a part of the services provided by Cloudflare - Including load-balancing, deliverance of website content and serving DNS connection for website operators	Session	Third party
lang	LinkedIn	This domain is owned by LinkedIn, the business networking platform. It typically acts as a third party host where website owners have placed one of its content sharing buttons in their pages, although its content and services can be embedded in other ways. Although such buttons add functionality to the website they are on, cookies are set regardless of whether or not the visitor has an active LinkedIn profile, or agreed to their terms and conditions. For this reason it is classified as a primarily tracking/targeting domain.	Session	Third party
WFESessionId	Microsoft Azure	This cookie is necessary to enable a Power BI session, a Microsoft tool that helps visualize data.	Session	Third party
YSC	Youtube	YouTube is a Google owned platform for hosting and sharing videos. YouTube collects user data through videos embedded in websites, which is aggregated with profile data from other Google services in order to display targeted advertising to web visitors across a broad range of their own and other websites.	Session	Third party
<id>_key	HubSpot	When visiting a password-protected page, this cookie is set so future visits to the page from the same browser do not require login again. The cookie name is unique for each password-protected page. It contains an encrypted version of the password so future visits to the page will not require the password again.	14 Days	Third party
s_cc	Adobe Analytics	Used to determine if cookies are enabled for Adobe Analytics	Session	Third party
__hs_opt_out	HubSpot	This cookie is used by the opt-in privacy policy to remember not to ask the visitor to accept cookies again. This cookie is set when you give visitors the choice to opt out of cookies. It contains the string "yes" or "no".	6 Months	Third party
__hs_do_not_track	HubSpot	This cookie can be set to prevent the tracking code from sending any information to HubSpot. It contains the string "yes".	6 Months	Third party
_GRECAPTCHA	Google reCAPTCHA	This cookie is set by Google reCAPTCHA, which protects our site against spam enquiries on contact forms.	6 Months	Third party
hs_langswitcher_choice	HubSpot	This cookie is used to save a visitor’s selected language choice when viewing pages in multiple languages. It is set when a visitor selects a language from the language switcher and is used as a language preference to redirect them to sites in their chosen language in the future if they are available. It contains a colon delimited string with the ISO639 language code choice on the left and the top level private domain it applies to on the right. An example will be "EN-US:hubspot.com".	2 Years	Third party
cookieValue	www.cgi.com	This cookie used for the disclaimer acceptance flag.	1 Day	First party
__hs_cookie_cat_pref	HubSpot	This cookie is used to record the categories a visitor consented to. It contains data on the consented categories.	6 Months	Third party
ARRAffinitySameSite	Microsoft Azure	ARRAffinitySameSite is for Azure Web Sites for load balancing our application. This cookie is used to distribute traffic to the website on several servers in order to optimize response times.	Session	Third party
__hs_initial_opt_in	HubSpot	This cookie is used to prevent the banner from always displaying when visitors are browsing in strict mode. It contains the string "yes" or "no".	7 Days	Third party
language	www.cgi.com	This cookie remembers your preferred language based on your previous selections, allowing the website to present content in your chosen language without you having to manually select it each time you visit.	365 Days	First party
VISITOR_INFO1_LIVE	Youtube	This cookie is used as a unique identifier to track viewing of videos	365 Days	Third party
cookie-agreed-categories	www.cgi.com	Stores the user's cookie consent category states for the current domain	23 Days	First party
hs-messages-hide-welcome-message	HubSpot	This cookie is used to prevent the chat widget welcome message from appearing again for one day after it is dismissed. It contains a boolean value of True or False.	1 Day	Third party

Name	Provider	Purpose	Expiry	Type
_tr	Meta	This cookie is used to track your interactions with ads that are powered by Meta. It is stored for 30 days.	30 Days	Third party
tiktok_ads_id	Tiktok	This cookie is used to track your interactions with ads that are powered by TikTok. It is stored for 13 months.	13 Months	Third party
VID	LinkedIn	A visitor-related identifier for a LinkedIn microsite used to determine conversions for lead gen purposes.	1 Year	Third party
li_c_user	LinkedIn	This cookie is used to track your activity on websites that have the LinkedIn Pixel installed. It is stored for 1 year.	1 Year	Third party
_ga	Google Analytics	This cookie enables Google Analytics to distinguish one visitor from another in order to generate statistical website usage data. Each ‘_ga’ cookie is unique to the specific property, so it cannot be used to track a given user or browser across unrelated websites.	365 Days	First and third party
ms_ta*	Bing	These cookies are used to track your interactions with ads that are powered by Bing. They are stored for 1 year.	1 Year	Third party
__utmb	Google Analytics	This is one of the four main cookies set by the Google Analytics service which enables website owners to track visitor behaviour and measure site performance. This cookie determines new sessions and visits and expires after 30 minutes. The cookie is updated every time data is sent to Google Analytics. Any activity by a user within the 30 minute life span will count as a single visit, even if the user leaves and then returns to the site. A return after 30 minutes will count as a new visit, but a returning visitor.	365 Days	First party
s_ppv	Adobe	Used by Adobe Analytics to retain and fetch what percentage of a page was viewed	Session	Third party
tiktok_pixel	Tiktok	This cookie is used to track your activity on websites that have the TikTok Pixel installed. It is stored for 13 months.	13 Months	Third party
__hssc	HubSpot	This cookie keeps track of sessions. This is used to determine if HubSpot should increment the session number and timestamps in the __hstc cookie. It contains the domain, viewCount (increments each pageView in a session), and session start timestamp.	30 Minutes	First party
gpy_pn	Adobe	Used to store and retrieve the previous page in Adobe Analytics.	6 Months	Third party
__utmc	Google Analytics	This is one of the four main cookies set by the Google Analytics service which enables website owners to track visitor behaviour and measure site performance. It is not used in most sites but is set to enable interoperability with the older version of Google Analytics code known as Urchin. In this older versions this was used in combination with the __utmb cookie to identify new sessions/visits for returning visitors. When used by Google Analytics this is always a Session cookie which is destroyed when the user closes their browser. Where it is seen as a Persistent cookie it is therefore likely to be a different technology setting the cookie.	365 Days	First party
s_tslv	Adobe	Used to retain and fetch time since the last visit in Adobe Analytics	6 Months	Third party
_ga_LC0YVRL587	Google Analytics	This is a pattern-type cookie set by Google Analytics, where the name element contains the unique identifier of the account or website to which it is associated. Used to store and count pageviews.	1 Year	First party
_gat	Google Analytics	This cookie name is associated with Google Universal Analytics, according to documentation it is used to throttle the request rate - limiting the collection of data on high traffic sites. It expires after 1 minute.	365 Days	First party
fr	Facebook	Contains browser and user unique ID combination, used for targeted advertising.	365 Days	Third party
sc_hit	SnapChat	This cookie is used to track your activity on websites that have the Snapchat Pixel installed. It is stored for 1 year.	13 Months	Third party
mf_[website-id]	Mouseflow	1st party cookie, session lifetime: A cookie for identifying the current session on a website	Session	First party
simpli.fi_visit	Simpli.fi	This cookie is used to track your visits to websites that have the Simpli.fi Pixel installed. It is stored for 30 days.	30 Days	Third party
s_pltp	Adobe	Provides page name value (URL) for use by Adobe Analytics	Session	Third party
s_tp	Adobe	Tracks percent of page viewed	2 Years	Third party
mf_user	Mouseflow	This cookie establishes whether the user is a returning or first-time visitor. This is done simply by a yes/no toggle and no further information about the user is stored. This cookie has a lifetime of 90 days.	3 Months	First party
__utmz	Google Analytics	This is one of the four main cookies set by the Google Analytics service which enables website owners to track visitor behaviour measure of site performance. This cookie identifies the source of traffic to the site - so Google Analytics can tell site owners where visitors came from when arriving on the site. The cookie has a life span of 6 months and is updated every time data is sent to Google Analytics.	365 Days	First party
s_plt	Adobe	Tracks the time that the previous page took to load	Session	Third party
vuid	Vimeo	This domain is owned by Vimeo. The main business activities are: Video Hosting / Sharing	365 Days	Third party
_fbp	Meta	This cookie is used to track your activity on websites that have the Meta Pixel installed. It is stored for 30 days.	30 Days	Third party
_gid	Google Analytics	Registers a unique ID that is used to generate statistical data on how the visitor uses the website. This cookie expires after 1 day.	1 Day	Third party
_hjid	Hotjar	This is a pattern-type cookie set by Google Analytics, where the name element contains the unique identifier of the account or website to which it is associated. It is a variation of the _gat cookie that is used to limit the amount of data that Google stores on high-traffic websites.	365 Days	First party
simpli.fi_id	Simpli.fi	This cookie is used to track your activity on websites that have the Simpli.fi Pixel installed. It is stored for 30 days.	30 Days	Third party
gpv_pn	Adobe	This cookie gathers data for analyzing the visitor's use of the website including activity tracking, page visits and links clicked	2 Hours	Third party
_gat_UA-114077998-1	Google Analytics	This is a pattern-type cookie set by Google Analytics, where the name element contains the unique identifier of the account or website to which it is associated. It is a variation of the _gat cookie that is used to limit the amount of data that Google stores on high-traffic websites.	365 Days	First party
appcast_job_ad	Appcast	This cookie is used to track your interactions with job ads that are powered by Appcast. It is stored for 30 days.	30 Days	Third party
appcast_visitor	Appcast	This cookie is used to track your visits to the Appcast website. It is stored for 30 days.	30 Days	Third party
li_cs	LinkedIn	This cookie is used to track your interactions with ads that are powered by LinkedIn. It is stored for 1 year.	1 Year	Third party
RT	Boomerang	It measures page load time, or other timers associated with the page.	365 Days	First party
_ga	HubSpot	This cookie records a unique identification which is used to generate statistical data about how the visitor uses the Website.	Session	First party
AnalyticsSyncHistory	LinkedIn	Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries	30 Days	Third party
ai_session	Microsoft Azure	This cookie name is associated with the Microsoft Application Insights software, which collects statistical usage and telemetry information for apps built on the Azure cloud platform. This is a unique anonymous session identifier cookie. The main purpose of this cookie is: Performance	Session	First party
__utma	Google Analytics	This is one of the four main cookies set by the Google Analytics service which enables website owners to track visitor behaviour and measure site performance. This cookie lasts for 2 years by default and distinguishes between users and sessions. It it used to calculate new and returning visitor statistics. The cookie is updated every time data is sent to Google Analytics. The lifespan of the cookie can be customised by website owners.	365 Days	First party
__hstc	HubSpot	The main cookie for tracking visitors. It contains the domain, utk, initial timestamp (first visit), last timestamp (last visit), current timestamp (this visit), and session number (increments for each subsequent session).	6 Months	First party
ai_user	Microsoft Azure	This cookie name is associated with the Microsoft Application Insights software, which collects statistical usage and telemetry information for apps built on the Azure cloud platform. This is a unique user identifier cookie enabling counting of the number of users accessing the application over time. The main purpose of this cookie is: Performance	1 Year	First party
_gclxxxx	Google Analytics	This is the Google conversion tracking cookie. It allows to count visits and traffic sources, so we can measure and improve the performance of our site.	365 Days	First party
aam_uuid	Adobe	Set for ID sync for Adobe Audience Manager	30 Days	Third party
_gid	Google Analytics	This cookie name is associated with Google Universal Analytics. This appears to be a new cookie and as of Spring 2017 no information is available from Google. It appears to store and update a unique value for each page visited.	365 Days	First and third party
ms_u*	Bing	These cookies are used to track your activity on websites that have the Bing Pixel installed. They are stored for 1 year.	1 Year	Third party
lms_analytics	LinkedIn	Used to identify LinkedIn Members in the Designated Countries for analytics	30 Days	Third party
__utmt	Google Analytics	This cookie is set by Google Analytics. According to their documentation it is used to throttle the request rate for the service - limiting the collection of data on high traffic sites. It expires after 10 minutes	365 Days	First party
appcast_session	Appcast	This cookie is used to track your session on the Appcast website. It is deleted when you close your browser.	30 Days	Third party
__hssrc	HubSpot	Whenever HubSpot changes the session cookie, this cookie is also set to determine if the visitor has restarted their browser. If this cookie does not exist when HubSpot manages cookies, it is considered a new session. It contains the value "1" when present.	Session	First party
hubspotutk	HubSpot	This cookie enables us to deliver the service and or response that individuals needs and expects from us, in a seamless manner	6 Months	First party
mf_user	Mouseflow	1st party cookie, persistent: A cookie for checking if the user is new or returning	90 Days	First party
s_ips	Adobe	Tracks percent of page viewed	Session	Third party
__hjSessionUser_204526	HubSpot	Hotjar cookie that is set when a user first lands on a page with the Hotjar script. It is used to persist the Hotjar User ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.	Session	First party
_gat_UA-nnnnnnn-nn	Google Analytics	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.	365 Days	First party
_gat_UA-399437-1	Google Analytics	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.	365 Days	First party
sc_gpt	SnapChat	This cookie is used to track your interactions with ads that are powered by Snapchat. It is stored for 1 year.	13 Months	Third party

Name	Provider	Purpose	Expiry	Type
sp_t	Spotify	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.	364 Days	Third party
bcookie	LinkedIn	Browser Identifier cookie to uniquely identify devices accessing LinkedIn to detect abuse on the platform.	365 Days	Third party
dpm	Adobe marketing cloud	The cookie is used for targeted advertising and marketing. Domain is owned by Adobe Audience Manager.	365 Days	Third party
NID	Google Ads Optimization	This is a Google cookie that allows a company, such as CGI, to target advertising to users who have signed out of their service. A cookie allows CGI to show you useful content on Google services. By accepting marketing cookies, you authorize Google to process your information. You can also influence your own information or withdraw your consent in the Google services settings or by modifying your cookie settings from this cookie manager (link at the bottom of the page). Learn more: https://policies.google.com/technologies/cookies	365 Days	Third party
lissc	LinkedIn	Used by the social networking service LinkedIn for tracking the use of embedded services.	365 Days	Third party
UserMatchHistory	LinkedIn	This domain is owned by LinkedIn, the business networking platform. It typically acts as a third party host where website owners have placed one of its content sharing buttons in their pages, although its content and services can be embedded in other ways. Although such buttons add functionality to the website they are on, cookies are set regardless of whether or not the visitor has an active LinkedIn profile, or agreed to their terms and conditions. For this reason it is classified as a primarily tracking/targeting domain.	365 Days	Third party
_gcl_au	HubSpot	Google Adsense to store and track conversions.	89 Days	Third party
c	Cision	This domain is owned by IPONWEB and is used to provide a real time bidding platform for online advertising.	184 Days	Third party
_gcl_au	Google Adsense	Used through Google Analytics to understand user interaction with the site and advertising	3 Months	Third party
_fbp	Facebook	Used by Facebook to deliver a series of advertisement products such as real time bidding from third party advertisers	365 Days	Third party
IDE	Google DoubleClick	This cookie, used by Google DoubleClick, helps measure the effectiveness of ads and delivers targeted ads to users. It tracks actions after viewing or clicking an ad to improve user experience, with a focus on ad preferences.	2 Years	Third party
lidc	LinkedIn	This domain is owned by LinkedIn, the business networking platform. This cookies is used to facilitate data center selection.	365 Days	Third party
ln_or	LinkedIn	Used to determine if Oribi analytics can be carried out on a specific domain	1 Day	Third party
uuid	MediaMath	MediaMath uses cookies to help recognize a computer or device so that they can deliver relevant advertising to you, measure the impact of that advertising and better understand and recognize digital media usage patterns.	13 Months	Third party
_cc_aud	Lotame	We use this cookie to target advertising that is appropriate for you through the Adform service. This domain is owned by Lotame. The cookie can be used to collect the following information: Cookie ID, Mobile Advertising ID, Partner ID, browser and device information, IP address and analytics information about the functionality of advertising. By accepting cookies, you allow Adform and Lotame to process your cookie information. You can influence the processing of your information by contacting dpo@adform.com or modifying your cookie settings.	365 Days	Third party
AMCVS_*	Adobe experience cloud	Indicates the start of a session for Adobe Experience Cloud	Session	Third party
c_user	Facebook	The c_user cookie contains the user ID of the currently logged in user. The lifetime of this cookie is dependent on the status of the ‘keep me logged in’ checkbox. If the ‘keep me logged in’ checkbox is set, the cookie expires after 90 days of inactivity. If the ‘keep me logged in’ checkbox is not set, the cookie is a session cookie and will therefore be cleared when the browser exits.	3 Months	Third party
bscookie	LinkedIn	This cookie is used for remembering that a logged in user is verified by two factor authentication and has previously logged in.	365 Days	Third party
_fbp	HubSpot	Used by Facebook to deliver a series of advertisement products such as real time bidding from third party advertisers	3 Months	Third party
lms_ads	LinkedIn	Used to identify LinkedIn Members off LinkedIn in the Designated Countries for advertising	30 Days	Third party
AMCV_*	Adobe experience cloud	Unique Identifier for Adobe Experience Cloud	180 Days	Third party
datr	Facebook	The purpose of the Datr cookie is to identify the web browser used to connect to Facebook, regardless of the logged in user. This cookie plays a key role in Facebook's security and site integrity functions.	2 Years	Third party
sb	Facebook	Facebook – Allows Facebook to recover your account in the event that you forget your password, or to require additional authentication if you tell us that your account has been hacked.”sb” and “dbln” cookies enable Facebook to identify your browser securely.	2 Years	Third party
RUL	Google DoubleClick	Used by Google DoubleClick to determine whether website advertisement has been properly displayed.	1 Year	Third party
personalization_id	X	This cookie is set due to X integration and sharing capabilities for the social media.	2 Years	Third party
GPS	Youtube	YouTube is a Google owned platform for hosting and sharing videos. YouTube collects user data through videos embedded in websites, which is aggregated with profile data from other Google services in order to display targeted advertising to web visitors across a broad range of their own and other websites.	365 Days	Third party
liap	LinkedIn	This cookie locates LinkedIn functionalities in the page and share the Website information on social networks.	1 Year	Third party
demdex	Adobe marketing cloud	This cookie helps Adobe Audience Manger perform basic functions such as visitor identification, ID synchronization, segmentation, modeling and reporting	365 Days	Third party
A3	Yahoo	This domain is owned by Yahoo, whose principal business is Search and Advertising Services.	365 Days	Third party
_gcl_aw	Google Adsense	to provide ad delivery or retargeting.	90 Days	Third party
sp_landing	Spotify	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.	23 Days	Third party
li_sugr	LinkedIn	Used to make a probabilistic match of a user's identity outside the Designated Countries	90 Days	Third party
_kuid_	Salesforce.com	We use this cookie to target advertising that is appropriate for you online. This domain is owned by the Krux Digital.	365 Days	Third party
xs	Facebook	Session cookies are c_user and xs. c_user stores the username and the xs session secret, these two cookies together determine whether the user is logged in or not.	3 Months	Third party
_guid	LinkedIn	Used to identify a LinkedIn Member for advertising through Google Ads	90 Days	Third party

Machine Learning (ML) and Data Science has received a lot of press over the last 5-10 years. Algorithms with origins in the 1940’s and 1950’s have now become very effective due to exponential increases in data availability and processing power. Some companies have been able to leverage these advances to achieve fantastic success, while many others have struggled to emulate that success and get machine learning to work for them. So what is causing the hold-up for this much-hyped technology?

Most of the companies that have been able to successfully leverage ML are digital native companies; their core products are computer programs. This means that collecting and storing large amounts of accurate data and experimenting in rigidly defined environments (like a website) are second nature. More often than not, companies that are not digital natives are attempting to adopt ML before having an ecosystem in place that can support it.

Here are some common issues holding ML projects back:

ML FOMO

If companies aren’t doing ML, they want to be doing it, right? It is sometimes presumed to be the secret sauce that can magically accomplish anything, and that hype can work against prospective projects. ML is immensely effective in the right scenarios, and establishing upfront with a data scientist whether your project is one of those is a very important and surprisingly overlooked step. I’ve seen many ML projects launched prior to anyone qualified in it being consulted and data scientists being hired to solve something with ML without an understanding of whether it’s applicable to the problem. Although exploring a new technology is valuable and important, poorly considered ML FOMO or blindly forging ahead under the banner of Agile leads to the cart being put before the horse.

Solution: A data scientist should be part of the scoping conversation as early as possible to identify if there is an opportunity for ML to make an impact. If a project sounds like a good candidate for automation, a data scientist can tell you what the best way to solve that problem is. If the data scientist is saying it’s not feasible, it’s worth addressing why that is, or looking for an alternative solution.

Data Quality

As is often said: ‘garbage in, garbage out’. Having big data isn’t enough; it has to be good data for an ML algorithm to be effective. If large amounts of data are missing or incorrect, performance of any ML algorithm will suffer. Collecting data and getting data into a usable state is a vastly underestimated process, both by data scientists and by project leaders. Between an initial point of contact for an ML project and developing a production model, several months can go by before the Extract-Transform-Load (ETL) is functioning as needed for a production level model to work. If the data is discovered to be poor quality at this point, lots of energy has already been wasted.

Solution: Have a data quality assessment done after a successful POC, but before going to development. This could suggest what the performance limits are given the data quality, and highlight the opportunity to improve the quality of that data collection and storage. At the very least, some comment should be made about what risks the data quality presents before development.

Loss Aversion

There are a few reasons why ML triggers loss aversion for stakeholders.

Whatever process your project might be aiming to enhance or replace, its error will be on full display. This is intentional, as any good data scientist wants as accurate an appraisal of their model performance as possible; that error will not be 0 as machine learning models are probabilistic. An initial model might be wrong 15% of the time, 20% of the time, and accepting that many mistakes is a hard sell, even if it could be a usable model.

However, just because you can’t see the mistakes happening in your current process so explicitly, it doesn’t mean they don’t exist. Machines are expected to be wholly reliable, while humans are not, perhaps because dealing with a failure you can’t foresee is easier than accepting a guaranteed amount of failures, leading to slow adoption of less-than-perfect models.

Solution: Allow for your existing process to be quantified as part of your project and as a benchmark for machine learning to beat. Agreeing on metrics and KPIs will give everyone involved a clear vision of what is trying to be achieved. This will give all stakeholders the most confidence that an algorithm is competitive. If it really isn’t possible to say what success looks like up-front, the only recourse is real-world experimentation to collect more data. This might be expensive but it’s better than guesswork.

Conclusion

In summary, scoping projects carefully to see if they have the elements in place for success is the best approach for ML projects. Consult early, check the data, agree the metrics, understand the end user. However, even if all the necessary precautions are taken, the end goal can’t be achieved. But doing your due diligence can certainly help dodge the pitfalls.

If you’d like a conversation around your current use of ML please don’t hesitate to get in touch with me or visit our Artificial Intelligence page.

About this author

Chris Annone

AI & Automation Practice Lead

Chris Annone is Senior Consultant and Practice Lead in CGI UK’s London Metro Business Unit. He leads a team of AI and Automation consultants across multiple verticals including Local Government, Education, Transport, Health and Housing. ...

View profile

CGI Advisory Services

CGI in the UK – Doing Complex Things Well

2020 CGI Client Global Insights Summary

A game changer for net zero: Climate-related Financial Disclosures

The joy of tidying up: Decluttering local authority IT

Moving towards an innovative openSERVICE approach

Dream inspired. Client driven.

Chris Annone

AI & Automation Practice Lead

ML FOMO

Data Quality

Loss Aversion

Conclusion

About this author

Chris Annone

AI & Automation Practice Lead

Insights you can act on

Company

Resource centre

Support

Follow us

CGI Advisory Services

CGI in the UK – Doing Complex Things Well

2020 CGI Client Global Insights Summary

A game changer for net zero: Climate-related Financial Disclosures

The joy of tidying up: Decluttering local authority IT

Moving towards an innovative openSERVICE approach

Dream inspired. Client driven.

Chris Annone

AI & Automation Practice Lead

ML FOMO

Data Quality

Loss Aversion

Conclusion

Share this

About this author

Chris Annone

AI & Automation Practice Lead

Related media

How SovereignOps is securing the future of the UK’s Critical National Infrastructure

Serving up more efficient knowledge management for Food Forensics

Quantum computing: Is it a conversation leaders should have now?

From AI to ROI: Insights on using AI for good (part 2 of 2)

Discover more about CGI

Keeping you informed