• No results found

Risk-based labour inspection by means of machine learning

N/A
N/A
Protected

Academic year: 2022

Share "Risk-based labour inspection by means of machine learning"

Copied!
17
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

RISK-BASED LABOUR

INSPECTION BY MEANS OF MACHINE LEARNING

Øyvind Dahl, SINTEF

(2)

Background

• Norwegian Labour Inspection Authority (NLIA)

• Selecting objects (enterprises) for inspection is a fundamental part of the inspection process

• The selection shall be risk-based

I.e. select objects on basis of where the assumed risk is highest (injuries, harmful exp. etc)

• The NLIA has 230.000 potential objects for inspection

• 15.000 inspections per year

• How can the NLIA identify the high-risk enterprises?

(3)

The challenge

• Develop a tool which can function as an aid to the inspectors' selection of object for inspection

• The tool must be capable of differentiating between different groups of enterprises, based on risk

(4)

Operationalizing the challenge

• The model behind the tool must be based on available data that are not biased in any way

• Risk was defined as non-compliance with legal requirements

• Thus enterprises with the highest risk will have more regulatory breaches per checkpoint than enterprises with the lowest risk

(5)

Operationalizing the challenge

• Utilized structured data collected during inspections (N≈ 40,000)

• Non-compliance was operationalized as follows:

≥ median value with regard to number of regulatory breaches per checkpoint and/or

≥ 1 shutdown of operation, and/or

≥ 1 coercive fine, and/or

≥ 1 penalty, and/or

≥ 1 report to the police

Inspections with these characteristics were coded 1, else 0

(6)

Operationalized research question

• Roughly 45% of the enterprises were assigned value 1

• Is it possible to predict, with a low margin of error, which enterprises that belong to group 1?

This means: do enterprises which belong to group 1 have specific characteristics?

• To test this we used logistic regression analysis

• If the results of the analysis shows that it is possible to predict which enterprises that belong to group 1, then the model can be used to tell inspectors which enterprises they should prioritize by

generalizing the results to all 230.000 enterprises

(7)

Inspection results Characteristics (predictors)

Logistic regression analysis

Type of industry

Number of employees

Age of enterprise

Notifications of illness

Safety and Health Complaints

Accidents

VAT register

Employee register

Results previous inspections

Economic variables

Etc.

0 1

Probability

100%

0%

(8)

Inspection results

Logistic regression analysis

0 1

Probability

100%

Lowest probability of non-compliance (15%) 0%

Highest probability of non-compliance (15%) High probability of non-compliance (35%) Low probability of non-compliance (35%)

(9)

Assessing predictive validity

High compliance Low compliance Low compliance High compliance

Does the model predict correct?

Prediction % correct

91%

69%

46%

75%

(10)

Assessing predictive validity

Inspections: are there any differences between the groups?

Breaches per checkpoint

A B C D

0.15

0.24

0.33

0.48

(11)

Assessing predictive validity

Inspections: are there any differences between the groups?

Formal orders per checkpoint

0.1

0.16

0.23

0.34

(12)

Assessing predictive validity

Inspections: are there any differences between the groups?

Coercive fines per inspection

A B C D

0.04

0.14

0.24

0.42

(13)

Assessing predictive validity

Inspections: are there any differences between the groups?

Shutdown of operations

0.023

0.055

(14)

Conclusion

• In addition to assessing predictive validity (ANOVA), other tests were run as well. E.g. Hosmer-Lemeshov, area under curve (AUC)

• Conclusion 1: all tests indicate high predictive validity and high model fit

40,000 ---> 230,000

(15)
(16)
(17)

Teknologi for et bedre samfunn

Referanser

RELATERTE DOKUMENTER

3 The definition of total defence reads: “The modernised total defence concept encompasses mutual support and cooperation between the Norwegian Armed Forces and civil society in

By use of established damage criteria from the literature, it can safely be concluded that detonation of 10 kg TNT under the flail will not injure the operator, provided that the

This report documents the experiences and lessons from the deployment of operational analysts to Afghanistan with the Norwegian Armed Forces, with regard to the concept, the main

In the analysis of flow around an acoustic antenna, various tensors appear, for example the strain rate tensor, structural tensors and tensorial expressions involved in the

The goal with this thesis is to explore machine learning and the possibilities of using machine learning to create a model that can predict the effect of

previously to a drop in oil prices. This vulnerability is determined by the extent to which enterprises will be able to shift their activities to non-petroleum-related markets

The offshore working environment is said to be unique in that it combines the risk and dangers of major industrial enterprises with those specific to the oil and gas extraction

Our approach is based on a texture atlas, which makes it possible to store the global illumination in a set of textures, that can be mapped in real-time onto the model by the