Hitting the balance between quality and cost in data labelling

 

How do we achieve high quality at minimal cost?

 

That is a question every project manager knows well. Companies and teams working in artificial intelligence and machine learning face it over and over again. Usually, the cost of a software product is worth the investment because, as an intangible good, it scales almost without limit. However, one thing makes AI development extremely resource-intensive and, unfortunately, it is the foundation of every machine learning model:

Labelled Datasets are a Precondition

It is commonly said that an algorithm can only be as good as the datasets on which it is trained. How much data is needed depends on the intended use, which is wise to keep in mind when setting up a road map for AI development: a computer vision system that distinguishes ripe tomatoes from unripe ones will need less training than a system that aims to enable automated driving. “If you talk about autonomous driving, one hour of video data can lead up to 800 man-hours of work,” says Siddarth Mall, CEO at Playment, an annotation platform. Knowing this, we can safely assume that annotation takes up a large share of the human capital dedicated to an AI project.
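To put the quoted figure in perspective, here is a rough back-of-the-envelope sketch in Python. It treats the 800 man-hours per hour of video as an upper bound, as in the quote. The 40-hour work week and the function name are assumptions made purely for illustration.

```python
# Upper bound taken from the quote above: man-hours of annotation per hour of video.
MAN_HOURS_PER_VIDEO_HOUR = 800


def annotation_effort(video_hours, hours_per_week=40):
    """Return (man_hours, person_weeks) for annotating `video_hours` of footage.

    hours_per_week is an assumed full-time work week, not a figure from the article.
    """
    man_hours = video_hours * MAN_HOURS_PER_VIDEO_HOUR
    person_weeks = man_hours / hours_per_week
    return man_hours, person_weeks


man_hours, person_weeks = annotation_effort(video_hours=1)
print(f"{man_hours} man-hours, about {person_weeks:.0f} person-weeks")
```

Under these assumptions, a single hour of video can tie up one full-time annotator for up to 20 weeks.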

Let’s take an example here: What if the valuable people who currently collect, segment and annotate huge datasets went back to the core responsibilities they were hired for and are expert in? It has long been overlooked that the most efficient way of allocating human capital in AI development lies not in this old practice but in outsourcing the annotation process to experts in the market. This way, software developers can return to their core skills and deliver the best results.

Data Annotation Table

Which leads us to an important conclusion:

Leave data annotation to the experts and channel your valuable in-house skills towards the objectives of your business. However, from this stems another concern that AI teams and project managers face: “How can we leave this project that demands an extraordinary amount of expertise in our field to an external workforce?” What you need is not just a dedicated team with sufficient knowledge of the model's application case (can we distinguish a skin tumor from a mole?) but one that is specialized in all the different kinds of image annotation.

Images are not the only data that can be annotated: we are looking at a broad range of data, including LIDAR/RADAR scans, video, and even audio and text. The annotation process that can be applied differs by data type. To give you a concise overview, we have created a short list of annotation techniques that can be used on certain data types.

As you can see, markup techniques vary widely depending on the data type. However, it is difficult to find teams that can work on all data types with high-quality results. The problem is twofold: either teams have specialized in a specific subject within the annotation landscape, or they are capable of doing a bit of everything. In the latter case, quality suffers from the broader focus. Teams that manage to be both broad in scope and high in quality tend to become extremely pricey, up to the point where the operation can no longer be financed.

Borek Solutions (BS), however, has taken a different approach. Unlike other annotation services, BS builds teams designed to handle a broad range of image annotation projects. Experts in several annotation types are brought together so that they form a unit with broad expertise. What differentiates this modern data labelling workforce from a conventional one is that the teams are created with a long-term cooperative vision. The focus remains on economies of scale in order to achieve declining marginal cost after an initial investment.
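The effect of an initial investment on unit cost can be illustrated with a minimal sketch. Note that it models a simplification: a fixed investment spread over a growing number of annotations, which makes the average cost per annotation fall towards the variable cost. The function name and all figures are hypothetical and are not taken from any real project.

```python
def cost_per_annotation(n_annotations, initial_investment, variable_cost):
    """Average cost per annotation when a fixed initial investment
    is spread over n_annotations, plus a constant variable cost each."""
    if n_annotations <= 0:
        raise ValueError("n_annotations must be positive")
    return initial_investment / n_annotations + variable_cost


# Hypothetical figures, for illustration only.
for n in (1_000, 10_000, 100_000):
    average = cost_per_annotation(n, initial_investment=5_000, variable_cost=0.05)
    print(f"{n:>7} annotations: {average:.2f} per annotation")
```

The larger the volume, the smaller the share of the initial investment carried by each annotation.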

Cost per annotation

With the vision of pristine services for our clients, BS is able to offer time and cost efficiency at the same time. Our track record shows that Borek Solutions’ clients not only saved 15 to 25% of their time by engaging with us, but also achieved savings of 30 to 50% on the projects that our teams performed for them. Besides this, our clients save on fixed labor cost and overhead expenses, which can be channelled towards the value-adding objectives of the business.

Terms and Conditions

Last updated (2.11.2022)

Imprint

Borek IT Sourcing Pvt. Ltd.

Registered Office Address
306-311 Gajanand Complex, Opp. Tube Company, Old Padra Road, Vadodara – 390010, India

Management
Kushal Rao, Konstantin Borek
Contact
E-Mail: connect@boreksolutions.com
Phone: +91 96011-02487

Corporate Identity Number
U72200GJ2014PTC078952

PRIVACY POLICY
The protection of your privacy is important to us. Borek IT Sourcing web activities comply with all applicable laws to protect personal information and guarantee data security. This privacy protection statement tells you how Borek treats the information generated during your visit to any or all of the Borek IT Sourcing websites (www.borek-it-sourcing.com, www.boreksolutions.com).

COLLECTION AND PROCESSING OF PERSONAL DATA
Personal data is information that identifies you, such as your name, e-mail address or postal address. Borek IT Sourcing does not collect personal data from you except when you specifically provide it, for example when ordering information or subscribing to newsletters.

USE AND DIVULGENCE OF PERSONAL DATA
Borek IT Sourcing will use your personal data exclusively for purposes of technical web site administration, to give you access to special information or for general communication with you. Borek IT Sourcing will neither sell your personal data to third parties nor market it elsewhere. The employees of Borek IT Sourcing are duty-bound to respect the confidentiality of your data and abide by Borek IT Sourcing’s Codes of Conduct.

COOKIES
This website uses Google Analytics, a web analytics service provided by Google, Inc. (“Google”). Google Analytics uses “cookies”, which are text files placed on your computer, to help the website analyze how users use the site. The information generated by the cookie about your use of the website (including your IP address) will be transmitted to and stored by Google on servers in the United States. However, if IP anonymization is activated on this website, Google will first shorten your IP address within EU member states or in other states that are signatories to the European Economic Area agreement. The full IP address is only transferred to a Google server in the US and shortened there in exceptional cases. Google will use this information for the purpose of evaluating your use of the website, compiling reports on website activity for website operators and providing other services relating to website activity and internet usage. Google will not associate your IP address with any other data held by Google. You may refuse the use of cookies by selecting the appropriate settings on your browser, however please note that if you do this you may not be able to use the full functionality of this website. Furthermore, you can prevent Google from capturing and processing the data generated by the cookie about your use of the website (including your IP address) by downloading and installing the browser plug-in available at the following link: tools.google.com/dlpage/gaoptout

RIGHT TO INFORMATION
If you have any questions about the processing of your personal data, please contact our Data Protection Supervisor by e-mail: connect@boreksolutions.com. Upon request, you will receive written notice, in accordance with applicable law, as to whether Borek IT Sourcing has stored any of your personal data and, if so, which data has been stored using the web technology we employ. If these data protection guidelines are revised, such revisions will be noted in these guidelines, on our homepage and in other appropriate venues.

FREEDOM OF CHOICE
You control the information you provide to Borek IT Sourcing about yourself. However, if you choose not to share your information with Borek IT Sourcing, please be aware that you may be unable to access some areas of the Web site. If your personal information changes (e.g. zip code, e-mail, postal address), please e-mail the changes to Borek IT Sourcing in order to correct or update your personal data (our address: connect@boreksolutions.com).

AUTOMATICALLY RECORDED INFORMATION (NON-PERSONAL DATA)
When you access the Borek IT Sourcing website, general non-personal information (Internet browser used, number of visits, average time spent on site, pages viewed) is recorded automatically (not as part of registration). This information is used to gauge our Web site’s appeal and to improve its content and functionality. Your data is not processed any further nor is it transmitted to third parties.

SECURITY
Borek IT Sourcing takes great care to ensure the security of personal data. Your data is conscientiously protected from loss, destruction, distortion/falsification, manipulation and unauthorized access or unauthorized disclosure.

MINORS
Borek IT Sourcing strongly advises all parents and guardians to teach their children safe and responsible handling of personal data on the Internet. Minors should not transmit any personal data to Borek IT Sourcing websites without the permission of their parents or guardians. Borek IT Sourcing will never knowingly collect personal data from minors or use it in any way or disclose it to third parties without permission.

COPYRIGHT
All rights reserved. All content (text, pictures, graphics, audio, video and animation data as well as their arrangement, among others) on the website of Borek IT Sourcing is protected under copyright law and other protective legislation. The legal protection also applies to databases and similar devices. The public content of this website is available only for its intended use on the internet and may not be used otherwise without prior written permission of Borek IT Sourcing. Furthermore, several areas of the Borek IT Sourcing website contain images that are copyrighted by third parties. Unless otherwise specified, all trademarks on the Borek IT Sourcing website are protected by trademark law.

GUARANTEE / LIABILITY
All information on this website has been carefully examined. We endeavor to expand and update this range of information on an ongoing basis. However, we shall not accept any liability or provide any guarantee for its accuracy and completeness. Borek IT Sourcing makes this information available without any sort of explicit or implicit promise or guarantee. Borek IT Sourcing rules out any liability for damages incurred directly or indirectly by using this Web site, insofar as they are not attributable to intent or gross negligence on the part of Borek IT Sourcing. Site development: This Web site was developed by Borek IT Sourcing.