{"id":2708,"date":"2014-01-20T12:30:35","date_gmt":"2014-01-20T17:30:35","guid":{"rendered":"http:\/\/me-en.kaspersky.com\/blog\/?p=2708"},"modified":"2020-02-26T18:58:09","modified_gmt":"2020-02-26T14:58:09","slug":"a-day-in-the-life-of-a-content-analyst","status":"publish","type":"post","link":"https:\/\/me-en.kaspersky.com\/blog\/a-day-in-the-life-of-a-content-analyst\/2708\/","title":{"rendered":"A Day In The Life Of A Content Analyst"},"content":{"rendered":"<p>There is a position at Kaspersky Lab that exists that is officially named \u2018content analyst\u2019. To put it simply, the content analyst\u2019s task is to create content filter databases, which are, in turn, used in our products. The filters are included in the security solutions both for end users and corporate clients. A module responsible for content filtering is called <a href=\"https:\/\/www.kaspersky.com\/blog\/tag\/parental-control\/\" target=\"_blank\" rel=\"noopener nofollow\">Parental Control<\/a>, for the first group of users, and <a href=\"https:\/\/business.kaspersky.com\/tag\/web-control\/\" target=\"_blank\" rel=\"noopener nofollow\">Web Control<\/a>, for the second group. \u00a0By using the tools implemented in Parental Control, parents are able to restrict the access of their children to inappropriate websites. In order to apply these filters, the security company has to know which category each site falls into and what kind of content is stored there. This is where our databases come along.<\/p>\n<p><a href=\"https:\/\/media.kasperskydaily.com\/wp-content\/uploads\/sites\/37\/2014\/01\/05111039\/content-1.jpeg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-3515\" alt=\"content\" src=\"https:\/\/media.kasperskydaily.com\/wp-content\/uploads\/sites\/37\/2014\/01\/05111039\/content-1.jpeg\" width=\"640\" height=\"480\"><\/a><\/p>\n<p>Apparently, filling in the filter databases manually is not possible. According to Google, the Internet now contains approximately 15 billion web pages, which is why a majority of the scanning is performed by robots. They analyze the Internet content using our <a href=\"https:\/\/www.kaspersky.com\/blog\/ksn\/\" target=\"_blank\" rel=\"noopener nofollow\">KSN<\/a> cloud and automatically deliver the verdict on each web site. The decision made in the cloud is used in our mobile security solutions (including <a href=\"https:\/\/www.kaspersky.com\/blog\/have-no-fear-ipad\/\" target=\"_blank\" rel=\"noopener nofollow\">Safe Browser<\/a> for iOS), and, of course, the rest of our products use them to power the components of the content filtration.\u00a0 The analyst\u2019s task is to teach robots to correctly categorize the content on the Internet. A robot has to evaluate keywords and combinations on a Web page, housekeeping data, as well as images, and carry out decisions whether the content on the page falls within a certain category. For example, if the web site code contains combinations like \u2018watch porn online\u2019, \u2018free porn\u2019 or a large number of flesh-colored images and alike, a robot would categorize that given resource as \u2018Pornography and erotic\u2019 based on a set of criteria.<\/p>\n<div class=\"pullquote\">The analysts\u2019 task is to teach robots to correctly categorize content from the Internet<\/div>\n<p>As a part of their job responsibilities, web analysts at times have to scan \u2018inappropriate content\u2019. We work with many content categories, including a number of highly dangerous ones:<\/p>\n<ul>\n<li>Drugs;<\/li>\n<li>Cruelty and violence;<\/li>\n<li>Arms;<\/li>\n<li>Gambling;<\/li>\n<li>Obscene language.<\/li>\n<\/ul>\n<p>As a rule, the above-mentioned categories are, by default, ticked as restricted by the Administrator when Parental or Web Control is enabled.<\/p>\n<p>As web analysts, content may vary. One should be psychologically resistant, able to abstract from certain emotions, and healthily cynical in order to work with the \u2018Cruelty and violence\u2019 category. These personal qualities are explicitly stated as required to the content analytics candidate during the very early stages of a job interview.<\/p>\n<p>But that\u2019s not our entire job\u2026 it\u2019s difficult to describe the work process and all the peculiarities of searching the wickedest content on the Internet in detail, but, in general, the typical work day looks like this:<\/p>\n<p>Morning email check<\/p>\n<ul>\n<li>To me personally, this process helps to get into the mood for work and evaluate the volume of tasks for today.<\/li>\n<\/ul>\n<p>Status meeting<\/p>\n<ul>\n<li>Every day at 11am our team gathers in the meeting room and discusses everything done yesterday and the scope of work for today. This is an integral part of the work process, which helps to exercise discipline and ensure transparency when working on a team.<\/li>\n<\/ul>\n<p>Coffee break<\/p>\n<ul>\n<li>The job is important, but who could do without a morning cup!<\/li>\n<\/ul>\n<p>User inquiries<\/p>\n<ul>\n<li>On a daily basis, our tech support has to deal with user inquiries relative to incorrect blocking of web sites by content filter modules. Such requests are passed to our teams of content analysts who take care of the inquiry, analyze it, and provide feedback to users. We also work with issues like Anti-Banner malfunctioning in our products. Officially, each request takes up to three work days, but, as always, the sooner the better.<\/li>\n<\/ul>\n<p>Work with content categories<\/p>\n<ul>\n<li>The major part of content analyst\u2019s work is creating new categories and supporting existing ones. Now we support 15 categories and 7 languages. That is why the most important criteria for an applicant to web content analyst position is fluency in a second foreign language, besides English. We welcome candidates with degrees in philology or linguistics, and, which goes without saying, analytical thinking. In a nutshell, the candidate must be an \u2018arts and math\u2019 person, which is quite a rare case.<\/li>\n<\/ul>\n<p>Live test<\/p>\n<ul>\n<li>When categorizing the content, the bots use databases created and supported by content analysts. Obviously, before being released, the databases go through a number of tests. The most important of them is live testing using real-life, top web sites. This test is run daily. A list of the most popular global web sites is formed every morning, and those which have not been included into our test list before, are sent to our analysts to categorize. The analysts deliver their verdict on the new sites and update the test manually. The latter is meant to ensure that the categories assigned to web sites by bots correspond to the analyst\u2019s verdict. As the web site content may change, we have to constantly work with this test. As a rule, we initiate a live test three times a day: in the morning, in the afternoon, and in the evening, and take turns working with it.<\/li>\n<\/ul>\n<p>Education<\/p>\n<ul>\n<li>The employees at our department are well-known experts in protecting children from inappropriate content on the Internet: they write articles, post in blogs and are interviewed or invited as spokespeople at a number of conferences.<\/li>\n<li>Nevertheless, despite a number of articles, speeches, or interviews, the protection of children from explicit content is still a hot topic.<\/li>\n<\/ul>\n<p>Surely, there are a million ways to find resources with explicit content. Parental or Web Control is not foolproof. However, we do our best to protect users, as well as their family or colleagues, from inappropriate content on the Internet by providing useful tools to organize their defense.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There is a position at Kaspersky Lab that exists that is officially named \u2018content analyst\u2019. To put it simply, the content analyst\u2019s task is to create content filter databases, which<\/p>\n","protected":false},"author":353,"featured_media":2709,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[5],"tags":[498,499],"class_list":{"0":"post-2708","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-news","8":"tag-content-analyst","9":"tag-products-2"},"hreflang":[{"hreflang":"en-ae","url":"https:\/\/me-en.kaspersky.com\/blog\/a-day-in-the-life-of-a-content-analyst\/2708\/"},{"hreflang":"en-in","url":"https:\/\/www.kaspersky.co.in\/blog\/a-day-in-the-life-of-a-content-analyst\/2815\/"},{"hreflang":"en-us","url":"https:\/\/usa.kaspersky.com\/blog\/a-day-in-the-life-of-a-content-analyst\/3045\/"},{"hreflang":"en-gb","url":"https:\/\/www.kaspersky.co.uk\/blog\/a-day-in-the-life-of-a-content-analyst\/2897\/"},{"hreflang":"x-default","url":"https:\/\/www.kaspersky.com\/blog\/a-day-in-the-life-of-a-content-analyst\/3514\/"},{"hreflang":"ja","url":"https:\/\/blog.kaspersky.co.jp\/a-day-in-the-life-of-a-content-analyst\/2383\/"},{"hreflang":"en-au","url":"https:\/\/www.kaspersky.com.au\/blog\/a-day-in-the-life-of-a-content-analyst\/3514\/"},{"hreflang":"en-za","url":"https:\/\/www.kaspersky.co.za\/blog\/a-day-in-the-life-of-a-content-analyst\/3514\/"}],"acf":[],"banners":"","maintag":{"url":"https:\/\/me-en.kaspersky.com\/blog\/tag\/content-analyst\/","name":"content analyst"},"_links":{"self":[{"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/posts\/2708","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/users\/353"}],"replies":[{"embeddable":true,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/comments?post=2708"}],"version-history":[{"count":2,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/posts\/2708\/revisions"}],"predecessor-version":[{"id":15697,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/posts\/2708\/revisions\/15697"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/media\/2709"}],"wp:attachment":[{"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/media?parent=2708"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/categories?post=2708"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/me-en.kaspersky.com\/blog\/wp-json\/wp\/v2\/tags?post=2708"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}