利用企业的网络影响力来改进 NACE 代码分类.pdf

编号:718580 PDF 19页 857.99KB 下载积分:VIP专享
下载报告请您先登录!

利用企业的网络影响力来改进 NACE 代码分类.pdf

1、www.statistik.atUnabhngige Statistiken fr faktenbasierte EntscheidungenExploiting the Web Presence of Enterprises to Improve NACE Code ClassificationJohannes GussenbauerWIN 2025 CONFERENCE Danzig,05.02.2025Johannes.Gussenbauerstatistik.gv.atAlexander KowarikAlexander.Kowarikstatistik.gv.atwww.statis

2、tik.atFolie 2Outline Aim of classification task Data acquisition and processing Modelling and performance evaluation Hierarchical performance measuresFolie 3www.statistik.atAim of classification taskwww.statistik.atFolie 4Aim of classification task NACE editing labour intensive task+NACE revision co

3、ming 2025 Possible to predict NACE of entrprise using text from enterprise website?Test NACE predicion during ESSNet Web Intelligence Network Main focus on developing model used in recommendation system for editing task reduceediting timeFolie 5www.statistik.atData Acquisition and pre-processingwww.

4、statistik.atFolie 6Data Acquisition Collect web data during ICT-survey cycles Collected data from 2019 to 2023(results limited up to 2021)Google Custom API Search withname and address ofenterpriseSelenium+R Scrape text fromwebsite;especially searchfor imprint“Link Websiten and address Process text a

5、nd deterministicallylink via VAT orCRN found in imprint“www.statistik.atFolie 7www.statistik.atFolie 8Text data processing Process collected text from website Transform each word with the German morphological lexicon available on https:/www.openthesaurus.de/about/download Lemmetization and stemming

6、did not improve classification performance Removing all digits and punctuations Remove characters not part of the German dictionary Remove German stop words.Folie 9www.statistik.atModelling&Resultswww.statistik.atFolie 10NACE Classification Make NACE level 2 prediction using text as features=Pre-pro

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(利用企业的网络影响力来改进 NACE 代码分类.pdf)为本站 (Flechazo) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠