☆ 4.7 Article

Part-of-speech tagging of building codes empowered by deep learning and transformational rules

ADVANCED ENGINEERING INFORMATICS (2021)

期刊

ADVANCED ENGINEERING INFORMATICS

卷 47, 期 -, 页码 -

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.aei.2020.101235

关键词

Automated compliance checking; Automated information extraction; Natural language processing; Part-of-speech tagging; Automated construction management systems; Deep learning

类别

Computer Science, Artificial Intelligence Engineering, Multidisciplinary

资金

National Science Foundation (NSF)
NSF [1827733]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The paper discusses the development of automated building code compliance checking systems and highlights the limitations of existing systems in converting regulatory rules. It proposes a deep learning-based POS tagger specifically tailored to building codes.

Automated building code compliance checking systems were under development for many years. However, the excessive amount of human inputs needed to convert building codes from natural language to computer understandable formats severely limited their range of applicable code requirements. To address that, automated code compliance checking systems need to enable an automated regulatory rule conversion. Accurate Part-of-Speech (POS) tagging of building code texts is crucial to this conversion. Previous experiments showed that the state-of-the-art generic POS taggers do not perform well on building codes. In view of that, the authors are proposing a new POS tagger tailored to building codes. It utilizes deep learning neural network model and error-driven transformational rules. The neural network model contains a pre-trained model and one or more trainable neural layers. The pre-trained model was fine-tuned on Part-of-Speech Tagged Building Codes (PTBC), a POS tagged building codes dataset. The fine-tuning of pre-trained model allows the proposed POS tagger to reach high precision with a small amount of available training data. Error-driven transformational rules were used to boost performance further by fixing errors made by the neural network model in the tagged building code. Through experimental testing, the authors found a well-performing POS tagger for building codes that had one bidirectional LSTM trainable layer, utilized BERT_Cased_Base pre-trained model and was trained 50 epochs. This model reached a 91.89% precision without error-driven transformational rules and a 95.11% precision with error-driven transformational rules, which outperformed the 89.82% precision achieved by the state-of-the-art POS taggers.

Part-of-speech tagging of building codes empowered by deep learning and transformational rules

期刊

ADVANCED ENGINEERING INFORMATICS

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Part-of-speech tagging of building codes empowered by deep learning and transformational rules

期刊

ADVANCED ENGINEERING INFORMATICS

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文