Information technology for gender recognition by voice

Коштура, Діана; Koshtura, Diana

Information technology for gender recognition by voice

dc.citation.epage	360
dc.citation.issue	13
dc.citation.journalTitle	Вісник Національного університету "Львівська політехніка". Інформаційні системи та мережі
dc.citation.spage	350
dc.contributor.affiliation	Національний університет “Львівська політехніка”
dc.contributor.affiliation	Lviv Polytechnic National University
dc.contributor.author	Коштура, Діана
dc.contributor.author	Koshtura, Diana
dc.coverage.placename	Львів
dc.coverage.placename	Lviv
dc.date.accessioned	2025-03-06T09:14:10Z
dc.date.created	2023-02-28
dc.date.issued	2023-02-28
dc.description.abstract	Розпізнавання статі людини за голосом є складною проблемою в опрацюванні мовлення. Це завдання передбачає виділення значущих ознак із мовних сигналів, класифікацію їх на чоловічі чи жіночі категорії. У статті реалізовано інформаційну технологію розпізнавання статі. Спочатку записали зразки голосу, як чоловічого, так і жіночого, і визначили кепстральні коефіцієнти Mel-частоти (MFCC) як характеристики. Потім, пройшовши навчання, класифікатор опорних векторів (SVM) вивчав ці функції та оцінював їх ефективність, використовуючи показники точності, запам’ятовування та показників F1. Ці експерименти продемонстрували, що запропонована інформаційна технологія повинна досягати високої точності на тестовому наборі та точно передбачати стать мовця на основі прослуховування його голосу. Досліджено використання попередньо навчених моделей, щоб зменшити потребу у великих обсягах навчальних даних, і виявлено, що вони можуть забезпечити високу продуктивність, і потребують менше обчислень. Це дослідження підкреслює потенціал використання методів машинного навчання для розпізнавання статі за голосом і може бути поширене на інші програми опрацювання мовлення.
dc.description.abstract	Gender recognition from voice is a challenging problem in speech processing. This task involves extracting meaningful features from speech signals and classifying them into male or female categories. In this article, was implemented a gender recognition system using Python programming. I first recorded voice samples from both male and female speakers and extracted Mel-frequency cepstral coefficients (MFCC) as features. Then trained, a Support Vector Machine (SVM) classifier was on these features and evaluated its performance using accuracy, precision, recall, and F1-score metrics. These experiments demonstrated that proposed system should achieve high accuracy on the test set and will accurately predict the gender of a speaker based on their voice. I also explored using pre-trained models to reduce the need for large amounts of training data and found that they can provide good performance while requiring less computation. This study highlights the potential of using machine learning techniques for gender recognition from voice and can be extended to other speech processing applications.
dc.format.extent	350-360
dc.format.pages	11
dc.identifier.citation	Koshtura D. Information technology for gender recognition by voice / Koshtura Diana // Information Systems and Networks. — Lviv : Lviv Politechnic Publishing House, 2023. — No 13. — P. 350–360.
dc.identifier.citationen	Koshtura D. Information technology for gender recognition by voice / Koshtura Diana // Information Systems and Networks. — Lviv : Lviv Politechnic Publishing House, 2023. — No 13. — P. 350–360.
dc.identifier.doi	doi.org/10.23939/sisn2023.13.350
dc.identifier.uri	https://ena.lpnu.ua/handle/ntb/63971
dc.language.iso	en
dc.publisher	Видавництво Львівської політехніки
dc.publisher	Lviv Politechnic Publishing House
dc.relation.ispartof	Вісник Національного університету "Львівська політехніка". Інформаційні системи та мережі, 13, 2023
dc.relation.ispartof	Information Systems and Networks, 13, 2023
dc.relation.references	1. Balasubramanian, V., & Manikandan, M. S. (2018). Automatic Gender Recognition from Speech Using Machine Learning Techniques. International Journal of Engineering & Technology, 7(4.35), 116–119. https://doi.org/10.14419/ijet.v7i4.35.22005
dc.relation.references	2. Sethi, P., & Chandra, M. (2018). Gender Classification of Speakers using Mel Frequency Cepstral Coefficients and Support Vector Machine. International Journal of Advanced Research in Computer Science, 9(3), 129–133. https://doi.org/10.26483/ijarcs.v9i3.5507
dc.relation.references	3. D. Koshtura and N. Kunanets, “Information Sysem Project for Communication of Hearing Impaired Users”, 2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT), Lviv, Ukraine, 2022, 247–251. DOI: 10.1109/CSIT56902.2022.10000866.
dc.relation.references	4. Andrunyk V., Shestakevych T. and Koshtura D. (2021). The text analysis software for hearing-impaired persons, 2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT), lviv, Ukraine, 119–123. DOI: 10.1109/CSIT52700.2021.9648605.
dc.relation.references	5. Chen G., Li J., Li Y., and Li J. (2020). Gender classification using a fusion of MFCC and deep residual network features. IEEE Transactions on Affective Computing, Vol. 11, No. 4, 656–665, Oct. – Dec. 2020.
dc.relation.references	6. Huang X., Cai M., and Zhang Q. (2021). Gender recognition in noisy environments using convolutional neural networks. Journal of Ambient Intelligence and Humanized Computing, Vol. 12, No. 10, 10425–10438, Oct. 2021.
dc.relation.references	7. Srivastava, R., & Singh, N. (2016). A Study of Feature Extraction Techniques for Gender Recognition System. International Journal of Computer Science and Mobile Computing, 5(11), 15–21. http://www.ijcsmc.com/docs/papers/November2016/V5I11201602.pdf.
dc.relation.references	8. Librosa documentation: https://librosa.org/doc/latest/index.html.
dc.relation.references	9. Scikit-learn documentation: https://scikit-learn.org/stable/documentation.html
dc.relation.referencesen	1. Balasubramanian, V., & Manikandan, M. S. (2018). Automatic Gender Recognition from Speech Using Machine Learning Techniques. International Journal of Engineering & Technology, 7(4.35), 116–119. https://doi.org/10.14419/ijet.v7i4.35.22005
dc.relation.referencesen	2. Sethi, P., & Chandra, M. (2018). Gender Classification of Speakers using Mel Frequency Cepstral Coefficients and Support Vector Machine. International Journal of Advanced Research in Computer Science, 9(3), 129–133. https://doi.org/10.26483/ijarcs.v9i3.5507
dc.relation.referencesen	3. Koshtura D. and Kunanets N. (2022). Information Sysem Project for Communication of Hearing Impaired Users, 2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT), Lviv, 247–251. DOI: 10.1109/CSIT56902.2022.10000866.
dc.relation.referencesen	4. Andrunyk V., Shestakevych T. and Koshtura D. (2021). The text analysis software for hearing-impaired persons, 2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT), Lviv, Ukraine, 119–123. DOI: 10.1109/CSIT52700.2021.9648605.
dc.relation.referencesen	5. Chen G., Li J., Li Y., and Li J. (2020). Gender classification using a fusion of MFCC and deep residual network features. IEEE Transactions on Affective Computing, Vol. 11, No. 4, 656–665, Oct.–Dec. 2020.
dc.relation.referencesen	6. Huang X., Cai M., and Zhang Q. (2021). Gender recognition in noisy environments using convolutional neural networks. Journal of Ambient Intelligence and Humanized Computing, Vol. 12, No. 10, 10425–10438, Oct. 2021.
dc.relation.referencesen	7. Srivastava, R., & Singh, N. (2016). A Study of Feature Extraction Techniques for Gender Recognition System. International Journal of Computer Science and Mobile Computing, 5(11), 15–21. http://www.ijcsmc.com/docs/papers/November2016/V5I11201602.pdf.
dc.relation.referencesen	8. Librosa documentation: https://librosa.org/doc/latest/index.html.
dc.relation.referencesen	9. Scikit-learn documentation: https://scikit-learn.org/stable/documentation.html.
dc.relation.uri	https://doi.org/10.14419/ijet.v7i4.35.22005
dc.relation.uri	https://doi.org/10.26483/ijarcs.v9i3.5507
dc.relation.uri	http://www.ijcsmc.com/docs/papers/November2016/V5I11201602.pdf
dc.relation.uri	https://librosa.org/doc/latest/index.html
dc.relation.uri	https://scikit-learn.org/stable/documentation.html
dc.rights.holder	© Національний університет “Львівська політехніка”, 2023
dc.rights.holder	© Koshtura D. A., 2023
dc.subject	розпізнавання статі
dc.subject	Python
dc.subject	Mel-частотні кепстральні коефіцієнти
dc.subject	опорна векторна машина
dc.subject	машинне навчання
dc.subject	Python
dc.subject	Mel-frequency cepstral coefficients
dc.subject	Support Vector Machine
dc.subject	Machine Learning
dc.subject.udc	004.4'277
dc.subject.udc	305
dc.title	Information technology for gender recognition by voice
dc.title.alternative	Інформаційна технологія розпізнання статі за голосом
dc.type	Article

Files

Original bundle

Now showing 1 - 2 of 2

Name:: 2023n13_Koshtura_D-Information_technology_350-360.pdf
Size:: 642.88 KB
Format:: Adobe Portable Document Format

Download

Name:: 2023n13_Koshtura_D-Information_technology_350-360__COVER.png
Size:: 397.16 KB
Format:: Portable Network Graphics

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.77 KB
Format:: Plain Text
Description:

Download

Collections

Вісник Національного університету "Львівська політехніка". Інформаційні системи та мережі. – 2023. – Випуск 13