Natural Language Processing Algorithms to Improve Digital Marketing Data Quality and its Ethical Implications
| dc.contributor.author | Pons, Sergi | |
| dc.contributor.author | Huertas García, Rubén | |
| dc.contributor.author | Lengler, Jorge | |
| dc.contributor.author | Nascimento, Daniel Luiz de Mattos | |
| dc.date.accessioned | 2026-03-23T11:16:27Z | |
| dc.date.available | 2026-03-23T11:16:27Z | |
| dc.date.issued | 2025-06-09 | |
| dc.date.updated | 2026-03-23T11:16:27Z | |
| dc.description.abstract | The ethical implications of personalization in digital marketing are significantly greater when companies adapt their marketing actions to individual consumer preferences. While this approach helps to reduce oversaturation and a sense of irrelevance among consumers, it also raises concerns about privacy and potential algorithmic bias. One form of personalization is self-referencing, where companies use the customer’s name in all communications with that person. For this to be effective, customer data must be accurate and sourced from a high-quality database. This study presents a real case of data mining by a lead generation company, illustrating the sequential process of cleaning a database containing the names and surnames of 100,000 customers. In the final filtering step, we compared the performance of two Natural Language Processing (NLP) algorithms, Levenshtein and RapidFuzz, using ratio tests. The results demonstrate that the Levenshtein algorithm outperformed RapidFuzz, the former achieving a 93.43% clean dataset compared to the latter’s 92.93%. Finally, we discuss the ethical challenges posed by the privacy-personalization paradox, explore the theoretical and managerial implications, and propose future research directions that balance digital marketing interests with consumer privacy. | |
| dc.format.extent | 12 p. | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.idgrec | 757766 | |
| dc.identifier.issn | 0742-6046 | |
| dc.identifier.uri | https://hdl.handle.net/2445/228401 | |
| dc.language.iso | eng | |
| dc.publisher | John Wiley & Sons | |
| dc.relation.isformatof | Reproducció del document publicat a: https://doi.org/10.1002/mar.22211 | |
| dc.relation.ispartof | Psychology & Marketing, 2025, vol. 42, num.7, p. 1946-1957 | |
| dc.relation.uri | https://doi.org/10.1002/mar.22211 | |
| dc.rights | cc-by (c) Pons et al., 2025 | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.source | Articles publicats en revistes (Empresa) | |
| dc.subject.classification | Tractament del llenguatge natural (Informàtica) | |
| dc.subject.classification | Màrqueting per Internet | |
| dc.subject.classification | Ètica empresarial | |
| dc.subject.other | Natural language processing (Computer science) | |
| dc.subject.other | Internet marketing | |
| dc.subject.other | Business ethics | |
| dc.title | Natural Language Processing Algorithms to Improve Digital Marketing Data Quality and its Ethical Implications | |
| dc.type | info:eu-repo/semantics/article | |
| dc.type | info:eu-repo/semantics/publishedVersion |
Fitxers
Paquet original
1 - 1 de 1