Color Sift Descriptors to Categorize Illegal Activities in Images of Onion Domains

  1. David Matilla 1
  2. Víctor González-Castro 1
  3. Laura Fernández-Robles 1
  4. Eduardo Fidalgo 1
  5. Mhd Wesam Al-Nabk 1
  1. 1 Universidad de León, León, Spain (ROR https://ror.org/02tzt0b78)

Book:
XXXIX Jornadas de Automática: actas. Badajoz, 5-7 de septiembre de 2018
  1. Inés Tejado Balsera (coord.)
  2. Emiliano Pérez Hernández (coord.)
  3. Antonio José Calderón Godoy (coord.)
  4. Isaías González Pérez (coord.)
  5. Pilar Merchán García (coord.)
  6. Jesús Lozano Rogado (coord.)
  7. Santiago Salamanca Miño (coord.)
  8. Blas M. Vinagre Jara (coord.)

Publisher: Universidad de Extremadura

ISBN: 978-84-9749-756-5 978-84-09-04460-3

Year of publication: 2018

Pages: 991-997

Conference: Jornadas de Automática (39. 2018. Badajoz)

Type: Conference contribution

DOI: 10.17979/SPUDC.9788497497565.0991

Abstract

The Dark Web, i.e. the portion of the Web whose content is neither indexed nor accessible by standard web browsers, comprises several darknets. The Onion Router (Tor) is the best known of them thanks to the anonymity it provides to its users, which has led to the creation of domains, or hidden services, that host illegal activities. In this work, we explored the possibility of identifying illegal domains on the Tor darknet based on their visual content. After crawling and filtering the images of 500 hidden services, we sorted them into five different illegal categories and trained a classifier using the Bag of Visual Words (BoVW) model. In this model, SIFT (Scale Invariant Feature Transform) or dense SIFT descriptors were used to describe the image patches from which the visual words of the BoVW model are computed. However, SIFT only works with gray-scale images, so the color information in an image is lost. To overcome this drawback, we implemented and assessed the performance of three different variants of SIFT descriptors that can be used with color images, among them HSV-SIFT and RGB-SIFT, within the BoVW model for image classification. The results showed the usefulness of color-SIFT descriptors over SIFT: in our experiments, SIFT achieved an accuracy of 57.52%, whereas the HSV-SIFT descriptor achieved an accuracy of up to 59.44%.
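
The BoVW step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that per-channel patch descriptors have already been computed (for example, one SIFT vector per color channel), that a color descriptor is formed by concatenating them, and that a visual vocabulary is already available. The function names `color_descriptor` and `bovw_histogram`, the 2-D toy descriptors and the two-word vocabulary are all hypothetical and chosen only to keep the example small.

```python
from math import dist

def color_descriptor(per_channel):
    """Concatenate the per-channel descriptors of one patch into one vector.

    Assumption: each channel (e.g. R, G, B) was described independently.
    """
    return [value for channel in per_channel for value in channel]

def bovw_histogram(descriptors, vocabulary):
    """Assign each descriptor to its nearest visual word and return the
    L1-normalized histogram of word counts."""
    counts = [0] * len(vocabulary)
    for descriptor in descriptors:
        nearest = min(range(len(vocabulary)),
                      key=lambda k: dist(descriptor, vocabulary[k]))
        counts[nearest] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else counts

# Toy data (hypothetical): 2-D values per channel instead of 128-D SIFT vectors.
patches = [
    ([0.0, 0.1], [0.0, 0.0], [0.1, 0.0]),
    ([1.0, 0.9], [1.0, 1.0], [0.9, 1.0]),
    ([0.9, 1.0], [1.0, 0.8], [1.0, 1.0]),
]
# Hypothetical vocabulary of two visual words in the 6-D concatenated space.
vocabulary = [[0.0] * 6, [1.0] * 6]

descriptors = [color_descriptor(p) for p in patches]
histogram = bovw_histogram(descriptors, vocabulary)
print(histogram)
```

In a full pipeline the vocabulary would typically be learned by clustering descriptors from training images, and the resulting histograms would be fed to a classifier; neither step is shown here.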