Fine-tuning (deep learning)

Fine-tuning is a deep learning technique, within artificial intelligence[1], for adapting an existing neural network to a new task by further training some or all of its parameters. It is a form of transfer learning, in which knowledge gained on one task is reused for a related task. Fine-tuning can be applied to the entire network or to a subset of its layers, and a model can also be extended with small adapter modules that are trained while the original weights stay fixed. It is widely used in natural language processing, particularly for language models. Fine-tuning can reduce a model's robustness to distribution shift; one mitigation is to linearly interpolate the fine-tuned weights with the original model's weights. Parameter-efficient methods such as low-rank adaptation (LoRA) fine-tune a model by training only a small number of additional parameters.

1. artificial intelligence. Artificial intelligence (AI) is a branch of computer science devoted to building systems that can perform tasks that normally require human intelligence, such as reasoning, learning, planning, perception, and language understanding. Drawing on fields as varied as psychology, linguistics, philosophy, and neuroscience, AI underpins machine learning models and natural language processing systems. It also contributes significantly to the development of virtual assistants and affective computing systems. AI is applied in many sectors, including healthcare, industry, public administration, and education. It also raises ethical and social concerns that call for regulatory policy. With the arrival of techniques such as deep learning and generative AI, the field continues to expand into new areas.

In deep learning, fine-tuning is an approach to transfer learning in which the weights of a pre-trained model are trained on new data. Fine-tuning can be done on the entire neural network, or on only a subset of its layers, in which case the layers that are not being fine-tuned are "frozen" (not updated during the backpropagation step). A model may also be augmented with "adapters" that consist of far fewer parameters than the original model, and fine-tuned in a parameter-efficient way by tuning the weights of the adapters and leaving the rest of the model's weights frozen.
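The adapter idea can be illustrated with a low-rank adapter in the style of LoRA. The following is a minimal sketch in plain Python, not a reference implementation: the layer sizes, the rank, the `scale` parameter, and the helper names (`matvec`, `lora_forward`, `count_params`) are all assumptions chosen for illustration. The frozen weight matrix W is left untouched, and the effective weight is W + scale * B A, where only the small matrices A and B would be trained.

```python
import random

def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def lora_forward(W, A, B, x, scale=1.0):
    """Compute (W + scale * B A) x without materialising the product B A."""
    base = matvec(W, x)                 # frozen pre-trained path
    low_rank = matvec(B, matvec(A, x))  # adapter path through the rank-r bottleneck
    return [b + scale * l for b, l in zip(base, low_rank)]

def count_params(M):
    """Number of scalar entries in matrix M."""
    return sum(len(row) for row in M)

rng = random.Random(0)
d_in, d_out, rank = 64, 64, 4  # illustrative sizes (assumptions)

# W stands in for a pre-trained weight matrix and is never updated.
W = [[rng.gauss(0, 0.1) for _ in range(d_in)] for _ in range(d_out)]
# A and B are the adapter; B starts at zero so the adapted layer
# initially behaves exactly like the pre-trained one.
A = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(rank)]
B = [[0.0] * rank for _ in range(d_out)]

x = [rng.gauss(0, 1) for _ in range(d_in)]
frozen_params = count_params(W)
adapter_params = count_params(A) + count_params(B)
print(frozen_params, adapter_params)  # 4096 512
```

With these sizes the adapter holds 512 trainable values against 4096 frozen ones, and the gap widens as the layer grows because the adapter scales with rank * (d_in + d_out) rather than d_in * d_out.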

For some architectures, such as convolutional neural networks, it is common to keep the earlier layers (those closest to the input layer) frozen because they capture lower-level features, while later layers capture higher-level features that tend to be more closely tied to the task the model is trained on.
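Freezing the earlier layers amounts to skipping them in the weight update. The sketch below assumes a hypothetical model represented as a list of layers, each with a `trainable` flag, and uses placeholder gradients in place of real backpropagation; the layer names and values are illustrative only.

```python
def freeze_first(layers, k):
    """Mark the first k layers (those closest to the input) as frozen."""
    for i, layer in enumerate(layers):
        layer["trainable"] = i >= k

def sgd_step(layers, grads, lr=0.1):
    """Apply one gradient-descent step, skipping frozen layers."""
    for layer, grad in zip(layers, grads):
        if layer["trainable"]:
            layer["weights"] = [w - lr * g for w, g in zip(layer["weights"], grad)]

# Hypothetical three-layer model; real layers would hold many more weights.
layers = [
    {"name": "conv1", "weights": [0.2, -0.1], "trainable": True},
    {"name": "conv2", "weights": [0.4, 0.3], "trainable": True},
    {"name": "classifier", "weights": [0.0, 0.0], "trainable": True},
]
# Placeholder gradients standing in for the output of backpropagation.
grads = [[1.0, 1.0], [1.0, 1.0], [1.0, -1.0]]

freeze_first(layers, 2)   # keep the two earliest layers fixed
sgd_step(layers, grads)   # only "classifier" changes

for layer in layers:
    print(layer["name"], layer["trainable"], layer["weights"])
```

Deep learning frameworks typically expose an equivalent per-parameter switch, so the update rule itself does not need to be rewritten as it is here.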

Models that are pre-trained on large and general corpora are usually fine-tuned by reusing the model's parameters as a starting point and adding a task-specific layer trained from scratch. Fine-tuning the full model is common as well and often yields better results, but it is more computationally expensive.
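As a rough illustration of adding a task-specific layer on top of reused parameters, the sketch below keeps a small stand-in "pre-trained" feature extractor fixed and trains a new linear head from zero with full-batch gradient descent. The backbone weights, the toy regression target, the learning rate, and the step count are all assumptions made for the example; a real setup would load an actual pre-trained model.

```python
import math
import random

rng = random.Random(0)

# Stand-in for a pre-trained feature extractor: a fixed 2 -> 3 tanh layer.
# These weights are hypothetical and are never updated below.
backbone = [[0.5, -0.3], [0.8, 0.2], [-0.4, 0.9]]
backbone_before = [row[:] for row in backbone]

def features(x):
    """Frozen feature extractor reused from the 'pre-trained' model."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in backbone]

def predict(head_w, head_b, x):
    """Task-specific linear head applied to the frozen features."""
    return sum(w * p for w, p in zip(head_w, features(x))) + head_b

def mse(head_w, head_b, data):
    """Mean squared error of the head on the data set."""
    return sum((predict(head_w, head_b, x) - y) ** 2 for x, y in data) / len(data)

# Toy task (an assumption): predict x0 - x1 from a 2-dimensional input.
data = []
for _ in range(50):
    x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    data.append((x, x[0] - x[1]))

# The new head is trained from scratch, starting at zero.
head_w, head_b = [0.0, 0.0, 0.0], 0.0
initial_loss = mse(head_w, head_b, data)

lr = 0.1
for _ in range(200):
    grad_w, grad_b = [0.0, 0.0, 0.0], 0.0
    for x, y in data:
        phi = features(x)
        err = sum(w * p for w, p in zip(head_w, phi)) + head_b - y
        for j in range(3):
            grad_w[j] += 2 * err * phi[j] / len(data)
        grad_b += 2 * err / len(data)
    # Only the head parameters are updated; the backbone is left alone.
    head_w = [w - lr * g for w, g in zip(head_w, grad_w)]
    head_b -= lr * grad_b

final_loss = mse(head_w, head_b, data)
print(initial_loss, final_loss)
```

Full fine-tuning would also compute gradients for the backbone weights and update them in the same loop, which is where the extra computational cost comes from.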

Fine-tuning is typically accomplished with supervised learning, but there are also techniques to fine-tune a model using weak supervision. Fine-tuning can be combined with an objective based on reinforcement learning from human feedback (RLHF) to produce language models like ChatGPT (a fine-tuned version of GPT-3) and Sparrow.

