Matrix-Vector Products for Model Predictions

Summary

Applying a trained model to a full dataset means turning learned weights into predictions, and the matrix-vector product is the operation that makes it possible. You will learn how to align dimensions, use transpose in NumPy, and run predictions over multiple examples in parallel.

What is a matrix-vector product and why does it matter for predictions?

When a model finishes training, it stores its knowledge in a vector of weights. Using that knowledge on new data is a step called inference, and the matrix-vector product is its mathematical engine.

Think of it like this: each row of your data matrix is one example, and the weight vector holds what the model considers important. Taking the dot product of one row with the weight vector gives a single prediction for that row. Stack those results for every row and you get a new vector of predictions.

What is inference in machine learning? It is the step where you take the weights a model learned during training and use them to compute outputs for new data it has never seen.

This operation sits at the core of linear models and neural networks. Every time a model predicts at scale, it is essentially running many dot products in parallel.
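The row-by-row picture can be checked with a minimal sketch. The data and weights below are hypothetical toy values chosen only for illustration; the point is that one dot product per row gives the same numbers as a single matrix-vector product.

```python
import numpy as np

# Hypothetical toy data: 3 examples (rows), 2 features each
X = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])
w = np.array([0.5, -1.0])  # hypothetical weights, one per feature

# One dot product per row, computed explicitly in a Python loop
row_by_row = np.array([np.dot(row, w) for row in X])

# The same predictions from a single matrix-vector product
all_at_once = X @ w

print(row_by_row)
print(all_at_once)
print(np.allclose(row_by_row, all_at_once))  # True
```

The loop version is only there to show the equivalence; in practice the single `X @ w` call is the form you would use.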

How do you fix incompatible shapes with transpose in NumPy?

For the multiplication to work, the inner dimensions must match: the number of columns in the matrix has to equal the number of rows in the vector. This is where many beginners hit a wall.

Sometimes your weight vector loads as a row vector with shape (1, 2) when you actually need a column vector with shape (2, 1). The math refuses to cooperate, and NumPy throws an error.

The fix is the transpose operation, written as .T in NumPy. Its only job is to swap rows and columns:

A matrix of shape (3, 2) becomes (2, 3).
A row vector of shape (1, 5) becomes a column vector of shape (5, 1).
A column vector of shape (2, 1) becomes a row vector of shape (1, 2).

What does .T do in NumPy? It transposes the array, swapping its rows and its columns so the shapes align for matrix operations.

With transpose you flip an array's rows and columns until the rule "columns of the first equal rows of the second" is satisfied.
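The three shape changes listed above can be verified with a short sketch. The array names are placeholders introduced here, and the arrays are filled with zeros because only the shapes matter.

```python
import numpy as np

A = np.zeros((3, 2))    # matrix
row = np.zeros((1, 5))  # row vector
col = np.zeros((2, 1))  # column vector

print(A.T.shape)    # (2, 3)
print(row.T.shape)  # (5, 1)
print(col.T.shape)  # (1, 2)

# Caveat: a 1-D array has a single axis, so .T leaves its shape unchanged
flat = np.zeros(5)
print(flat.T.shape)  # (5,)
```

The last case is worth keeping in mind: `.T` only turns a row into a column when the array is two-dimensional.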

How to multiply a matrix by a vector in NumPy step by step

Start by importing NumPy and creating a small dataset. Imagine three students described by two features: study hours and sleep hours.

```python
import numpy as np

# Each row is one student: [study hours, sleep hours]
X = np.array([
    [10, 8],
    [5, 7],
    [12, 4],
])

# Weights stored as a row vector
w_row = np.array([[0.6, 0.4]])

print(X.shape)      # (3, 2)
print(w_row.shape)  # (1, 2)
```

The weights 0.6 and 0.4 tell the model that study hours matter more than sleep hours. But the shapes (3, 2) and (1, 2) are not compatible for multiplication in that order.

If you try X @ w_row directly, NumPy raises a ValueError about mismatched shapes. The columns of X (2) do not match the rows of w_row (1).
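A minimal sketch of that failure, reusing the lesson's X and w_row values. The `failed` flag is a name introduced here only to record what happened; the exact wording of the error message depends on your NumPy version.

```python
import numpy as np

X = np.array([[10, 8], [5, 7], [12, 4]])  # shape (3, 2)
w_row = np.array([[0.6, 0.4]])            # shape (1, 2)

try:
    X @ w_row  # inner dimensions are 2 and 1, so this cannot work
    failed = False
except ValueError as err:
    failed = True
    print("Shape error:", err)

print(failed)  # True
```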

Transposing the weight vector

Apply .T to turn the row vector into a column vector:

```python
w_col = w_row.T     # row vector (1, 2) -> column vector (2, 1)
print(w_col)
print(w_col.shape)  # (2, 1)
```

Now the values 0.6 and 0.4 are stacked vertically, and the shape (2, 1) is compatible with (3, 2).

Running the matrix-vector product

Use the @ operator to perform the multiplication:

```python
predictions = X @ w_col
print(predictions)
print(predictions.shape)  # (3, 1)
```

The result is a column vector with three predictions: 9.2, 5.8, and 8.8. Each value is the predicted performance score for one student, on a scale roughly from 1 to 10.

The output shape (3, 1) makes sense: the result has as many rows as the matrix (3) and as many columns as the vector (1).
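One related detail, shown as a hedged sketch: if the weights are stored as a one-dimensional array instead of a 2-D column, `X @ w` works without any transpose and returns a 1-D result of shape (3,). The names `w_flat`, `pred_col`, and `pred_flat` are introduced here for illustration.

```python
import numpy as np

X = np.array([[10, 8], [5, 7], [12, 4]])

w_col = np.array([[0.6], [0.4]])  # 2-D column vector, shape (2, 1)
w_flat = np.array([0.6, 0.4])     # 1-D array, shape (2,)

pred_col = X @ w_col    # 2-D result, shape (3, 1)
pred_flat = X @ w_flat  # 1-D result, shape (3,)

print(pred_col.shape)
print(pred_flat.shape)

# Same numbers either way; only the shape of the result differs
print(np.allclose(pred_col.ravel(), pred_flat))  # True
```

Which form to prefer is a design choice: explicit 2-D shapes make the linear algebra rule visible, while 1-D arrays are shorter to write.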

What happens if you change the weights in the model?

Try swapping the weights to give more importance to sleep hours. Change w_row from [0.6, 0.4] to [0.3, 0.7] and run everything again.

The predictions shift to 8.6, 6.4, and 6.4 because the model now values rest over study time. The first student, who sleeps the most, stays on top, while the third student, who studied the most but slept the least, drops into a tie with the second.

This simple change shows how weights shape behavior. In real models, training is exactly this: finding the combination of numbers that produces the best predictions across thousands of examples.
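The weight swap can be tried with a small sketch. The `predict` helper is hypothetical, written here only to avoid repeating the transpose step; it is not part of NumPy or of the lesson's code.

```python
import numpy as np

X = np.array([[10, 8], [5, 7], [12, 4]])  # [study hours, sleep hours]

def predict(X, w_row):
    """Hypothetical helper: transpose a (1, n) weight row and apply it to X."""
    return X @ w_row.T

study_heavy = predict(X, np.array([[0.6, 0.4]]))
sleep_heavy = predict(X, np.array([[0.3, 0.7]]))

# Round only for display, to hide floating-point noise
print(np.round(study_heavy.ravel(), 1))  # [9.2 5.8 8.8]
print(np.round(sleep_heavy.ravel(), 1))  # [8.6 6.4 6.4]
```

Swapping in other weight pairs is a quick way to see how each student's score responds.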

Which student got the highest score in your version? Share your results in the comments and tell me which weight combination you tried.

Comments

Gabriel Obregón

Student

🧮 Matrix-vector product and inference in NumPy

🔑 KEY CONCEPTS (quick overview)

🔹 Dot product → Combines two vectors → Measures alignment / similarity

🔹 Inference / prediction → Apply learned weights to unseen data

🔹 Matrix-vector product → Many dot products at once → Efficient, scalable computation

🔹 Dimension compatibility → Columns of the first = rows of the second

🔹 Row vector vs. column vector → If they do not fit ➜ transpose

📐 GOLDEN RULE OF DIMENSIONS

✔ For the multiplication to work:

(m, n) @ (n, 1) → (m, 1)

❌ If the inner numbers do not match → error

🛑 Typical error: trying to multiply a matrix by a row vector

🔄 TRANSPOSE .T IN NUMPY

🔁 What does it do? → Swaps rows and columns

🛠 What is it for? → Fixes dimension mismatches

📌 Quick examples:

· (3, 2) ➜ (2, 3)

· (1, 5) ➜ (5, 1)

✍️ In NumPy it is written: .T

🧪 PRACTICAL EXAMPLE (simple model)

📊 INPUT DATA X

👨‍🎓 Each row = one student 📚 Feature 1: study hours 😴 Feature 2: sleep hours

X = [[10, 8],
     [ 5, 7],
     [12, 4]]

📏 Shape: (3, 2)

⚖️ MODEL WEIGHTS W

🎚 Initial weights:

W = [0.6, 0.4]

📏 Shape: (1, 2) ❌ ➡️ Not compatible with X

🔁 FIX WITH TRANSPOSE

W_columna = W.T

📏 New shape: (2, 1) ✅ 🎯 Now the multiplication works

Jesús Alberto Romero Hernández

Student

In this lesson, the vector w was aptly written as a row vector, like this:

w_fila=np.array([[0.6,0.4]])

This lets us tell it apart from column vectors:

w_columna=np.array([[0.6],
 [0.4]])

That makes it possible to explain the requirements the array shapes must meet when computing the dot product. However, the more flexible way to write vectors is as a one-dimensional array built from a simple list:

w=np.array([0.6,0.4])

When multiplying with it, no transpose is needed to meet the requirements. NumPy and Python do the work automatically. Of course, you still need to be clear on the linear algebra theory.

Alberto Ezequiel Marin Chacon

Student

The first student still has the best score despite the change. When sleep hours are given more importance, the third and second students end up with the same score.

Daniel Erazo

Instructor

Great answer!

Victor Alfonso Anaya Pineda

Student

Good afternoon,

This is my proposal for the exercise set in class.

It gives the following scores:

Since the model prioritizes sleep hours over study hours, students who sleep more will get better scores even if they study less. Thank you very much.

José Eder Guzmán Mendoza

Student

The matrix-vector product is the key operation for inference in machine learning, because it applies a model's weights to multiple inputs at once. Each row of the data matrix represents one observation, and multiplying it by the weight vector yields a prediction. The operation packs many dot products into a single parallel step, which makes it efficient and scalable.

For the multiplication to work, it is essential to check that the dimensions are compatible: the number of columns in the matrix must equal the number of rows in the vector. When they do not match, the transpose (.T) converts a row vector into a column vector. With tools like NumPy, this operation is easy to implement using the @ operator, which makes it possible to generate predictions quickly and to see how changes in the weights affect the model's results.

Alex Xiomar Rubio Lopez

Student

Challenge: change the study-hours and sleep-hours features for three students and get their predicted performance.

Daniela Estupiñan

Student

Daniel Erazo

Instructor

Great solution! 😃

Darlinson Felipe Polania Camacho

Student

I think it is more about balance. From what I see, more study hours with less sleep has no impact on the result: it does not beat the first student, since he is the only one with a good balance between sleep and study hours.

Diego Ortiz

Student

This exam question could have much better context. For example, it could first establish what each position in the array means.

Daniel Erazo

Instructor

Thank you very much for your comment. We will take it into account to improve the question.

Alejandro Molina

Student

Student A apparently has a higher weight than the rest of the students.

Daniel Erazo

Instructor

That's right, great answer!

Jaime Pelaez Valencia

Student
