Contents:

- Skip the chatter: just read the code
- Q2: Image Captioning with Vanilla RNNs
  - A bug in the provided utility code (symptom, diagnosis, fix)
  - rnn_step_forward
  - rnn_step_backward
  - rnn_forward
  - rnn_backward
  - word_embedding_forward
  - word_embedding_backward
  - CaptioningRNN.loss
  - CaptioningRNN.sample
  - Conclusion

Each implementation section below is split into Problem / Approach / Code / Output.

## Skip the chatter: just read the code

## Q2: Image Captioning with Vanilla RNNs

### A bug in the provided utility code

`image_from_url` raises:

> [WinError 32] The process cannot access the file because it is being used by another process: C:\Users\Leezed\AppData\Local\Temp\tmp7r3fjusu

#### The symptom

Running the cell that fetches images fails with the error above: another process is still using the temporary file, so it cannot be accessed (or deleted).

#### Tracking it down

At first I blamed `img = imread(fname)`: I assumed the read kept the file occupied, so its handle was never released and `os.remove(fname)` could not delete it. Concretely, I imagined `imread(fname)` reading the image on another thread while `os.remove` ran before the read had finished. So I added a `time.sleep(5)` delay — same error, even though five seconds is plenty to read one image. Still not convinced, I replaced `img = imread(fname)` with `img = None` and got the same error again, so `imread` clearly wasn't the culprit. I even suspected that `ff` was never closed, but the `with ... as` block closes it automatically, which made things stranger. Stepping through the code, I finally narrowed it down to `tempfile.mkstemp`. The documentation says it returns two values: `fd`, a file descriptor, and `fname`, the absolute path of the newly created file.

So what is a file descriptor?

The kernel accesses files through file descriptors. A file descriptor is a non-negative integer: when you open an existing file or create a new one, the kernel hands one back, and subsequent reads and writes refer to the file through it. Within a single process each descriptor points to exactly one file, although one file can be referenced by several descriptors. Descriptors are unique within a process; different processes may reuse the same numeric value, possibly for different files.

So if we call `os.remove(fname)` directly, the descriptor returned by `mkstemp` is still open in our process — as far as Windows is concerned the file is still in use — and the deletion fails.

How do we delete the temporary file properly, then? First call `os.close(fd)` to close the descriptor, and only then call `os.remove(fname)`.

#### The fix

Add one line, `os.close(_)`, right before `os.remove(fname)` (the provided helper binds the descriptor to `_`), as sketched below.
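The original post shows the patched helper as a screenshot. As a substitute, here is a minimal sketch of what the fix looks like, assuming the stock `image_from_url` helper from the assignment's `image_utils` module (urllib download into a `tempfile.mkstemp` file, then `imread`); the exact imports and error handling in your copy may differ.

```python
import os
import tempfile
import urllib.error
import urllib.request

from imageio import imread  # assumption: an imread-style reader, as in the assignment utils


def image_from_url(url):
    """Read an image from a URL into a numpy array via a temporary file."""
    try:
        f = urllib.request.urlopen(url)
        _, fname = tempfile.mkstemp()      # mkstemp returns (file descriptor, absolute path)
        with open(fname, "wb") as ff:
            ff.write(f.read())
        img = imread(fname)
        os.close(_)        # the fix: close the descriptor mkstemp left open...
        os.remove(fname)   # ...so Windows no longer treats the file as "in use"
        return img
    except urllib.error.URLError as e:
        print("URL Error: ", e.reason, url)
```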
### rnn_step_forward

#### Problem

Implement a single forward step of a vanilla RNN.

#### Approach

The lecture covers this in detail; otherwise the comments in the code should be enough. (These functions go into the assignment's `rnn_layers.py`, which already does `import numpy as np`.)

#### Code

```python
def rnn_step_forward(x, prev_h, Wx, Wh, b):
    """
    Run the forward pass for a single timestep of a vanilla RNN using a tanh activation function.

    The input data has dimension D, the hidden state has dimension H,
    and the minibatch is of size N.

    Inputs:
    - x: Input data for this timestep, of shape (N, D)
    - prev_h: Hidden state from previous timestep, of shape (N, H)
    - Wx: Weight matrix for input-to-hidden connections, of shape (D, H)
    - Wh: Weight matrix for hidden-to-hidden connections, of shape (H, H)
    - b: Biases of shape (H,)

    Returns a tuple of:
    - next_h: Next hidden state, of shape (N, H)
    - cache: Tuple of values needed for the backward pass.
    """
    next_h, cache = None, None
    ##############################################################################
    # TODO: Implement a single forward step for the vanilla RNN. Store the next  #
    # hidden state and any values you need for the backward pass in the next_h   #
    # and cache variables respectively.                                          #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # next_h = tanh(x @ Wx + prev_h @ Wh + b)
    next_h = x @ Wx + prev_h @ Wh + b
    next_h = np.tanh(next_h)
    cache = (x, prev_h, Wx, Wh, b, next_h)

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return next_h, cache
```

#### Output

*(Here and in the sections below, "Problem" shows a screenshot of the notebook prompt and "Output" a screenshot of the passing check in the original post; they are not reproduced.)*

### rnn_step_backward

#### Problem

Implement a single backward step of a vanilla RNN.

#### Approach

Nothing to elaborate here — see the code. For the derivative of tanh, see the linked article "激活函数tanh(x)求导".

#### Code

```python
def rnn_step_backward(dnext_h, cache):
    """
    Backward pass for a single timestep of a vanilla RNN.

    Inputs:
    - dnext_h: Gradient of loss with respect to next hidden state, of shape (N, H)
    - cache: Cache object from the forward pass

    Returns a tuple of:
    - dx: Gradients of input data, of shape (N, D)
    - dprev_h: Gradients of previous hidden state, of shape (N, H)
    - dWx: Gradients of input-to-hidden weights, of shape (D, H)
    - dWh: Gradients of hidden-to-hidden weights, of shape (H, H)
    - db: Gradients of bias vector, of shape (H,)
    """
    dx, dprev_h, dWx, dWh, db = None, None, None, None, None
    ##############################################################################
    # TODO: Implement the backward pass for a single step of a vanilla RNN.      #
    #                                                                            #
    # HINT: For the tanh function, you can compute the local derivative in terms #
    # of the output value from tanh.                                             #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    x, prev_h, Wx, Wh, b, next_h = cache

    # d tanh(a)/da = 1 - tanh(a)^2, expressed through the cached output next_h
    dnext_h = dnext_h * (1 - next_h ** 2)
    dx = dnext_h @ Wx.T
    dprev_h = dnext_h @ Wh.T
    dWx = x.T @ dnext_h
    dWh = prev_h.T @ dnext_h
    db = np.sum(dnext_h, axis=0)

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return dx, dprev_h, dWx, dWh, db
```
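For reference, this is the chain rule the backward step implements. With $a = xW_x + h_{prev}W_h + b$ and $h_{next} = \tanh(a)$, an upstream gradient $\mathrm{d}h_{next}$ gives

$$
\begin{aligned}
\mathrm{d}a &= \mathrm{d}h_{next} \odot \bigl(1 - h_{next}^{2}\bigr)\\
\mathrm{d}x &= \mathrm{d}a\,W_x^{\top}, \qquad \mathrm{d}h_{prev} = \mathrm{d}a\,W_h^{\top}\\
\mathrm{d}W_x &= x^{\top}\mathrm{d}a, \qquad \mathrm{d}W_h = h_{prev}^{\top}\mathrm{d}a, \qquad \mathrm{d}b = \textstyle\sum_{n}\mathrm{d}a_{n,:}
\end{aligned}
$$

which is exactly the sequence of lines between the START/END markers above.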
#### Output

### rnn_forward

#### Problem

#### Approach

See the code.

#### Code

```python
def rnn_forward(x, h0, Wx, Wh, b):
    """
    Run a vanilla RNN forward on an entire sequence of data.

    We assume an input sequence composed of T vectors, each of dimension D. The RNN uses a hidden
    size of H, and we work over a minibatch containing N sequences. After running the RNN forward,
    we return the hidden states for all timesteps.

    Inputs:
    - x: Input data for the entire timeseries, of shape (N, T, D)
    - h0: Initial hidden state, of shape (N, H)
    - Wx: Weight matrix for input-to-hidden connections, of shape (D, H)
    - Wh: Weight matrix for hidden-to-hidden connections, of shape (H, H)
    - b: Biases of shape (H,)

    Returns a tuple of:
    - h: Hidden states for the entire timeseries, of shape (N, T, H)
    - cache: Values needed in the backward pass
    """
    h, cache = None, None
    ##############################################################################
    # TODO: Implement forward pass for a vanilla RNN running on a sequence of    #
    # input data. You should use the rnn_step_forward function that you defined  #
    # above. You can use a for loop to help compute the forward pass.            #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # Shapes
    N, T, _ = x.shape
    _, H = h0.shape

    # Initialization
    h = np.zeros((N, T, H))
    cache = []

    # Forward pass, one timestep at a time
    for i in range(T):
        if i == 0:
            h[:, i, :], cache_i = rnn_step_forward(x[:, i, :], h0, Wx, Wh, b)
        else:
            h[:, i, :], cache_i = rnn_step_forward(x[:, i, :], h[:, i - 1, :], Wx, Wh, b)
        cache.append(cache_i)
    cache = tuple(cache)

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return h, cache
```

#### Output

### rnn_backward

#### Problem

#### Approach

See the code — if you followed the lectures carefully it should all make sense.

#### Code

```python
def rnn_backward(dh, cache):
    """
    Compute the backward pass for a vanilla RNN over an entire sequence of data.

    Inputs:
    - dh: Upstream gradients of all hidden states, of shape (N, T, H)

    NOTE: 'dh' contains the upstream gradients produced by the
    individual loss functions at each timestep, *not* the gradients
    being passed between timesteps (which you'll have to compute yourself
    by calling rnn_step_backward in a loop).

    Returns a tuple of:
    - dx: Gradient of inputs, of shape (N, T, D)
    - dh0: Gradient of initial hidden state, of shape (N, H)
    - dWx: Gradient of input-to-hidden weights, of shape (D, H)
    - dWh: Gradient of hidden-to-hidden weights, of shape (H, H)
    - db: Gradient of biases, of shape (H,)
    """
    dx, dh0, dWx, dWh, db = None, None, None, None, None
    ##############################################################################
    # TODO: Implement the backward pass for a vanilla RNN running an entire      #
    # sequence of data. You should use the rnn_step_backward function that you   #
    # defined above. You can use a for loop to help compute the backward pass.   #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # Shapes (cache[i] = (x, prev_h, Wx, Wh, b, next_h) for timestep i)
    N, T, H = dh.shape
    D, _ = cache[0][2].shape

    # Initialization
    dx = np.zeros((N, T, D))
    dh0 = np.zeros((N, H))
    dWx = np.zeros((D, H))
    dWh = np.zeros((H, H))
    db = np.zeros((H,))

    # Backward pass through time; dh0 doubles as the running dprev_h, so each
    # step receives the per-timestep upstream gradient plus the gradient
    # flowing back from the following timestep.
    for i in range(T - 1, -1, -1):
        dx[:, i, :], dh0, dWx_i, dWh_i, db_i = rnn_step_backward(dh[:, i, :] + dh0, cache[i])
        dWx += dWx_i
        dWh += dWh_i
        db += db_i

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return dx, dh0, dWx, dWh, db
```
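The notebook verifies these functions with `eval_numerical_gradient_array`; a minimal, self-contained version of that check (assuming the `rnn_forward` / `rnn_backward` defined above and made-up sizes) might look like the sketch below.

```python
import numpy as np

np.random.seed(0)
N, T, D, H = 2, 4, 3, 5                 # small made-up sizes
x = np.random.randn(N, T, D)
h0 = np.random.randn(N, H)
Wx = np.random.randn(D, H)
Wh = np.random.randn(H, H)
b = np.random.randn(H)

h, cache = rnn_forward(x, h0, Wx, Wh, b)
dh = np.random.randn(*h.shape)          # fake upstream gradient
dx, dh0, dWx, dWh, db = rnn_backward(dh, cache)


def num_grad(f, a, df, eps=1e-6):
    """Central-difference gradient of sum(f() * df) w.r.t. a (perturbed in place, then restored)."""
    grad = np.zeros_like(a)
    for idx in np.ndindex(a.shape):
        old = a[idx]
        a[idx] = old + eps
        plus = np.sum(f() * df)
        a[idx] = old - eps
        minus = np.sum(f() * df)
        a[idx] = old
        grad[idx] = (plus - minus) / (2 * eps)
    return grad


forward = lambda: rnn_forward(x, h0, Wx, Wh, b)[0]
# Differences should be tiny (around 1e-8 or smaller) if the analytic gradients are right.
print("dWx diff:", np.max(np.abs(num_grad(forward, Wx, dh) - dWx)))
print("dh0 diff:", np.max(np.abs(num_grad(forward, h0, dh) - dh0)))
```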
#### Output

### word_embedding_forward

#### What word embedding is

For background on the technique itself, see the separate post "word embedding 技术解释" linked in the original article.

#### Problem

#### Approach

See the code.

#### Code

```python
def word_embedding_forward(x, W):
    """
    Forward pass for word embeddings.

    We operate on minibatches of size N where
    each sequence has length T. We assume a vocabulary of V words, assigning each
    word to a vector of dimension D.

    Inputs:
    - x: Integer array of shape (N, T) giving indices of words. Each element idx
      of x must be in the range 0 <= idx < V.
    - W: Weight matrix of shape (V, D) giving word vectors for all words.

    Returns a tuple of:
    - out: Array of shape (N, T, D) giving word vectors for all input words.
    - cache: Values needed for the backward pass
    """
    out, cache = None, None
    ##############################################################################
    # TODO: Implement the forward pass for word embeddings.                      #
    #                                                                            #
    # HINT: This can be done in one line using NumPy's array indexing.           #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # Fancy indexing: pick row x[n, t] of W for every position (n, t)
    out = W[x]
    cache = (x, W)

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return out, cache
```

#### Output

### word_embedding_backward

#### Problem

#### Approach

#### Code

```python
def word_embedding_backward(dout, cache):
    """
    Backward pass for word embeddings.

    We cannot back-propagate into the words
    since they are integers, so we only return gradient for the word embedding
    matrix.

    HINT: Look up the function np.add.at

    Inputs:
    - dout: Upstream gradients of shape (N, T, D)
    - cache: Values from the forward pass

    Returns:
    - dW: Gradient of word embedding matrix, of shape (V, D)
    """
    dW = None
    ##############################################################################
    # TODO: Implement the backward pass for word embeddings.                     #
    #                                                                            #
    # Note that words can appear more than once in a sequence.                   #
    # HINT: Look up the function np.add.at                                       #
    ##############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    x, W = cache
    dW = np.zeros_like(W)
    # Scatter-add: the rows indexed by x accumulate the matching slices of dout,
    # which correctly handles words that appear more than once in a sequence.
    np.add.at(dW, x, dout)

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ##############################################################################
    #                               END OF YOUR CODE                             #
    ##############################################################################
    return dW
```
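To see why `np.add.at` is needed (the same word can occur several times in one caption, so its row must accumulate several gradients), here is a tiny standalone demo with made-up numbers:

```python
import numpy as np

V, D = 4, 2                              # made-up vocabulary size and embedding dim
W = np.arange(V * D, dtype=float).reshape(V, D)
x = np.array([[0, 2, 0]])                # word 0 appears twice in this "caption"

out = W[x]                               # forward pass: fancy indexing, shape (1, 3, 2)

dout = np.ones_like(out)                 # pretend upstream gradient
dW = np.zeros_like(W)
np.add.at(dW, x, dout)                   # scatter-add; a plain dW[x] += dout would
                                         # update row 0 only once instead of twice
print(dW)
# [[2. 2.]
#  [0. 0.]
#  [1. 1.]
#  [0. 0.]]
```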
#### Output

### CaptioningRNN.loss

#### Problem

#### Approach

Just follow the steps listed in the prompt (they are repeated in the TODO comment inside the code) one after another; the snippet below makes the `captions_in` / `captions_out` offset concrete.
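A toy example with made-up word indices:

```python
import numpy as np

# Suppose 1 = <START>, 2 = <END>, and 5, 7 are ordinary words (made-up indices).
captions = np.array([[1, 5, 7, 2]])

captions_in = captions[:, :-1]    # [[1, 5, 7]]  -> fed to the RNN
captions_out = captions[:, 1:]    # [[5, 7, 2]]  -> what the RNN is trained to predict
# At timestep t the model receives captions_in[:, t] and is scored against
# captions_out[:, t]: after seeing word t it must predict word t + 1.
```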
#### Code

```python
def loss(self, features, captions):
    """
    Compute training-time loss for the RNN. We input image features and
    ground-truth captions for those images, and use an RNN (or LSTM) to compute
    loss and gradients on all parameters.

    Inputs:
    - features: Input image features, of shape (N, D)
    - captions: Ground-truth captions; an integer array of shape (N, T + 1) where
      each element is in the range 0 <= y[i, t] < V

    Returns a tuple of:
    - loss: Scalar loss
    - grads: Dictionary of gradients parallel to self.params
    """
    # Cut captions into two pieces: captions_in has everything but the last word
    # and will be input to the RNN; captions_out has everything but the first
    # word and this is what we will expect the RNN to generate. These are offset
    # by one relative to each other because the RNN should produce word (t+1)
    # after receiving word t. The first element of captions_in will be the START
    # token, and the first element of captions_out will be the first word.
    captions_in = captions[:, :-1]
    captions_out = captions[:, 1:]

    # You'll need this
    mask = captions_out != self._null

    # Weight and bias for the affine transform from image features to initial
    # hidden state
    W_proj, b_proj = self.params["W_proj"], self.params["b_proj"]

    # Word embedding matrix
    W_embed = self.params["W_embed"]

    # Input-to-hidden, hidden-to-hidden, and biases for the RNN
    Wx, Wh, b = self.params["Wx"], self.params["Wh"], self.params["b"]

    # Weight and bias for the hidden-to-vocab transformation.
    W_vocab, b_vocab = self.params["W_vocab"], self.params["b_vocab"]

    loss, grads = 0.0, {}
    ############################################################################
    # TODO: Implement the forward and backward passes for the CaptioningRNN.   #
    # In the forward pass you will need to do the following:                   #
    # (1) Use an affine transformation to compute the initial hidden state     #
    #     from the image features. This should produce an array of shape (N, H)#
    # (2) Use a word embedding layer to transform the words in captions_in     #
    #     from indices to vectors, giving an array of shape (N, T, W).         #
    # (3) Use either a vanilla RNN or LSTM (depending on self.cell_type) to    #
    #     process the sequence of input word vectors and produce hidden state  #
    #     vectors for all timesteps, producing an array of shape (N, T, H).    #
    # (4) Use a (temporal) affine transformation to compute scores over the    #
    #     vocabulary at every timestep using the hidden states, giving an      #
    #     array of shape (N, T, V).                                            #
    # (5) Use (temporal) softmax to compute loss using captions_out, ignoring  #
    #     the points where the output word is NULL using the mask above.       #
    #                                                                          #
    # Do not worry about regularizing the weights or their gradients!          #
    #                                                                          #
    # In the backward pass you will need to compute the gradient of the loss   #
    # with respect to all model parameters. Use the loss and grads variables   #
    # defined above to store loss and gradients; grads[k] should give the      #
    # gradients for self.params[k].                                            #
    #                                                                          #
    # Note also that you are allowed to make use of functions from layers.py   #
    # in your implementation, if needed.                                       #
    ############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # Step 1: affine transform from the image features to the initial hidden state
    h0, cache_h0 = affine_forward(features, W_proj, b_proj)

    # Step 2: word embedding layer turns the input words into word vectors
    word_vector, cache_word_vector = word_embedding_forward(captions_in, W_embed)

    # Step 3: RNN (or LSTM) turns the word-vector sequence into hidden states
    if self.cell_type == "rnn":
        h, cache_h = rnn_forward(word_vector, h0, Wx, Wh, b)
    elif self.cell_type == "lstm":
        h, cache_h = lstm_forward(word_vector, h0, Wx, Wh, b)

    # Step 4: temporal affine layer maps hidden states to vocabulary scores
    scores, cache_scores = temporal_affine_forward(h, W_vocab, b_vocab)

    # Step 5: temporal softmax loss against captions_out, masking NULL tokens
    loss, dscores = temporal_softmax_loss(scores, captions_out, mask)

    # Backward pass
    # Step 4 backward: temporal affine layer
    dh, dW_vocab, db_vocab = temporal_affine_backward(dscores, cache_scores)

    # Step 3 backward: RNN or LSTM
    if self.cell_type == "rnn":
        dword_vector, dh0, dWx, dWh, db = rnn_backward(dh, cache_h)
    elif self.cell_type == "lstm":
        dword_vector, dh0, dWx, dWh, db = lstm_backward(dh, cache_h)

    # Step 2 backward: word embedding layer
    dW_embed = word_embedding_backward(dword_vector, cache_word_vector)

    # Step 1 backward: affine layer on the image features
    dfeatures, dW_proj, db_proj = affine_backward(dh0, cache_h0)

    # Collect the gradients
    grads["W_proj"] = dW_proj
    grads["b_proj"] = db_proj
    grads["W_embed"] = dW_embed
    grads["Wx"] = dWx
    grads["Wh"] = dWh
    grads["b"] = db
    grads["W_vocab"] = dW_vocab
    grads["b_vocab"] = db_vocab

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ############################################################################
    #                             END OF YOUR CODE                             #
    ############################################################################

    return loss, grads
```

#### Output

### CaptioningRNN.sample

#### Problem

#### Approach

See the comments in the code.

#### Code

```python
def sample(self, features, max_length=30):
    """
    Run a test-time forward pass for the model, sampling captions for input
    feature vectors.

    At each timestep, we embed the current word, pass it and the previous hidden
    state to the RNN to get the next hidden state, use the hidden state to get
    scores for all vocab words, and choose the word with the highest score as
    the next word. The initial hidden state is computed by applying an affine
    transform to the input image features, and the initial word is the <START>
    token.

    For LSTMs you will also have to keep track of the cell state; in that case
    the initial cell state should be zero.

    Inputs:
    - features: Array of input image features of shape (N, D).
    - max_length: Maximum length T of generated captions.

    Returns:
    - captions: Array of shape (N, max_length) giving sampled captions,
      where each element is an integer in the range [0, V). The first element
      of captions should be the first sampled word, not the <START> token.
    """
    N = features.shape[0]
    captions = self._null * np.ones((N, max_length), dtype=np.int32)

    # Unpack parameters
    W_proj, b_proj = self.params["W_proj"], self.params["b_proj"]
    W_embed = self.params["W_embed"]
    Wx, Wh, b = self.params["Wx"], self.params["Wh"], self.params["b"]
    W_vocab, b_vocab = self.params["W_vocab"], self.params["b_vocab"]

    ###########################################################################
    # TODO: Implement test-time sampling for the model. You will need to      #
    # initialize the hidden state of the RNN by applying the learned affine   #
    # transform to the input image features. The first word that you feed to  #
    # the RNN should be the <START> token; its value is stored in the         #
    # variable self._start. At each timestep you will need to do the          #
    # following:                                                              #
    # (1) Embed the previous word using the learned word embeddings           #
    # (2) Make an RNN step using the previous hidden state and the embedded   #
    #     current word to get the next hidden state.                          #
    # (3) Apply the learned affine transformation to the next hidden state to #
    #     get scores for all words in the vocabulary                          #
    # (4) Select the word with the highest score as the next word, writing it #
    #     (the word index) to the appropriate slot in the captions variable   #
    #                                                                         #
    # For simplicity, you do not need to stop generating after an <END> token #
    # is sampled, but you can if you want to.                                 #
    #                                                                         #
    # HINT: You will not be able to use the rnn_forward or lstm_forward       #
    # functions; you'll need to call rnn_step_forward or lstm_step_forward in #
    # a loop.                                                                 #
    #                                                                         #
    # NOTE: we are still working over minibatches in this function. Also if   #
    # you are using an LSTM, initialize the first cell state to zeros.        #
    ###########################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    # Step 1: initial hidden state from the image features
    h, _ = affine_forward(features, W_proj, b_proj)

    # Step 2: the first input word is the <START> token; the LSTM cell state starts at zero
    word = np.repeat(self._start, N)
    c = np.zeros_like(h)

    # Step 3: generate the caption one word at a time
    for i in range(max_length):
        # (1) embed the previous word
        word, _ = word_embedding_forward(word, W_embed)
        # (2) one RNN/LSTM step to get the next hidden state
        if self.cell_type == "rnn":
            h, _ = rnn_step_forward(word, h, Wx, Wh, b)
        elif self.cell_type == "lstm":
            h, c, _ = lstm_step_forward(word, h, c, Wx, Wh, b)
        # (3) scores over the vocabulary at this timestep
        scores, _ = affine_forward(h, W_vocab, b_vocab)
        # (4) pick the highest-scoring word, record it, and feed it to the next step
        word = np.argmax(scores, axis=1)
        captions[:, i] = word

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    ############################################################################
    #                             END OF YOUR CODE                             #
    ############################################################################
    return captions
```

#### Output

## Conclusion

Understanding recurrent networks takes more than the lectures alone: you also need the hands-on work of the assignment, and going back to the course material afterwards leaves you with a much deeper understanding.