#attention matrix | Explore Tumblr Posts and Blogs

pythonprogrammingsnippets · 2 years

Text

python fine tune a distilgpt llm model using attention matrix pruning, low-rank approximation and low-rank adaptation (lora)

# fine tune a model with attention matrices pruned and low-rank approximation/adaptation # https://pythonprogrammingsnippets.tumblr.com import torch from transformers import AutoTokenizer, AutoModelForCausalLM import os # load the pretrained model if it exists in _MODELS/lora_attention # otherwise load the pretrained model from huggingface if os.path.exists("_MODELS/lora_attention"): print("loading trained model") # Load the tokenizer tokenizer = AutoTokenizer.from_pretrained("_MODELS/lora_attention") # Load the pre-trained DistilGPT2 model model = AutoModelForCausalLM.from_pretrained("_MODELS/lora_attention") else: print("Downloading pretrained model from huggingface") # Load the tokenizer tokenizer = AutoTokenizer.from_pretrained("distilgpt2") # Load the pre-trained DistilGPT2 model model = AutoModelForCausalLM.from_pretrained("distilgpt2") # set padding token tokenizer.pad_token = tokenizer.eos_token # Define the training data from _DATASETS/data.txt with one sentence per line # now train with the train_data from the file _DATASETS/data.txt with one sentence per line. with open("_DATASETS/data.txt") as f: data = f.read() # now split data by \n train_data = data.split( '\n' ) # shuffle the data import random random.shuffle(train_data) # define the function for pruning the attention matrices def prune_attention_matrices(model, threshold): for name, param in model.named_parameters(): if "attention" in name and "weight" in name: data = param.data data[torch.abs(data) < threshold] = 0 param.data = data # define the function for low-rank approximation of the attention matrices def low_rank_approximation(model, rank): for name, param in model.named_parameters(): if "attention" in name and "weight" in name: data = param.data u, s, v = torch.svd(data) data = torch.mm(u[:, :rank], torch.mm(torch.diag(s[:rank]), v[:, :rank].t())) param.data = data # define the function for low-rank adaptation def low_rank_adaptation(model, train_data, tokenizer, rank, num_epochs, lr): # Define the optimizer and loss function optimizer = torch.optim.Adam(model.parameters(), lr=lr) loss_fn = torch.nn.CrossEntropyLoss() # Tokenize the training data input_ids = tokenizer(train_data, padding=True, truncation=True, return_tensors="pt")["input_ids"] # Perform low-rank adaptation fine-tuning for epoch in range(num_epochs): # Zero the gradients optimizer.zero_grad() # Get the model outputs outputs = model(input_ids=input_ids, labels=input_ids) # Get the loss loss = outputs.loss # Backpropagate the loss loss.backward() # Update the parameters optimizer.step() # Print the loss print("Epoch: {}, Loss: {}".format(epoch, loss.item())) # Low-rank approximation low_rank_approximation(model, rank) # prune the attention matrices prune_attention_matrices(model, 0.1) # low-rank approximation low_rank_approximation(model, 32) # low-rank adaptation low_rank_adaptation(model, train_data, tokenizer, 32, 5, 5e-5) # now train # Define the optimizer and loss function optimizer = torch.optim.Adam(model.parameters(), lr=5e-5) loss_fn = torch.nn.CrossEntropyLoss() # Tokenize the training data input_ids = tokenizer(train_data, padding=True, truncation=True, return_tensors="pt")["input_ids"] # Perform fine-tuning for epoch in range(5): # Zero the gradients optimizer.zero_grad() # Get the model outputs outputs = model(input_ids=input_ids, labels=input_ids) # Get the loss loss = outputs.loss # Backpropagate the loss loss.backward() # Update the parameters optimizer.step() # Print the loss print("Epoch: {}, Loss: {}".format(epoch, loss.item())) # save the model model.save_pretrained("_MODELS/lora_attention") # save the tokenizer tokenizer.save_pretrained("_MODELS/lora_attention") ## # load the model model = AutoModelForCausalLM.from_pretrained("_MODELS/lora_attention") # load the tokenizer tokenizer = AutoTokenizer.from_pretrained("_MODELS/lora_attention") # define the function for generating text def generate_text(model, tokenizer, prompt, max_length): # Tokenize the prompt input_ids = tokenizer(prompt, return_tensors="pt")["input_ids"] # Generate the text output_ids = model.generate(input_ids, max_length=max_length, do_sample=True, top_k=50, top_p=0.95, temperature=0.5, num_return_sequences=1) # Decode the text output_text = tokenizer.decode(output_ids[0], skip_special_tokens=True) # Print the text print(output_text) # generate text generate_text(model, tokenizer, "quick brown", 125)

0 notes

reality-detective · 5 months

Text

Billy Carson explains "How To Escape The Matrix." 🤔

#pay attention #educate yourselves #educate yourself #knowledge is power #reeducate yourself #reeducate yourselves #think about it #think for yourselves #think for yourself #do your homework #do some research #do your own research #ask yourself questions #question everything #billy carson #escape the matrix #stop participating

924 notes · View notes

bearlyted · 1 year

Text

its haaalloween (it is almost may)

#be more chill #bmc #musicals #the squip #jeremy heere #michael mell #christine canigula #thanks for the attention on the other post!! just wanted to make their cool halloween costumes hehe #also YES the squip is dressed up as neo from the matrix

398 notes · View notes

shijiujun · 1 year

Photo

“Tell me, who are you? Let me tell you who you are, or... who you should be.”

- TILL THE END OF THE MOON 长月烬明 (2023) | EP. 36 -

#till the end of the moon #tteotm #cdramaedit #cyjm #chang yue jin ming #my god lyx is so damn good at this #you can tell the diff btw v1 and v3 #ttj and ttj #gotta love the glitch in the matrix #lyx was like can u guys tell the diff #cdrama #me: yeah sure cang jiumin v3 looks like he's trying to look like he's paying attention #to old fart v1 lecture him about how good it was back in the old days

197 notes · View notes

xerith-42 · 2 months

Text

Watching VGHS at a young age ruined my standards for romance actually. Like everything Brian does in Season 1 is so awkward and adorable and endearing as hell to both the audience and Jenny.

We get so much of Brian learning how to take his feelings and Jenny seriously that by the time we get to that line, that scene in the final episode of season 1, the one where Brian finally overcomes everything he's been fighting this season, it's also the most romantic thing he's done the entire season.

"Almost nothing."

Followed by killing an entire team, capping your love interests shitty ex, and then walking away from a narratively satisfying explosion??

Call me Jenny Matrix because I too would have been all over Brian trying to style his do

#xer's rambles #vghs #video game high school #brian d #jenny matrix #it came to my attention recently that this fandom is nearing death. i will not allow this to happen.

9 notes · View notes

therubyjailcell · 6 months

Text

i'm a little bit in love with liu qingge i think

#v talks too much #liu qingge #mxtx #svsss #listen! listen. he's great and i love him #little bit obsessed #this is fine #also liu mingyan deserves so much attention #matrix moving fucker i adore her

16 notes · View notes

synthient · 1 year

Text

the essential Smith character arc is from "cat who is hissing & yowling & flailing & knocking everything off your countertop & clawing your arms to shreds. because he fell into the bathtub as a consequence of his own actions and is So So Sopping Wet" -> "smug anime catboy"

#the matrix #the core of resurrections smiths power is that he Does know the phrase 'you know like nya'#and he Will deploy it after splaying himself across neos lap because neo wasnt paying enough attention to him #agent smith

23 notes · View notes

butchlifeguard · 4 months

Text

millennials 5 years after telling themselves 'we aren't going to bitch about how people these days are lazy and how people were smarter back in my day like boomers did'

[ID: cropped and blue tinted tumblr text that says "the problem isn't just that media literacy is slowly becoming a dying art. it's that people straight up do not pay attention when they watch tv/film anymore." end ID.]

#yeah man nobody pays attention to movies and tv these days #back in my day nobody misinterpreted future classic films. like saw. and the matrix. and fight club. and fucking mean girls idk #because NOWADAYS people dont pay attention and media literacy is lost #id added #edit. im not taking anything out of context btw this was the full text of a post with 10k

4 notes · View notes

wrightandco · 2 months

Text

just had a weird experience where I was in a crowd of people waiting for an event to start and then the minute it hit the commencement time, everyone collectively hushed to silence at same moment, completely unprompted

#it was like a glitch in the matrix everyone just finished speaking at the same time #there was no can I have your attention please #attention was given #they had to just start they had no choice #everyone was like we’re readyyy

2 notes · View notes

imaginefear · 9 months

Text

me: they gonna be sooooo bad guy chic and trickster this time i'm so gonna do that for this thread!

me, 5 mins later when confronted with something living matrix related: so that was a fucking lie.

#pay no attention to the man behind the curtain / ooc.#this like the one place i let him be genuinely good with very little in the way of ulterior motives #his life used to revolve around a matrix and his eldritch crush is a reincarnated one #so things like the tardis are sweet baby angels to him #he'll hurl the doctor into a volcano for a laugh but the tardis gets treated like a favorite granddaughter

3 notes · View notes

reality-detective · 5 months

Text

NUREMBERG 2.0 - GENOCIDE WAR CRIMINALS - NOTHING CAN STOP WHAT IS COMING - Bill Gates, Anthony Fauci, Tedros Ghebreyesus, Alex Azar, Ralph Baric, Peter Daszak, Drosten, Albert Bourla, Stéphane Bancel, Klaus Schwab, Rockefellers, Rothschilds, the DOD are charged with Bioweapon Injection Genocide War Crimes.

If you do your research you'd realize all the above fμ¢%tards are not real. F. I. T. F. O. (Figure It The Fμ¢% Out) 🤔

This video was released in January 2023, who knows when it was actually done?

"Everything You See isn't Fake, it's controlled." - The Truman Show

Do NOT get stuck in the Matrix 🤔