C#集成AzureAD

最新推荐文章于 2024-08-31 09:00:52 发布

axb你猜我猜不猜

最新推荐文章于 2024-08-31 09:00:52 发布

阅读量540

点赞数 7

文章标签： c# flask 开发语言

本文链接：https://blog.csdn.net/weixin_44763000/article/details/140779219

版权

Microsoft 将 Azure Active Directory (Azure AD) 更名为 Microsoft Entra ID，以介绍产品的多云多平台功能、缓解与 Windows Server Active Directory 的混淆，并统一 Microsoft Entra 产品系列。

使用或服务无中断

如果你目前正在使用 Azure AD 或以前在组织中部署了 Azure AD，则可以继续使用该服务，而不会出现中断。所有现有的部署、配置和集成继续如常运行，无需执行任何操作。

可以继续使用熟悉的 Azure AD 功能，这些功能可以通过 Azure 门户、Microsoft 365 管理中心和 Microsoft Entra 管理中心进行访问。

所有特性和功能在该产品中仍然可用。许可、条款、服务级别协议、产品认证、支持和定价保持不变。

为了实现无缝转换，所有现有登录 URL、API、PowerShell cmdlet 和 Microsoft 身份验证库 (MSAL) 以及开发人员体验和工具都保持不变。

服务计划显示名称将于 2023 年 10 月 1 日更改。 Microsoft Entra ID Free、Microsoft Entra ID P1 和 Microsoft Entra ID P2 将是独立产品/服务的新名称，当前 Azure AD 计划中包含的所有功能保持不变。 Microsoft Entra ID（以前称为 Azure AD）继续包含在 Microsoft 365 许可计划中，其中包括 Microsoft 365 E3 和 Microsoft 365 E5。有关定价和包含内容的详细信息可在定价和免费试用版页面上找到。

# pip install modelscope transformers peft datasets

import torch
from modelscope import BitsAndBytesConfig, snapshot_download, AutoTokenizer, AutoModelForCausalLM
from transformers import AutoTokenizer, AutoModelForCausalLM, Trainer, TrainingArguments, DataCollatorForSeq2Seq
from peft import get_peft_model, LoraConfig, TaskType
from datasets import load_dataset
import json
import torch


# load model
if torch.cuda.is_available():
    device = 'cuda'
print(device)

# download model
model_dir_1 = snapshot_download("qwen/Qwen2-0.5B-Instruct")
model_dir_2 = snapshot_download("qwen/Qwen2-0.5B")


# 量化加载模型
_bnb_config = BitsAndBytesConfig(load_in_4bit=True, # 加载为4位
                                bnb_4bit_use_double_quant=True,# 双量化 权重和激活都量化
                                bnb_4bit_quant_type="nf4", # 非对称4位
                                bnb_4bit_compute_dtype=torch.float32)# 放到32位精度上训练


# load model & tokenizer
model_path = "YOUR_MODEL_PATH"
_model = AutoModelForCausalLM.from_pretrained(model_path,
                                             quantization_config=_bnb_config,
                                             device_map="auto",
                                             torch_dtype="auto")

_tokenizer = AutoTokenizer.from_pretrained(model_path)



messages = [

    {"role":"system", "content":"you are a helpful assistant"},
    {"role":"user", "content":"你是谁？"}
]

# 查看template
_tokenizer.apply_chat_template(messages, tokenize=False)


# 处理 json 文件（假设已经有）
# [{"instruction":"问题", "output":"答案"}]
_datasets = load_dataset("json", data_files="json_file_name.json", split="train")


# 数据预处理
def preprocess_dataset(example):
    MAX_LENGTH = 256
    _input_ids, _attention_mask, _labels = [], [], []
    _instruction = _tokenizer(f'<|im_start|>system\nyou are a helpful assistant<|im_end|>\n<|im_start|>user\n{example["instruction"]}<|im_end|>\n',
                                         add_special_tokens=False)
    # 注意这里需要加上add_special_tokens=False以及在output那里加上_tokenizer.eos_token
    # 注意这里不需要return为tensors，不然会报错
    _response = _tokenizer(f'<|im_start|>assistant\n{example["output"]  + _tokenizer.eos_token}<|im_end|>', add_special_tokens=False)
    _input_ids = _instruction["input_ids"] + _response["input_ids"]
    _attention_mask = _instruction["attention_mask"] + _response["attention_mask"]
    _labels = [100] * len(_instruction["input_ids"]) + _response["input_ids"]

    if len(_input_ids) > MAX_LENGTH:
        _input_ids = _input_ids[:MAX_LENGTH]
        _attention_mask = _attention_mask[:MAX_LENGTH]
        _labels = _labels[:MAX_LENGTH]

    return {
        "input_ids":_input_ids,
        "attention_mask":_attention_mask,
        "labels":_labels
    }




# 操作dataset
_datasets = _datasets.map(preprocess_dataset, remove_columns=_datasets.column_names)
_datasets = _datasets.shuffle()


# fine tune


# trainining config
# trainining config
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules="all-linear",
    task_type=TaskType.CAUSAL_LM, # 因果语言模型
)
# 加载预训练模型和Lora config
_model = get_peft_model(_model, lora_config)

# trainig

# training parameters
_training_args = TrainingArguments(
    output_dir='checkpoints/lora',
    gradient_accumulation_steps=2,
    per_device_train_batch_size=16, # batch_size
    save_steps=300,
    logging_steps=100,
    num_train_epochs=300
)

_trainer = Trainer(
    model=_model,
    args=_training_args,
    train_dataset=_datasets,
    data_collator=DataCollatorForSeq2Seq(tokenizer=_tokenizer, padding=True)
)

_trainer.train()


# 加载预训练过后的模型

# 加载微调过后的模型(还没有merge), 通过 PeftModel
from peft import PeftModel
from modelscope import AutoModelForCausalLM, AutoTokenizer

_model = AutoModelForCausalLM.from_pretrained("/root/.cache/modelscope/hub/qwen/Qwen2-0___5B", 
                                             torch_dtype="auto",
                                             device_map="auto")

_tokenizer = AutoTokenizer.from_pretrained("/root/.cache/modelscope/hub/qwen/Qwen2-0___5B")

peft_model = PeftModel.from_pretrained(model=_model, model_id="your_path/checkpoint-3750")


# 问问题
# use model to generate QA pair
def ask(question, model, tokenizer):
    messages = [
        {"role":"system", "content":"you are a helpful assistant"},
        {"role":"user", "content": question}
    ]
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    model_inputs = tokenizer([text], return_tensors="pt").to("cuda")
    generated_ids = model.generate(**model_inputs, max_new_tokens=128)
    generated_ids = [output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)]
    answer = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
    return answer


ask("你是谁？", model, tokenizer)


# merge model

ckpt_list = ["checkpoint-3750"] # checkpoint name

for checkpoint in ckpt_list:
    print('Merge checkpoint: {}'.format(checkpoint))
    model = PeftModel.from_pretrained(_model, os.path.join("your_path", checkpoint))
    model = model.merge_and_unload()

print('merge config =', model.config)


# save model
model.save_pretrained("fine-tuned-model_path")
# 注意tokenizer不变
tokenizer.save_pretrained("fine-tuned-model_path")


# load new model

model = AutoModelForCausalLM.from_pretrained("fine-tuned-model_path")
tokenizer = AutoTokenizer.from_pretrained("fine-tuned-model_path")

axb你猜我猜不猜

关注

7
点赞
踩
13

收藏

觉得还不错? 一键收藏
0
评论
C#集成AzureAD

Microsoft 将 Azure Active Directory (Azure AD) 更名为 Microsoft Entra ID，以介绍产品的多云多平台功能、缓解与 Windows Server Active Directory 的混淆，并统一产品系列。
复制链接

扫一扫