why padding embedding is 128 x 512 Parameter

<img width="1726" height="1752" alt="Image" src="https://github.com/user-attachments/assets/31b9d766-961c-42af-b297-1e06d7736247" />

<img width="1752" height="1204" alt="Image" src="https://github.com/user-attachments/assets/8e59e582-16cf-4794-9e1e-b1d7fbf6bda7" />


padding embedding in my mind is usually 1 shared token, but GLIDE use nn.Parameter. My question is `what about 128K context-window`？Why not share padding token？

Thanks for your attention 


code is at https://github.com/openai/glide-text2im/blob/main/glide_text2im/text2im_model.py#L10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why padding embedding is 128 x 512 Parameter #53

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

why padding embedding is 128 x 512 Parameter #53

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions