Lipid Modification Database
Tag Content
LipidDB ID
LipidDB-11734-01058
Entry Name
UniProt Accession
Theoretical PI
8.72
Molecular Weight
101251.81
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
Transmembrane protein gp41
Protein Synonyms/Alias
Env polyprotein; SU; Glycoprotein 120; gp120; TM; Glycoprotein 32; gp32;
Gene Name
env
Gene Synonyms/Alias
Created Date
01-NOV-1988
 Lipid Modification Sites 
 Position   Sequence Form   Peptide   References   Modification Type 
789
Canonical
LTWLFSNCRTLLSRA
[1]
S-Palmitoylation
Organism
Simian immunodeficiency virus (isolate Mm251) (SIV-mac) (Simia immunodeficiency virus rhesus monkey)
NCBI Taxa ID
11734
Reference
[1] Vzorov AN, Weidmann A, Kozyr NL, Khaoustov V, Yoffe B, Compans RW. Role of thelong cytoplasmic domain of the SIV Env glycoprotein in early and late stages ofinfection. Retrovirology. 2007 Dec 14;4:94.[PMID:18081926]
Functional Description
The surface protein gp120 (SU) attaches the virus to the host lymphoid cell by binding to the primary receptor CD4. This interaction induces a structural rearrangement creating a high affinity binding site for a chemokine coreceptor like CCR5. This peculiar 2 stage receptor-interaction strategy allows gp120 to maintain the highly conserved coreceptor-binding site in a cryptic conformation, protected from neutralizing antibodies. These changes are transmitted to the transmembrane protein gp41 and are thought to activate its fusogenic potential by unmasking its fusion peptide (By similarity).
Sequence Annotation
Topological domain: 20 696 Extracellular.
Transmembrane: 697 717 Helical.
Topological domain: 718 881 Cytoplasmic.
Region: 113 169 V1.
Region: 170 213 V2.
Region: 313 345 V3.
Region: 404 434 V4.
Region: 477 484 V5.
Region: 528 548 Fusion peptide.
Region: 591 607 Immunosuppression.
Region: 673 694 MPER; binding to GalCer.
Motif: 723 726 YXXV motif; contains endocytosis signal.
Motif: 880 881 Di-leucine internalization motif.
Functional site: 527 528 Cleavage; by host furin.
Functional site: 736 736 In-frame UAG termination codon.
Protein Length
881 AA.
Protein Sequence
(Canonical)
MGCLGNQLLI AILLLSVYGI YCTQYVTVFY GVPAWRNATI PLFCATKNRD TWGTTQCLPD  60
NGDYSELALN VTESFDAWEN TVTEQAIEDV WQLFETSIKP CVKLSPLCIT MRCNKSETDR  120
WGLTKSSTTI TTAAPTSAPV SEKIDMVNET SSCIAQNNCT GLEQEQMISC KFTMTGLKRD  180
KTKEYNETWY STDLVCEQGN STDNESRCYM NHCNTSVIQE SCDKHYWDTI RFRYCAPPGY  240
ALLRCNDTNY SGFMPKCSKV VVSSCTRMME TQTSTWFGFN GTRAENRTYI YWHGRDNRTI  300
ISLNKYYNLT MKCRRPGNKT VLPVTIMSGL VFHSQPINDR PKQAWCWFGG KWKDAIKEVK  360
QTIVKHPRYT GTNNTDKINL TAPGGGDPEV TFMWTNCRGE FLYCKMNWFL NWVEDRDVTT  420
QRPKERHRRN YVPCHIRQII NTWHKVGKNV YLPPREGDLT CNSTVTSLIA NIDWTDGNQT  480
SITMSAEVAE LYRLELGDYK LVEITPIGLA PTDVKRYTTG GTSRNKRGVF VLGFLGFLAT  540
AGSAMGAASL TLTAQSRTLL AGIVQQQQQL LDVVKRQQEL LRLTVWGTKN LQTRVTAIEK  600
YLKDQAQLNA WGCAFRQVCH TTVPWPNASL TPDWNNDTWQ EWERKVDFLE ENITALLEEA  660
QIQQEKNMYE LQKLNSWDVF GNWFDLASWI KYIQYGIYVV VGVILLRIVI YIVQMLAKLR  720
QGYRPVFSSP PSYFQXTHTQ QDPALPTREG KEGDGGEGGG NSSWPWQIEY IHFLIRQLIR  780
LLTWLFSNCR TLLSRAYQIL QPILQRLSAT LRRVREVLRT ELTYLQYGWS YFHEAVQAGW  840
RSATETLAGA WRDLWETLRR GGRWILAIPR RIRQGLELTL L                      881
FASTA
(Canonical)
>LipidDB-11734-01058|P08810
MGCLGNQLLIAILLLSVYGIYCTQYVTVFYGVPAWRNATIPLFCATKNRDTWGTTQCLPD
NGDYSELALNVTESFDAWENTVTEQAIEDVWQLFETSIKPCVKLSPLCITMRCNKSETDR
WGLTKSSTTITTAAPTSAPVSEKIDMVNETSSCIAQNNCTGLEQEQMISCKFTMTGLKRD
KTKEYNETWYSTDLVCEQGNSTDNESRCYMNHCNTSVIQESCDKHYWDTIRFRYCAPPGY
ALLRCNDTNYSGFMPKCSKVVVSSCTRMMETQTSTWFGFNGTRAENRTYIYWHGRDNRTI
ISLNKYYNLTMKCRRPGNKTVLPVTIMSGLVFHSQPINDRPKQAWCWFGGKWKDAIKEVK
QTIVKHPRYTGTNNTDKINLTAPGGGDPEVTFMWTNCRGEFLYCKMNWFLNWVEDRDVTT
QRPKERHRRNYVPCHIRQIINTWHKVGKNVYLPPREGDLTCNSTVTSLIANIDWTDGNQT
SITMSAEVAELYRLELGDYKLVEITPIGLAPTDVKRYTTGGTSRNKRGVFVLGFLGFLAT
AGSAMGAASLTLTAQSRTLLAGIVQQQQQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEK
YLKDQAQLNAWGCAFRQVCHTTVPWPNASLTPDWNNDTWQEWERKVDFLEENITALLEEA
QIQQEKNMYELQKLNSWDVFGNWFDLASWIKYIQYGIYVVVGVILLRIVIYIVQMLAKLR
QGYRPVFSSPPSYFQXTHTQQDPALPTREGKEGDGGEGGGNSSWPWQIEYIHFLIRQLIR
LLTWLFSNCRTLLSRAYQILQPILQRLSATLRRVREVLRTELTYLQYGWSYFHEAVQAGW
RSATETLAGAWRDLWETLRRGGRWILAIPRRIRQGLELTLL
Gene Ontology
GO:0044174; C:host cell endosome; IEA:UniProtKB-KW
GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-KW
GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW
GO:0019031; C:viral envelope; IEA:UniProtKB-KW
GO:0005198; F:structural molecule activity; IEA:InterPro
GO:0006915; P:apoptotic process; IEA:UniProtKB-KW
GO:0039663; P:membrane fusion involved in viral entry into host cell; IEA:UniProtKB-KW
GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW
Interpro
InterPro; IPR000777; HIV1_GP160
InterPro; IPR000328; Retroviral_envelope_protein
Pfam
Pfam; PF00516; GP120;
Pfam; PF00517; GP41;
SMART
PROSITE
PRINTS