Lipid Modification Database
Tag Content
LipidDB ID
LipidDB-11706-00053
Entry Name
UniProt Accession
Theoretical PI
9.17
Molecular Weight
97212.78
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
Transmembrane protein gp41
Protein Synonyms/Alias
Env polyprotein; SU; Glycoprotein 120; gp120; TM; Glycoprotein 41; gp41;
Gene Name
env
Gene Synonyms/Alias
Created Date
13-AUG-1987
 Lipid Modification Sites 
 Position   Sequence Form   Peptide   References   Modification Type 
764
Canonical
WDDLRSLCLFSYHRL
[1]
S-Palmitoylation
837
Canonical
IEVVQGACRAIRHIP
[1]
S-Palmitoylation
Organism
Human immunodeficiency virus type 1 group M subtype B (isolate HXB2 (HIV-1)
NCBI Taxa ID
11706
Reference
[1] Bhattacharya J, Peters PJ, Clapham PR. Human immunodeficiency virus type 1envelope glycoproteins that lack cytoplasmic domain cysteines: impact onassociation with membrane lipid rafts and incorporation onto budding virusparticles. J Virol. 2004 May;78(10):5500-6.[PMID:15113929]
Functional Description
The surface protein gp120 (SU) attaches the virus to the host lymphoid cell by binding to the primary receptor CD4. This interaction induces a structural rearrangement creating a high affinity binding site for a chemokine coreceptor like CXCR4 and/or CCR5. This peculiar 2 stage receptor-interaction strategy allows gp120 to maintain the highly conserved coreceptor-binding site in a cryptic conformation, protected from neutralizing antibodies. Since CD4 also displays a binding site for the disulfide-isomerase P4HB/PDI, a P4HB/PDI-CD4-CXCR4-gp120 complex may form. In that complex, P4HB/PDI could reach and reduce gp120 disulfide bonds, causing major conformational changes in gp120. TXN, another PDI family member could also be involved in disulfide rearrangements in Env during fusion. These changes are transmitted to the transmembrane protein gp41 and are thought to activate its fusogenic potential by unmasking its fusion peptide.
Sequence Annotation
Topological domain: 33 684 Extracellular.
Transmembrane: 685 705 Helical.
Topological domain: 706 856 Cytoplasmic.
Region: 131 156 V1.
Region: 157 196 V2.
Region: 296 330 V3.
Region: 364 374 CD4-binding loop.
Region: 385 418 V4.
Region: 461 471 V5.
Region: 512 532 Fusion peptide.
Region: 576 592 Immunosuppression.
Region: 662 683 MPER; binding to GalCer.
Motif: 712 715 YXXL motif; contains endocytosis signal.
Motif: 855 856 Di-leucine internalization motif.
Functional site: 511 512 Cleavage; by host furin.
Protein Length
856 AA.
Protein Sequence
(Canonical)
MRVKEKYQHL WRWGWRWGTM LLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA  60
YDTEVHNVWA THACVPTDPN PQEVVLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV  120
KLTPLCVSLK CTDLKNDTNT NSSSGRMIME KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD  180
IIPIDNDTTS YKLTSCNTSV ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT  240
NVSTVQCTHG IRPVVSTQLL LNGSLAEEEV VIRSVNFTDN AKTIIVQLNT SVEINCTRPN  300
NNTRKRIRIQ RGPGRAFVTI GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQFGNNKTII  360
FKQSSGGDPE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTW STEGSNNTEG SDTITLPCRI  420
KQIINMWQKV GKAMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR  480
SELYKYKVVK IEPLGVAPTK AKRRVVQREK RAVGIGALFL GFLGAAGSTM GAASMTLTVQ  540
ARQLLSGIVQ QQNNLLRAIE AQQHLLQLTV WGIKQLQARI LAVERYLKDQ QLLGIWGCSG  600
KLICTTAVPW NASWSNKSLE QIWNHTTWME WDREINNYTS LIHSLIEESQ NQQEKNEQEL  660
LELDKWASLW NWFNITNWLW YIKLFIMIVG GLVGLRIVFA VLSIVNRVRQ GYSPLSFQTH  720
LPTPRGPDRP EGIEEEGGER DRDRSIRLVN GSLALIWDDL RSLCLFSYHR LRDLLLIVTR  780
IVELLGRRGW EALKYWWNLL QYWSQELKNS AVSLLNATAI AVAEGTDRVI EVVQGACRAI  840
RHIPRRIRQG LERILL                                                  856
FASTA
(Canonical)
>LipidDB-11706-00053|P04578
MRVKEKYQHLWRWGWRWGTMLLGMLMICSATEKLWVTVYYGVPVWKEATTTLFCASDAKA
YDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCV
KLTPLCVSLKCTDLKNDTNTNSSSGRMIMEKGEIKNCSFNISTSIRGKVQKEYAFFYKLD
IIPIDNDTTSYKLTSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCT
NVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPN
NNTRKRIRIQRGPGRAFVTIGKIGNMRQAHCNISRAKWNNTLKQIASKLREQFGNNKTII
FKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRI
KQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWR
SELYKYKVVKIEPLGVAPTKAKRRVVQREKRAVGIGALFLGFLGAAGSTMGAASMTLTVQ
ARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSG
KLICTTAVPWNASWSNKSLEQIWNHTTWMEWDREINNYTSLIHSLIEESQNQQEKNEQEL
LELDKWASLWNWFNITNWLWYIKLFIMIVGGLVGLRIVFAVLSIVNRVRQGYSPLSFQTH
LPTPRGPDRPEGIEEEGGERDRDRSIRLVNGSLALIWDDLRSLCLFSYHRLRDLLLIVTR
IVELLGRRGWEALKYWWNLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQGACRAI
RHIPRRIRQGLERILL
Gene Ontology
GO:0044174; C:host cell endosome; IEA:UniProtKB-KW
GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-KW
GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW
GO:0019031; C:viral envelope; IEA:UniProtKB-KW
GO:0005198; F:structural molecule activity; IEA:InterPro
GO:0006915; P:apoptotic process; IEA:UniProtKB-KW
GO:0075512; P:clathrin-mediated endocytosis of virus by host cell; IEA:UniProtKB-KW
GO:0030260; P:entry into host cell; TAS:Reactome
GO:0030683; P:evasion or tolerance by virus of host immune response; IEA:UniProtKB-KW
GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW
GO:0045087; P:innate immune response; TAS:Reactome
GO:0019058; P:viral life cycle; TAS:Reactome
GO:0016032; P:viral process; TAS:Reactome
GO:0019082; P:viral protein processing; TAS:Reactome
GO:0019068; P:virion assembly; TAS:Reactome
GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW
Interpro
InterPro; IPR000777; HIV1_GP160
InterPro; IPR000328; Retroviral_envelope_protein
Pfam
Pfam; PF00516; GP120;
Pfam; PF00517; GP41;
SMART
PROSITE
PRINTS