Lipid Modification Database
Tag Content
LipidDB ID
LipidDB-9606-01000
Entry Name
UniProt Accession
Theoretical pI
8.55
Molecular Weight
160614.77
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
Arresten
Protein Synonyms/Alias
Gene Name
COL4A1
Gene Synonyms/Alias
Created Date
21-JUL-1986
Lipid Modification Sites
Position   Sequence Form   Peptide           References   Modification Type
434        Canonical       YTNGIVECQPGPPGD   [1]          S-Palmitoylation
1570       Canonical       QTIQIPPCPSGWSSL   [1]          S-Palmitoylation
1662       Canonical       LRTHVSRCQVCMRRT   [1]          S-Palmitoylation
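Each peptide in the site table appears to be a 15-residue window, seven residues on either side of the modified cysteine, taken from the canonical sequence at the 1-based position listed. The sketch below reproduces the window for position 1662 from the C-terminal stretch of the sequence. The function name `site_window` and the flank width of 7 are illustrative assumptions inferred from the table, not part of the database.

```python
def site_window(fragment, first_residue, position, flank=7):
    """Return the residues within `flank` positions of a 1-based site.

    `fragment` is a stretch of the protein whose first residue carries the
    1-based sequence number `first_residue`. The window is truncated on the
    left if the site is too close to the start of the fragment.
    """
    index = position - first_residue  # 0-based index into the fragment
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    start = max(0, index - flank)
    return fragment[start:index + flank + 1]


# Residues 1621-1669 of the canonical sequence, copied from this entry.
C_TERMINAL = "TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT"

peptide = site_window(C_TERMINAL, 1621, 1662)
print(peptide)      # the window around position 1662
print(peptide[7])   # the central residue of a full 15-residue window
```

Checking that the central residue is a cysteine is a quick sanity test for any S-palmitoylation site in the table.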
Organism
Homo sapiens (Human)
NCBI Taxa ID
9606
Reference
[1] Predicted from GPS-Lipid
Functional Description
Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Sequence Annotation
Domain: 1445 1669 Collagen IV NC1.
Region: 173 1440 Triple-helical region.
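The annotation lines above follow a "kind: start end description" layout with 1-based inclusive coordinates, so the length of a feature is end minus start plus one. A minimal sketch of reading such a line follows; the function name `parse_feature` and the returned dictionary keys are assumptions made for illustration.

```python
def parse_feature(line):
    """Split an annotation line such as 'Domain: 1445 1669 Collagen IV NC1.'"""
    kind, rest = line.split(":", 1)
    start, end, description = rest.split(None, 2)
    start, end = int(start), int(end)
    return {
        "kind": kind.strip(),
        "start": start,
        "end": end,
        "length": end - start + 1,  # coordinates are inclusive
        "description": description.strip().rstrip("."),
    }


domain = parse_feature("Domain: 1445 1669 Collagen IV NC1.")
region = parse_feature("Region: 173 1440 Triple-helical region.")
print(domain["length"], region["length"])
```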
Protein Length
1669 AA.
Protein Sequence
(Canonical)
MGPRLSVWLL LLPAALLLHE EHSRAAAKGG CAGSGCGKCD CHGVKGQKGE RGLPGLQGVI  60
GFPGMQGPEG PQGPPGQKGD TGEPGLPGTK GTRGPPGASG YPGNPGLPGI PGQDGPPGPP  120
GIPGCNGTKG ERGPLGPPGL PGFAGNPGPP GLPGMKGDPG EILGHVPGML LKGERGFPGI  180
PGTPGPPGLP GLQGPVGPPG FTGPPGPPGP PGPPGEKGQM GLSFQGPKGD KGDQGVSGPP  240
GVPGQAQVQE KGDFATKGEK GQKGEPGFQG MPGVGEKGEP GKPGPRGKPG KDGDKGEKGS  300
PGFPGEPGYP GLIGRQGPQG EKGEAGPPGP PGIVIGTGPL GEKGERGYPG TPGPRGEPGP  360
KGFPGLPGQP GPPGLPVPGQ AGAPGFPGER GEKGDRGFPG TSLPGPSGRD GLPGPPGSPG  420
PPGQPGYTNG IVECQPGPPG DQGPPGIPGQ PGFIGEIGEK GQKGESCLIC DIDGYRGPPG  480
PQGPPGEIGF PGQPGAKGDR GLPGRDGVAG VPGPQGTPGL IGQPGAKGEP GEFYFDLRLK  540
GDKGDPGFPG QPGMTGRAGS PGRDGHPGLP GPKGSPGSVG LKGERGPPGG VGFPGSRGDT  600
GPPGPPGYGP AGPIGDKGQA GFPGGPGSPG LPGPKGEPGK IVPLPGPPGA EGLPGSPGFP  660
GPQGDRGFPG TPGRPGLPGE KGAVGQPGIG FPGPPGPKGV DGLPGDMGPP GTPGRPGFNG  720
LPGNPGVQGQ KGEPGVGLPG LKGLPGLPGI PGTPGEKGSI GVPGVPGEHG AIGPPGLQGI  780
RGEPGPPGLP GSVGSPGVPG IGPPGARGPP GGQGPPGLSG PPGIKGEKGF PGFPGLDMPG  840
PKGDKGAQGL PGITGQSGLP GLPGQQGAPG IPGFPGSKGE MGVMGTPGQP GSPGPVGAPG  900
LPGEKGDHGF PGSSGPRGDP GLKGDKGDVG LPGKPGSMDK VDMGSMKGQK GDQGEKGQIG  960
PIGEKGSRGD PGTPGVPGKD GQAGQPGQPG PKGDPGISGT PGAPGLPGPK GSVGGMGLPG  1020
TPGEKGVPGI PGPQGSPGLP GDKGAKGEKG QAGPPGIGIP GLRGEKGDQG IAGFPGSPGE  1080
KGEKGSIGIP GMPGSPGLKG SPGSVGYPGS PGLPGEKGDK GLPGLDGIPG VKGEAGLPGT  1140
PGPTGPAGQK GEPGSDGIPG SAGEKGEPGL PGRGFPGFPG AKGDKGSKGE VGFPGLAGSP  1200
GIPGSKGEQG FMGPPGPQGQ PGLPGSPGHA TEGPKGDRGP QGQPGLPGLP GPMGPPGLPG  1260
IDGVKGDKGN PGWPGAPGVP GPKGDPGFQG MPGIGGSPGI TGSKGDMGPP GVPGFQGPKG  1320
LPGLQGIKGD QGDQGVPGAK GLPGPPGPPG PYDIIKGEPG LPGPEGPPGL KGLQGLPGPK  1380
GQQGVTGLVG IPGPPGIPGF DGAPGQKGEM GPAGPTGPRG FPGPPGPDGL PGSMGPPGTP  1440
SVDHGFLVTR HSQTIDDPQC PSGTKILYHG YSLLYVQGNE RAHGQDLGTA GSCLRKFSTM  1500
PFLFCNINNV CNFASRNDYS YWLSTPEPMP MSMAPITGEN IRPFISRCAV CEAPAMVMAV  1560
HSQTIQIPPC PSGWSSLWIG YSFVMHTSAG AEGSGQALAS PGSCLEEFRS APFIECHGRG  1620
TCNYYANAYS FWLATIERSE MFKKPTPSTL KAGELRTHVS RCQVCMRRT              1669
FASTA
(Canonical)
>LipidDB-9606-01000|P02462
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLK
GDKGDPGFPGQPGMTGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
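The FASTA record above can be read with a few lines of code. The sketch below splits the header on `|` and joins the wrapped sequence lines. It assumes the first header field is the LipidDB ID and the second is a UniProt accession, which is an inference from this entry rather than a documented format. Only the first two sequence lines are included to keep the example short, and `parse_fasta` is a hypothetical helper name.

```python
def parse_fasta(text):
    """Parse a single-record FASTA string into (entry_id, accession, sequence)."""
    header, *sequence_lines = text.strip().splitlines()
    if not header.startswith(">"):
        raise ValueError("FASTA header must start with '>'")
    entry_id, _, accession = header[1:].partition("|")
    sequence = "".join(line.strip() for line in sequence_lines)
    return entry_id, accession, sequence


# Header and first two sequence lines of this entry (truncated for brevity).
RECORD = """>LipidDB-9606-01000|P02462
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
"""

entry_id, accession, sequence = parse_fasta(RECORD)
print(entry_id, accession, len(sequence))
```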
Gene Ontology
GO:0005604; C:basement membrane; IC:BHF-UCL
GO:0005587; C:collagen type IV trimer; IMP:BHF-UCL
GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome
GO:0031012; C:extracellular matrix; IDA:UniProtKB
GO:0005576; C:extracellular region; NAS:UniProtKB
GO:0030023; F:extracellular matrix constituent conferring elasticity; IC:BHF-UCL
GO:0005201; F:extracellular matrix structural constituent; IMP:BHF-UCL
GO:0048407; F:platelet-derived growth factor binding; IDA:MGI
GO:0007411; P:axon guidance; TAS:Reactome
GO:0071711; P:basement membrane organization; IMP:BHF-UCL
GO:0048514; P:blood vessel morphogenesis; IMP:BHF-UCL
GO:0007420; P:brain development; IMP:BHF-UCL
GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl
GO:0030574; P:collagen catabolic process; TAS:Reactome
GO:0030855; P:epithelial cell differentiation; IEA:Ensembl
GO:0022617; P:extracellular matrix disassembly; TAS:Reactome
GO:0030198; P:extracellular matrix organization; TAS:Reactome
GO:0007528; P:neuromuscular junction development; IEA:Ensembl
GO:0001569; P:patterning of blood vessels; IMP:BHF-UCL
GO:0061333; P:renal tubule morphogenesis; IMP:BHF-UCL
GO:0061304; P:retinal blood vessel morphogenesis; IMP:BHF-UCL
Interpro
InterPro; IPR016187; C-type_lectin_fold
InterPro; IPR008160; Collagen
InterPro; IPR001442; Collagen_VI_NC
Pfam
Pfam; PF01413; C4;
Pfam; PF01391; Collagen;
SMART
SMART; SM00111; C4;
PROSITE
PROSITE; PS51403; NC1_IV;
PRINTS