Lipid Modification Database
Tag Content
LipidDB ID
LipidDB-9606-01337
Entry Name
UniProt Accession
Theoretical PI
8.89
Molecular Weight
167553.1
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
Canstatin
Protein Synonyms/Alias
Gene Name
COL4A2
Gene Synonyms/Alias
Created Date
01-AUG-1988
 Lipid Modification Sites 
 Position   Sequence Form   Peptide   References   Modification Type 
1705
Canonical
IRTHISRCQVCMKNL
[1]
S-Palmitoylation
1708
Canonical
HISRCQVCMKNL***
[1]
S-Palmitoylation
Organism
Homo sapiens (Human)
NCBI Taxa ID
9606
Reference
[1] Predicted from GPS-Lipid
Functional Description
Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Sequence Annotation
Domain: 1489 1712 Collagen IV NC1.
Region: 184 1484 Triple-helical region.
Protein Length
1712 AA.
Protein Sequence
(Canonical)
MGRDQRAVAG PALRRWLLLG TVTVGFLAQS VLAGVKKFDV PCGGRDCSGG CQCYPEKGGR  60
GQPGPVGPQG YNGPPGLQGF PGLQGRKGDK GERGAPGVTG PKGDVGARGV SGFPGADGIP  120
GHPGQGGPRG RPGYDGCNGT QGDSGPQGPP GSEGFTGPPG PQGPKGQKGE PYALPKEERD  180
RYRGEPGEPG LVGFQGPPGR PGHVGQMGPV GAPGRPGPPG PPGPKGQQGN RGLGFYGVKG  240
EKGDVGQPGP NGIPSDTLHP IIAPTGVTFH PDQYKGEKGS EGEPGIRGIS LKGEEGIMGF  300
PGLRGYPGLS GEKGSPGQKG SRGLDGYQGP DGPRGPKGEA GDPGPPGLPA YSPHPSLAKG  360
ARGDPGFPGA QGEPGSQGEP GDPGLPGPPG LSIGDGDQRR GLPGEMGPKG FIGDPGIPAL  420
YGGPPGPDGK RGPPGPPGLP GPPGPDGFLF GLKGAKGRAG FPGLPGSPGA RGPKGWKGDA  480
GECRCTEGDE AIKGLPGLPG PKGFAGINGE PGRKGDRGDP GQHGLPGFPG LKGVPGNIGA  540
PGPKGAKGDS RTITTKGERG QPGVPGVPGM KGDDGSPGRD GLDGFPGLPG PPGDGIKGPP  600
GDPGYPGIPG TKGTPGEMGP PGLGLPGLKG QRGFPGDAGL PGPPGFLGPP GPAGTPGQID  660
CDTDVKRAVG GDRQEAIQPG CIGGPKGLPG LPGPPGPTGA KGLRGIPGFA GADGGPGPRG  720
LPGDAGREGF PGPPGFIGPR GSKGAVGLPG PDGSPGPIGL PGPDGPPGER GLPGEVLGAQ  780
PGPRGDAGVP GQPGLKGLPG DRGPPGFRGS QGMPGMPGLK GQPGLPGPSG QPGLYGPPGL  840
HGFPGAPGQE GPLGLPGIPG REGLPGDRGD PGDTGAPGPV GMKGLSGDRG DAGFTGEQGH  900
PGSPGFKGID GMPGTPGLKG DRGSPGMDGF QGMPGLKGRP GFPGSKGEAG FFGIPGLKGL  960
AGEPGFKGSR GDPGPPGPPP VILPGMKDIK GEKGDEGPMG LKGYLGAKGI QGMPGIPGLS  1020
GIPGLPGRPG HIKGVKGDIG VPGIPGLPGF PGVAGPPGIT GFPGFIGSRG DKGAPGRAGL  1080
YGEIGATGDF GDIGDTINLP GRPGLKGERG TTGIPGLKGF FGEKGTEGDI GFPGITGVTG  1140
VQGPPGLKGQ TGFPGLTGPP GSQGELGRIG LPGGKGDDGW PGAPGLPGFP GLRGIRGLHG  1200
LPGTKGFPGS PGSDIHGDPG FPGPPGERGD PGEANTLPGP VGVPGQKGDQ GAPGERGPPG  1260
SPGLQGFPGI TPPSNISGAP GDKGAPGIFG LKGYRGPPGP PGSAALPGSK GDTGNPGAPG  1320
TPGTKGWAGD SGPQGRPGVF GLPGEKGPRG EQGFMGNTGP TGAVGDRGPK GPKGDPGFPG  1380
APGTVGAPGI AGIPQKIAVQ PGTVGPQGRR GPPGAPGEMG PQGPPGEPGF RGAPGKAGPQ  1440
GRGGVSAVPG FRGDEGPIGH QGPIGQEGAP GRPGSPGLPG MPGRSVSIGY LLVKHSQTDQ  1500
EPMCPVGMNK LWSGYSLLYF EGQEKAHNQD LGLAGSCLAR FSTMPFLYCN PGDVCYYASR  1560
NDKSYWLSTT APLPMMPVAE DEIKPYISRC SVCEAPAIAI AVHSQDVSIP HCPAGWRSLW  1620
IGYSFLMHTA AGDEGGGQSL VSPGSCLEDF RATPFIECNG GRGTCHYYAN KYSFWLTTIP  1680
EQSFQGSPSA DTLKAGLIRT HISRCQVCMK NL                                1712
FASTA
(Canonical)
>LipidDB-9606-01337|P08572
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
Gene Ontology
GO:0005587; C:collagen type IV trimer; TAS:UniProtKB
GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome
GO:0031012; C:extracellular matrix; IDA:UniProtKB
GO:0005576; C:extracellular region; TAS:Reactome
GO:0070062; C:extracellular vesicular exosome; IDA:UniProtKB
GO:0043231; C:intracellular membrane-bounded organelle; IDA:HPA
GO:0005201; F:extracellular matrix structural constituent; TAS:UniProtKB
GO:0001525; P:angiogenesis; IEA:UniProtKB-KW
GO:0007411; P:axon guidance; TAS:Reactome
GO:0071560; P:cellular response to transforming growth factor beta stimulus; IEA:Ensembl
GO:0030574; P:collagen catabolic process; TAS:Reactome
GO:0035987; P:endodermal cell differentiation; IEP:UniProtKB
GO:0022617; P:extracellular matrix disassembly; TAS:Reactome
GO:0030198; P:extracellular matrix organization; NAS:UniProtKB
GO:0016525; P:negative regulation of angiogenesis; IDA:UniProtKB
GO:0006351; P:transcription, DNA-templated; IEA:Ensembl
Interpro
InterPro; IPR016187; C-type_lectin_fold
InterPro; IPR008160; Collagen
InterPro; IPR001442; Collagen_VI_NC
Pfam
Pfam; PF01413; C4;
Pfam; PF01391; Collagen;
SMART
SMART; SM00111; C4;
PROSITE
PROSITE; PS51403; NC1_IV;
PRINTS