dbPAF Protein Information


Tag Content
dbPAF ID dbPAF-0004705
Uniprot Accession P08120; CO4A1_DROME; A4V070; Q9VMV4;
Genbank Protein ID NP_723044.1; NP_723045.1; NP_723046.1;
Genbank Nucleotide ID NM_164615.2; NM_164616.2; NM_164617.2;
Protein Name Collagen alpha-1(IV) chain
Protein Synonyms/Alias
Gene Name Cg25C
Gene Synonyms/Alias DCg1;CG4145;
Organism Drosophila melanogaster(Fruit fly)
NCBI Taxa ID 7227
Functional Description
(View all)
Collagen type IV is specific for basement membranes.
Phosphorylation Sites
dbPAF PTMs: 1
PositionPeptidesSourceReferences ( PMIDs )
42IQDSVKHYNRNEPKFcurated22817900
Sequence
(Fasta)
MLPFWKRLLY AAVIAGALVG ADAQFWKTAG TAGSIQDSVK HYNRNEPKFP IDDSYDIVDS 60
AGVARGDLPP KNCTAGYAGC VPKCIAEKGN RGLPGPLGPT GLKGEMGFPG MEGPSGDKGQ 120
KGDPGPYGQR GDKGERGSPG LHGQAGVPGV QGPAGNPGAP GINGKDGCDG QDGIPGLEGL 180
SGMPGPRGYA GQLGSKGEKG EPAKENGDYA KGEKGEPGWR GTAGLAGPQG FPGEKGERGD 240
SGPYGAKGPR GEHGLKGEKG ASCYGPMKPG APGIKGEKGE PASSFPVKPT HTVMGPRGDM 300
GQKGEPGLVG RKGEPGPEGD TGLDGQKGEK GLPGGPGDRG RQGNFGPPGS TGQKGDRGEP 360
GLNGLPGNPG QKGEPGRAGA TGKPGLLGPP GPPGGGRGTP GPPGPKGPRG YVGAPGPQGL 420
NGVDGLPGPQ GYNGQKGGAG LPGRPGNEGP PGKKGEKGTA GLNGPKGSIG PIGHPGPPGP 480
EGQKGDAGLP GYGIQGSKGD AGIPGYPGLK GSKGERGFKG NAGAPGDSKL GRPGTPGAAG 540
APGQKGDAGR PGTPGQKGDM GIKGDVGGKC SSCRAGPKGD KGTSGLPGIP GKDGARGPPG 600
ERGYPGERGH DGINGQTGPP GEKGEDGRTG LPGATGEPGK PALCDLSLIE PLKGDKGYPG 660
APGAKGVQGF KGAEGLPGIP GPKGEFGFKG EKGLSGAPGN DGTPGRAGRD GYPGIPGQSI 720
KGEPGFHGRD GAKGDKGSFG RSGEKGEPGS CALDEIKMPA KGNKGEPGQT GMPGPPGEDG 780
SPGERGYTGL KGNTGPQGPP GVEGPRGLNG PRGEKGNQGA VGVPGNPGKD GLRGIPGRNG 840
QPGPRGEPGI SRPGPMGPPG LNGLQGEKGD RGPTGPIGFP GADGSVGYPG DRGDAGLPGV 900
SGRPGIVGEK GDVGPIGPAG VAGPPGVPGI DGVRGRDGAK GEPGSPGLVG MPGNKGDRGA 960
PGNDGPKGFA GVTGAPGKRG PAGIPGVSGA KGDKGATGLT GNDGPVGGRG PPGAPGLMGI 1020
KGDQGLAGAP GQQGLDGMPG EKGNQGFPGL DGPPGLPGDA SEKGQKGEPG PSGLRGDTGP 1080
AGTPGWPGEK GLPGLAVHGR AGPPGEKGDQ GRSGIDGRDG INGEKGEQGL QGVWGQPGEK 1140
GSVGAPGIPG APGMDGLPGA AGAPGAVGYP GDRGDKGEPG LSGLPGLKGE TGPVGLQGFT 1200
GAPGPKGERG IRGQPGLPAT VPDIRGDKGS QGERGYTGEK GEQGERGLTG PAGVAGAKGD 1260
RGLQGPPGAS GLNGIPGAKG DIGPRGEIGY PGVTIKGEKG LPGRPGRNGR QGLIGAPGLI 1320
GERGLPGLAG EPGLVGLPGP IGPAGSKGER GLAGSPGQPG QDGFPGAPGL KGDTGPQGFK 1380
GERGLNGFEG QKGDKGDRGL QGPSGLPGLV GQKGDTGYPG LNGNDGPVGA PGERGFTGPK 1440
GRDGRDGTPG LPGQKGEPGM LPPPGPKGEP GQPGRNGPKG EPGRPGERGL IGIQGERGEK 1500
GERGLIGETG NVGRPGPKGD RGEPGERGYE GAIGLIGQKG EPGAPAPAAL DYLTGILITR 1560
HSQSETVPAC SAGHTELWTG YSLLYVDGND YAHNQDLGSP GSCVPRFSTL PVLSCGQNNV 1620
CNYASRNDKT FWLTTNAAIP MMPVENIEIR QYISRCVVCE APANVIAVHS QTIEVPDCPN 1680
GWEGLWIGYS FLMHTAVGNG GGGQALQSPG SCLEDFRATP FIECNGAKGT CHFYETMTSF 1740
WMYNLESSQP FERPQQQTIK AGERQSHVSR CQVCMKNSS
Keyword

KW-0084--Basement membrane
KW-0176--Collagen
KW-0181--Complete proteome
KW-1015--Disulfide bond
KW-0272--Extracellular matrix
KW-0325--Glycoprotein
KW-0379--Hydroxylation
KW-1185--Reference proteome
KW-0677--Repeat
KW-0964--Secreted
KW-0732--Signal

Interpro

IPR016187--C-type_lectin_fold
IPR008160--Collagen
IPR001442--Collagen_VI_NC

PROSITE

PS51403--NC1_IV

Pfam

PF01413--C4
PF01391--Collagen

Gene Ontology

GO:0005587--C:collagen type IV trimer
GO:0005201--F:extracellular matrix structural constituent
GO:0055013--P:cardiac muscle cell development
GO:0007391--P:dorsal closure
GO:0035848--P:oviduct morphogenesis