dbPAF Protein Information


Tag Content
dbPAF ID dbPAF-0050957
Uniprot Accession E9Q7P1; E9Q7P1_MOUSE;
Genbank Protein ID NP_081450.1; XP_006521441.2; XP_006521444.1; XP_006521445.1;
Genbank Nucleotide ID NM_027174.1; XM_006521378.2; XM_006521381.2; XM_006521382.2;
Protein Name Protein Col22a1
Protein Synonyms/Alias
Gene Name Col22a1
Gene Synonyms/Alias
Organism Mus musculus(Mouse)
NCBI Taxa ID 10090
Functional Description
(View all)
Phosphorylation Sites
dbPAF PTMs: 2
PositionPeptidesSourceReferences ( PMIDs )
199EELDEIASEPKSAHVPhosphoSitePlus22135298
1321GSSGSPGSPGDPGRPcurated25338131
Sequence
(Fasta)
MPVFTSRRAM AVPRGNPLAC LLWILLLWGG DGGCQAQRAG CKSVQYDLVF LLDTSSSVGK 60
EDFEKVRQWV ANLVDTFEVG PGHTRVGVVR YSDRPTTAFE LGHFNSREEV KAAARRITYH 120
GGNTNTGDAL RYITSRSFSA QAGGRPGNRA FKQVAILLTD GRSQDLVLDA AAAAHAAGIR 180
IFAVGVGAAL KEELDEIASE PKSAHVFHVS DFNAIDKIRG KLRRRLCENV LCPSVRVEGD 240
RFKHTNGGTK EITGFDLMDL FSVKEILGKR ENGAQSSYVR MGSFPVVQRT EDVFPQGLPD 300
EYAFVTTFRF RKTSRKEDWY IWQVIDQYGI PQVSIRLDGE NKAVEYNAVG AMKDAVRVVF 360
RGPRVDDLFD RDWHKMALSI QAQNVSLYID CLLVQTLPIE ERENIDIQGK TVIGKRLYDS 420
VPIDFDLQRI VIYCDSRHAE LETCCDIPLG PCQVTVVTEP PPAPPQLPTP GSEQIGFLKT 480
INCSCPPGEK GERGFAGPLG LPGQKGDAGP IGLMGAPGPK GEKGDSGRGP FIHGEKGEKG 540
SLGPPGPPGR DGSKGMRGEP GELGEPGLPG EVGMRGPQGP PGLPGPAGPV GAPGLRGERG 600
ERGPPGEKGE RGLDGFPGKP GETGEQGRPG PPGVAGLQGE KGDVGPAGPP GVPGSVVQRE 660
GLKGEQGAPG PRGHQGLPGP PGAPGLIGPE GRDGPPGPPG LRGKKGEMGP PGTPGALGPQ 720
GPPGPPGVPG PPGPGGPPGL PGELGFPGKP GPAGHAGTPG KDGLNGPPGL PGSKGEPGDS 780
GESGVPGMPG PRGEVGERGL AGHPGEKGEV GLPGAPGFPG VHGEKGDQGE KGELGLPGLK 840
GARGEKGEVG PAGPPGLPGS PSVFTPHPRM PGEQGPKGEK GDPGEPGALG PQGHPGELGP 900
RGPIGPPGAK GHDGAQGPPG AAGNPGAPGP AGPPGLSGPP GSLGSPGVRG APGKDGERGE 960
KGTAGEEGSP GPAGPRGDPG APGLPGPPGK GKDGEPGLRG PPGLPGPLGI KGDRGTPGIP 1020
GSPGSRGDPG IGVAGPPGPS GRPGDKGPPG SRGLPGFPGP QGPAGQDGAP GNPGERGPPG 1080
KPGPSSLLSP EDINLLVKDV CNDCPPGPPG LPGLPGFKGD KGLPGKQGRE GTEGKKGDTG 1140
PPGPPGPPGV AGPQGSQGER GAEGEVGQKG EQGHPGVPGF MGPPGNPGPP GADGNAGVAG 1200
PPGPQGPQGK EGPPGPQGPS GIPGVPGEEG KQGRDGKPGP PGEPGKTGEP GLSGAEGARG 1260
PPGFKGHTGD PGPPGLRGEP GIAGPSGRDG SPGKDGDTGP AGPQGPRGTR GPPGSSGSPG 1320
SPGDPGRPGA LGQKGNKGES GSPGLPGFQG PRGPPGEAGE VGAPGKEGAP GKPGEPGSKG 1380
ERGDPGIKGD KGPPGGKGQP GDPGTPGHKG HTGLMGPQGQ PGESGPPGPP GPPGQPGFPG 1440
LRGESPSMDT LRRLIQEELG KQLEAKLAYL LAQMPPAHMK SSQGRPGPPG PPGKDGLPGR 1500
TGPMGEPGRP GQGGLEGPSG PMGPKGERGA KGDPGTPGVG LRGEMGPPGI PGQPGEPGYA 1560
KDGLPGSPGP QGETGLAGHP GPPGPPGPPG LCDPSQCAYF ASLAARPSNV KGP 1614
Keyword

KW-0181--Complete proteome
KW-0272--Extracellular matrix
KW-1267--Proteomics identification
KW-1185--Reference proteome
KW-0964--Secreted
KW-0732--Signal

Interpro

IPR008160--Collagen
IPR013320--ConA-like_dom
IPR001791--Laminin_G
IPR002035--VWF_A

PROSITE

PS50234--VWFA

Pfam

PF01391--Collagen
PF00092--VWA

Gene Ontology

GO:0005578--C:proteinaceous extracellular matrix