dbPAF Protein Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
dbPAF ID | dbPAF-0050957 | ||||||||||||
Uniprot Accession | E9Q7P1; E9Q7P1_MOUSE; | ||||||||||||
Genbank Protein ID | NP_081450.1; XP_006521441.2; XP_006521444.1; XP_006521445.1; | ||||||||||||
Genbank Nucleotide ID | NM_027174.1; XM_006521378.2; XM_006521381.2; XM_006521382.2; | ||||||||||||
Protein Name | Protein Col22a1 | ||||||||||||
Protein Synonyms/Alias | |||||||||||||
Gene Name | Col22a1 | ||||||||||||
Gene Synonyms/Alias | |||||||||||||
Organism | Mus musculus(Mouse) | ||||||||||||
NCBI Taxa ID | 10090 | ||||||||||||
Functional Description (View all) |
|||||||||||||
Phosphorylation Sites |
dbPAF PTMs: 2
|
||||||||||||
Sequence (Fasta) | MPVFTSRRAM AVPRGNPLAC LLWILLLWGG DGGCQAQRAG CKSVQYDLVF LLDTSSSVGK 60 EDFEKVRQWV ANLVDTFEVG PGHTRVGVVR YSDRPTTAFE LGHFNSREEV KAAARRITYH 120 GGNTNTGDAL RYITSRSFSA QAGGRPGNRA FKQVAILLTD GRSQDLVLDA AAAAHAAGIR 180 IFAVGVGAAL KEELDEIASE PKSAHVFHVS DFNAIDKIRG KLRRRLCENV LCPSVRVEGD 240 RFKHTNGGTK EITGFDLMDL FSVKEILGKR ENGAQSSYVR MGSFPVVQRT EDVFPQGLPD 300 EYAFVTTFRF RKTSRKEDWY IWQVIDQYGI PQVSIRLDGE NKAVEYNAVG AMKDAVRVVF 360 RGPRVDDLFD RDWHKMALSI QAQNVSLYID CLLVQTLPIE ERENIDIQGK TVIGKRLYDS 420 VPIDFDLQRI VIYCDSRHAE LETCCDIPLG PCQVTVVTEP PPAPPQLPTP GSEQIGFLKT 480 INCSCPPGEK GERGFAGPLG LPGQKGDAGP IGLMGAPGPK GEKGDSGRGP FIHGEKGEKG 540 SLGPPGPPGR DGSKGMRGEP GELGEPGLPG EVGMRGPQGP PGLPGPAGPV GAPGLRGERG 600 ERGPPGEKGE RGLDGFPGKP GETGEQGRPG PPGVAGLQGE KGDVGPAGPP GVPGSVVQRE 660 GLKGEQGAPG PRGHQGLPGP PGAPGLIGPE GRDGPPGPPG LRGKKGEMGP PGTPGALGPQ 720 GPPGPPGVPG PPGPGGPPGL PGELGFPGKP GPAGHAGTPG KDGLNGPPGL PGSKGEPGDS 780 GESGVPGMPG PRGEVGERGL AGHPGEKGEV GLPGAPGFPG VHGEKGDQGE KGELGLPGLK 840 GARGEKGEVG PAGPPGLPGS PSVFTPHPRM PGEQGPKGEK GDPGEPGALG PQGHPGELGP 900 RGPIGPPGAK GHDGAQGPPG AAGNPGAPGP AGPPGLSGPP GSLGSPGVRG APGKDGERGE 960 KGTAGEEGSP GPAGPRGDPG APGLPGPPGK GKDGEPGLRG PPGLPGPLGI KGDRGTPGIP 1020 GSPGSRGDPG IGVAGPPGPS GRPGDKGPPG SRGLPGFPGP QGPAGQDGAP GNPGERGPPG 1080 KPGPSSLLSP EDINLLVKDV CNDCPPGPPG LPGLPGFKGD KGLPGKQGRE GTEGKKGDTG 1140 PPGPPGPPGV AGPQGSQGER GAEGEVGQKG EQGHPGVPGF MGPPGNPGPP GADGNAGVAG 1200 PPGPQGPQGK EGPPGPQGPS GIPGVPGEEG KQGRDGKPGP PGEPGKTGEP GLSGAEGARG 1260 PPGFKGHTGD PGPPGLRGEP GIAGPSGRDG SPGKDGDTGP AGPQGPRGTR GPPGSSGSPG 1320 SPGDPGRPGA LGQKGNKGES GSPGLPGFQG PRGPPGEAGE VGAPGKEGAP GKPGEPGSKG 1380 ERGDPGIKGD KGPPGGKGQP GDPGTPGHKG HTGLMGPQGQ PGESGPPGPP GPPGQPGFPG 1440 LRGESPSMDT LRRLIQEELG KQLEAKLAYL LAQMPPAHMK SSQGRPGPPG PPGKDGLPGR 1500 TGPMGEPGRP GQGGLEGPSG PMGPKGERGA KGDPGTPGVG LRGEMGPPGI PGQPGEPGYA 1560 KDGLPGSPGP QGETGLAGHP GPPGPPGPPG LCDPSQCAYF ASLAARPSNV KGP 1614 |
||||||||||||
Keyword | KW-0181--Complete proteome |
||||||||||||
Interpro | IPR008160--Collagen |
||||||||||||
PROSITE | PS50234--VWFA |
||||||||||||
Pfam | |||||||||||||
Gene Ontology | GO:0005578--C:proteinaceous extracellular matrix |