dbPAF Protein Information
Tag | Content | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
dbPAF ID | dbPAF-0051048 | ||||||||||||||||
Uniprot Accession | E9QAQ8; E9QAQ8_MOUSE; | ||||||||||||||||
Genbank Protein ID | |||||||||||||||||
Genbank Nucleotide ID | |||||||||||||||||
Protein Name | Protein Muc5ac | ||||||||||||||||
Protein Synonyms/Alias | |||||||||||||||||
Gene Name | Muc5ac | ||||||||||||||||
Gene Synonyms/Alias | |||||||||||||||||
Organism | Mus musculus(Mouse) | ||||||||||||||||
NCBI Taxa ID | 10090 | ||||||||||||||||
Functional Description (View all) |
|||||||||||||||||
Phosphorylation Sites |
dbPAF PTMs: 3
|
||||||||||||||||
Sequence (Fasta) | MGVGRRKLVP FWVLALALAC SQCTGQAQQD SLKSYHEHRS DVPHPQGHVG TPLNRVTIIP 60 PLKTIPVVRA FNPGHTRRVC STWGNFHYKT FDGQVFYFPG LCNYVFSAHC GDAYEDFNIQ 120 LRRVQESNTT TLSRVTMKLD GLVVELTKSS VLVNNHPVQL PFSQSGVLIE LSNGYLKVVA 180 RLGLLFVWNE DDSLLLELDT KYTNKTCGLC GDFNGSPKSN EFLSNNVRLT PLEFGNLQKM 240 DGPTEQCQDP LPVPQKNCSA RSGICEMILK GELFSGCAAL VDISSYVEAC RQDVCLCESL 300 DPSDCICHTL AEYSRQCAHA GGQPQDWRGP NLCSQTCPLN MQHQECGSPC VDTCSNPQHS 360 QVCEDHCIAG CFCPEGMVLD DINQMGCVPV SQCACLYNGT LYAPGTNYST DCTKCTCSGG 420 QWSCQDIPCA GTCSVMGGSH MSTFDGRQYT VHGDCTYVLS KPCDSNAFTV LVELRKCGLT 480 ESETCLKTVT LNLGGGQTVI MVKATGEVFV NQIYTQLPVS TANATFFRPS TFFIVGETNL 540 GLQLEIQLSP IMQTSVRLKP GLRGLTCGLC GNFNSMQADD FQTISGVVEG TAAAFFNTFK 600 TQAACPNVKN IFQDPCSLSV ENEKYAQHWC SLLTNASGPF SQCHATVNPS TFFSNCMYDT 660 CNCEKSEDCM CAALSSYVRA CAAKGVLLSD WRDGICTKPT ITCPKSMTYQ YHISTCQPTC 720 RALNEKDVTC HVSFIPVDGC TCPKGTFLDD LGKCVQATSC PCYYKGSTVP NGESVQDSGA 780 ICTCTQGALT CIGGPAPTPV CDAPMIYFDC HNATPGDTGA GCQKSCHTLD MTCYSSECVP 840 GCVCPNGLVA DGNGGCVVTE DCPCVHNEAT YRPGETIQVG CNNCTCENRM WQCTDKPCLA 900 TCAVYGDGHY ITFDGQRYSF NGDCEYTLLQ DNCGGNGSSQ DAFRVITENI PCGTTGTTCS 960 KSIKIFLGNY ELKLSDSKME VVQKDVGQEP PYFVHQMGNY LVVETDIGLV LLWDKKTSIF 1020 LRLSPEFKGR VCGLCGNFDD NAINDFTTRS QSVVSDMLEF GNSWKLSPSC PDVLVPKDPC 1080 TANPYRKSWA QKQCSIINSE TFSACHAHVE PAKYYEACVN DACACDSGGD CECFCTTVAA 1140 YAQACHEVGV CVSWRTPDIC PLFCDYYNPE GQCEWHYQPC GAPCMRTCQN PTGQCLQDLR 1200 GLEGCYPKCP PTAPIFDEGT MQCVSNCTVT FPCRVNGKLY RPGASVPSDK NCDSCICTES 1260 GVRCTHNAGA CVCTYNGQQF HPGEIIYHTT DGIGGCISAH CRANGTIERS VDTCNSTTPT 1320 PPTTFSFSTP PVMTSMQPSS THSSPTPSVG SSGASSKAAS TTSSILSVKS PVTAPMTMST 1380 SASAVTTSGC REECLWSPWM DVSRPGRGID SGDFDTLENL RAHGYPICQV PKAVECRAEA 1440 SPGVPLPELQ QHLECSTTVG LICYNSDQLS GLCDNYQIKV QCCTPVSCPT SQTTHVISSS 1500 RTTNLDNTTS SVPVTSTEHP YSSTVTSGSS THTPGLSPSS SVPSSPTPAS STPAPVSSTT 1560 VKTTLPITSP TPEPTPAISS VSISTSGSTM PSSETTHECK QELCNWTNWL DGSYPGSGRN 1620 SGDFDTFVNL RSKGYKFCEK PRNVECRAQF FPNTPLEELG QNVTCSREEG LICLNKNQLP 1680 PMCYNYEIRI ECCTVVNNCS TASVTTHPTS HGVSTKTETN WTTHVYSSPT KDTSSHSATI 1740 DTKTWTSGIS HTTTQPVTTH CQLQCNWTKW FDTDFPVPGP HGGDLETYSN IERSGERLCH 1800 REEITQLQCR AKNYPEREME DLGQVVKCDP SVGLVCNNRD QGGDSGMCLN YEVRLLCCHI 1860 PEGCSMTTHV TLLSSTSEIV TSSTPGTTSM HVASSTSMPQ TSSPNTGKTS TISTTQTSSP 1920 NTGKTSTTST TQTSSPNTGK TSTISTTQTS SPNTGKTSTT STTQTSSPNT GKTSTISTTQ 1980 TSSPNTGKAS TPSTPHTSSP NTGKTSTIST TQTSSPNTGK TSTTSTTQTS SPNTGKTSTI 2040 STTQTSSPNT GKASTPSTPH TSSPNTGKTS TISTTQTSSP NTGKASTPST PQTSSPNTGK 2100 TSTISTTQTS SPNTGKGSTP STPQTSSPNT GKTSTISTTQ TSSPNTGKTS TTSTTQTSSP 2160 NTGKTSTIST TQTSSPNTGK ASTPSTPHTS SPNTGKTSTI STTQTSSPNT GKASTPSTPQ 2220 TSSPNTGKTS TISTTQTSSP NTGKGSTPST PQTSSPNTGK TSTTSTTQTS SPNTGKASTI 2280 STTQTISTSG STMPSSETTH ECKQELCNWT NWLDGSYPGS GRNSGDFDTF VNLRSKGYKF 2340 CEKPRNVECR AQFFPNTPLE ELGQNVTCSR EEGLICLNKN QLPPMCYNYE IRIECCTVVN 2400 NCSTASVTTH PTSHGVSTKT ETNWTTHVYS SPTKDTSSHS ATIDTKTWTS GISHTTTQPV 2460 TTHCQLQCNW TKWFDTDFPV PGPHGGDLET YSNIERSGER LCHREEITQL QCRAKNYPER 2520 EMEDLGQVVK CDPSVGLVCN NRDQGGDSGM CLNYEVRLLC CHIPEDCPRT DQTSPVTLSH 2580 KPSSAVVSPS SVSPSLSTSH RVHSTTPCFC SVSGQLYPLG SIIYNQTDLD GHCYYAMCSQ 2640 DCQVVKRVSQ DCPSTMPPPA TTLSTSTTPP VTGRDRCNVF PPRLRGETWP MPNCSQATCE 2700 GNNVISLSPR QCPELNEPSC ANGYPPLKVD DQDGCCQHYQ CQCVCSGWGD PHYITFDGTY 2760 YTFLDNCTYV LVQQIVPVFG YFRVLIDNYY CDVGDSVSCP QSIIVEYHQD RVVLTRRPVS 2820 GVMTNQIIFN NKVVSPGFQQ NGIVTSRVGI KMYVTIQEIG VRVMFSGLIF SVEVPFNLFA 2880 NNTEGQCGTC TNDKKDECRL PGGSIASSCS EMSLHWKVPN QPSCQGPPPT PTSVVPRPSP 2940 TPCPPSPLCE LILSNTFKLC HDVIPPLQFY QGCLFDYCHM LDLEVVCSGL ELYASLCAAQ 3000 GVCIPWRSQT NNTCSFTCPD NQVYQPCGPS NPHYCYRDDS ISPSLTLQEA GPKTEGCFCP 3060 DSTTLFSTND SICVPSCQWC LGPRGEPVEP GHTISIDCQD CICKEATLTC QKKACPQPTC 3120 PEPGFVPVPV ALEAGQCCPQ FSCACNSSHC PPPLHCPKNS SLIVTYEEGA CCPTQNCSSQ 3180 KGCEVNGTLY QPGDVVSSSL CERCLCEVSS NPLSDVFMVS CETELCNTQC PKGSEYQAMP 3240 GQCCGKCIPK TCPFKNNSGS TYFYQPGELW AEPGNPCVTH KCEKFQDVLM VVTMKTECPK 3300 INCPQGQAQL REDGCCYDCP LPNQQKCTVH QRQQIIRQQN CSSEGPVSIS YCQGNCGDSI 3360 SMYSLEANKV EHTCECCQEL QTSQRNVTLR CDDGSSQTFS YTQVEKCGCL GQQCHALGDT 3420 SHAESSEQEF KSKESEEHGQ QLAFRVSEDM LGPFQ 3456 |
||||||||||||||||
Keyword | KW-0181--Complete proteome |
||||||||||||||||
Interpro | IPR006207--Cys_knot_C |
||||||||||||||||
PROSITE | PS01185--CTCK_1 |
||||||||||||||||
Pfam | |||||||||||||||||
Gene Ontology | GO:0005737--C:cytoplasm |