CuGenDBv2

CmUC10G201470 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC10G201470
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Knob-associated histidine-rich protein
Genome location: CmU531Chr10:33332992..33337703
RNA-Seq Expression: CmUC10G201470
Synteny: CmUC10G201470
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
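
The genome location field uses a `seqid:start..end` layout. A minimal sketch of parsing it and deriving the gene span follows; the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Sequence ID, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'CmU531Chr10:33332992..33337703'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CmU531Chr10:33332992..33337703")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model (exons, introns and UTRs), so it is longer than the CDS listed further down.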


Homology
GenBank top hits (hit | e-value | % identity; the alignment follows each hit)
KAA0044020.1 putative uncharacterized protein [Cucumis melo var. makuwa] | 1.3e-70 | 82.47
Query:  MDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVK
        MDQTI  +PSSSSS   SSDNISLVSQEDY+N+EDKD+QD K+ES  D+SLNNLKFSVGRKLNKS SCKSLGELELEEVKGFMDLGFEFKRE+LSPQMVK
Subjt:  MDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVK

Query:  LVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        LVPGLQRLRTQ NKQ  EE+DD DGD D  ++DD + +DDDKKR+IARPYLSEAW I+RPNSPLLNLRMPKVSSTSDMKKHL+SWAKTVAFEIQ
Subjt:  LVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

XP_004137815.3 uncharacterized protein LOC101215662 [Cucumis sativus] | 4.4e-87 | 77.82
Query:  MAMNT-NTLCLVSTMDRLWYHQIILW-SDPL-SSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKD
        MAMNT NTLCLVS MDRLWYHQIIL  SDPL +SH PN +  T+SFPFT   F PS    P SPL+DQTILPSSSSS   SSDNISLVSQE+YSN+EDK+
Subjt:  MAMNT-NTLCLVSTMDRLWYHQIILW-SDPL-SSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKD

Query:  EQDGKKE-SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK
        +QD K+E S D+SLN LKFSVGRKLNKS SCKSLGELELEEVKGFMDLGFEFKRE+LSPQMVKLVPGLQRLRTQINKQ    EDD+DGDGD D ++D + 
Subjt:  EQDGKKE-SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK

Query:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        +D +KKR+IARPYLSEAW I+RPNSPLL+LRMPKVSSTSDMKKHL+SWAKTVAFEIQ
Subjt:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

XP_008442668.1 PREDICTED: putative uncharacterized protein YGR160W [Cucumis melo] | 5.9e-92 | 79.38
Query:  MAMNT-NTLCLVSTMDRLWYHQIILWSDPLSSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKD
        MAMNT NTLCLVS MDRLWYHQIIL SDP +SH PNF+  T+SFPFT   F PS    P SPLMDQTI  +PSSSSS   SSDNISLVSQEDY+N+EDKD
Subjt:  MAMNT-NTLCLVSTMDRLWYHQIILWSDPLSSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKD

Query:  EQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK
        +QD K+ES  D+SLNNLKFSVGRKLNKS SCKSLGELELEEVKGFMDLGFEFKRE+LSPQMVKLVPGLQRLRTQ NKQ  EE+DD DGD D  ++DD + 
Subjt:  EQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK

Query:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        +DDDKKR+IARPYLSEAW I+RPNSPLLNLRMPKVSSTSDMKKHL+SWAKTVAFEIQ
Subjt:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

XP_022145659.1 uncharacterized protein LOC111015056 [Momordica charantia] | 6.0e-76 | 68.9
Query:  MAMNTNTLCLVSTMDRLWYHQIILWSDPL-SSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSS-DNISLVSQEDYSNDEDKDEQ
        MAMNT TLCLVS MDRLWYHQIILWSDPL SSHLPNF +T   PFTKFP CPSPS    SPL ++TI+PSS S  SVSS ++ISL S E  SND+DK+++
Subjt:  MAMNTNTLCLVSTMDRLWYHQIILWSDPL-SSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSS-DNISLVSQEDYSNDEDKDEQ

Query:  DGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDD
          K+ES ++  NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRENL+PQMV L+PGLQRL   INK+   EE++D+ +           +D+
Subjt:  DGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDD

Query:  DKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        D KRD +RPYLSEAWTIKRPNSPLL LRM KVSSTSDMKKHLK WAKTVA EIQ
Subjt:  DKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

XP_038903410.1 uncharacterized protein LOC120090009 isoform X1 [Benincasa hispida] | 2.5e-106 | 84.58
Query:  MAMNTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDG
        MAMNTNTLCLVSTMDRLWYHQIILWSDPLSSH+PNF+ T+SF FT FP  PSPSPLPFSPLMDQ+ILPSSS SPSVSSDNISLVSQ+ YSNDEDK++QDG
Subjt:  MAMNTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDG

Query:  KKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRT-QINKQGPEEEDDDDGDGDGDDDDDGEKNDDD
        KKE  +ESLNNLK SVG KLNKS SCKSLGELELEEVKGFMDLGFEFK+ENLSP+MVKL+PGLQRLRT QINKQ  EEEDDDD D          +NDDD
Subjt:  KKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRT-QINKQGPEEEDDDDGDGDGDDDDDGEKNDDD

Query:  KKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        KKRDIARPYLSEAWTIKR NSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
Subjt:  KKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

TrEMBL top hits (hit | e-value | % identity; the alignment follows each hit)
A0A1S3B6W6 Uncharacterized protein | 2.9e-92 | 79.38
Query:  MAMNT-NTLCLVSTMDRLWYHQIILWSDPLSSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKD
        MAMNT NTLCLVS MDRLWYHQIIL SDP +SH PNF+  T+SFPFT   F PS    P SPLMDQTI  +PSSSSS   SSDNISLVSQEDY+N+EDKD
Subjt:  MAMNT-NTLCLVSTMDRLWYHQIILWSDPLSSHLPNFV-KTTSFPFTKFPFCPSPSPLPFSPLMDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKD

Query:  EQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK
        +QD K+ES  D+SLNNLKFSVGRKLNKS SCKSLGELELEEVKGFMDLGFEFKRE+LSPQMVKLVPGLQRLRTQ NKQ  EE+DD DGD D  ++DD + 
Subjt:  EQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEK

Query:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        +DDDKKR+IARPYLSEAW I+RPNSPLLNLRMPKVSSTSDMKKHL+SWAKTVAFEIQ
Subjt:  NDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

A0A5D3DPB9 Uncharacterized protein | 6.2e-71 | 82.47
Query:  MDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVK
        MDQTI  +PSSSSS   SSDNISLVSQEDY+N+EDKD+QD K+ES  D+SLNNLKFSVGRKLNKS SCKSLGELELEEVKGFMDLGFEFKRE+LSPQMVK
Subjt:  MDQTI--LPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESC-DESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVK

Query:  LVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        LVPGLQRLRTQ NKQ  EE+DD DGD D  ++DD + +DDDKKR+IARPYLSEAW I+RPNSPLLNLRMPKVSSTSDMKKHL+SWAKTVAFEIQ
Subjt:  LVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

A0A6J1CVW5 uncharacterized protein LOC111015056 | 1.7e-76 | 69.29
Query:  MAMNTNTLCLVSTMDRLWYHQIILWSDPL-SSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSS-DNISLVSQEDYSNDEDKDEQ
        MAMNT TLCLVS MDRLWYHQIILWSDPL SSHLPNF +T   PFTKFP CPSPS    SPL ++TI+PSS S  SVSS D+ISL S E  SND+DK+++
Subjt:  MAMNTNTLCLVSTMDRLWYHQIILWSDPL-SSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSS-DNISLVSQEDYSNDEDKDEQ

Query:  DGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDD
          K+ES ++  NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRENL+PQMV L+PGLQRL   INK+   EE++D+ +           +D+
Subjt:  DGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDD

Query:  DKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        D KRD +RPYLSEAWTIKRPNSPLL LRM KVSSTSDMKKHLK WAKTVA EIQ
Subjt:  DKKRDIARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X2 | 3.3e-64 | 59.84
Query:  NTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKE
        NTNTLCLVS MDRLW+HQIIL S                        P PSP    P    +  PSS SS  +  D+ SLVSQED SND DK +QDGK+E
Subjt:  NTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKE

Query:  SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRD
        + +ESL + +F++ +KLNK++SCKSLGELE+EEVKGFMDLGF+F+ ENLSPQMVKLVPGLQR +T+++KQ  E++D                 DDDKKRD
Subjt:  SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRD

Query:  IARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        IARPYLSEAWTI RPNSPLL LRMPKVSSTSDMKK LKSWA+TVA EIQ
Subjt:  IARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X1 | 1.4e-67 | 62.25
Query:  NTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKE
        NTNTLCLVS MDRLW+HQIIL S                        P PSP    P    +  PSS SS  +  D+ SLVSQED SND DK +QDGK+E
Subjt:  NTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKE

Query:  SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRD
        + +ESL + +F++ +KLNK++SCKSLGELE+EEVKGFMDLGF+F+ ENLSPQMVKLVPGLQR +T+++KQ  E++DD       DDDDD ++NDD KKRD
Subjt:  SCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRD

Query:  IARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
        IARPYLSEAWTI RPNSPLL LRMPKVSSTSDMKK LKSWA+TVA EIQ
Subjt:  IARPYLSEAWTIKRPNSPLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ

SwissProt top hits (hit | e-value | % identity; the alignment follows each hit)
No hits found
Arabidopsis top hits (hit | e-value | % identity; the alignment follows each hit)
AT2G42760.1 unknown protein | 1.9e-08 | 30.92
Query:  PLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKR-ENLS
        P+  +P+  QTIL     +    ++   L+S        +K+EQ  KK    +  +N++   G         KS+ +LE EE+KGFMDLGF F   ++  
Subjt:  PLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESCDESLNNLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKR-ENLS

Query:  PQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTI------KRPNSPLLNLRMPKVSSTS--DMKKHLKSWAK
          +V ++PGLQRL  + +    EEE++++ D  G +               ARPYLSEAW        K+  +P +  R+P  ++ S  D+K +L+ WA 
Subjt:  PQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTI------KRPNSPLLNLRMPKVSSTS--DMKKHLKSWAK

Query:  TVAFEIQ
         VA  I+
Subjt:  TVAFEIQ
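
In the alignments above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of turning one such row into a percent identity follows, assuming that BLAST-style convention; `midline_identity` is a hypothetical helper, and a figure computed from a single row will generally differ from the reported value, which covers the whole alignment.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment row.

    Assumes a BLAST-style midline: letters mark identical columns,
    '+' marks conservative substitutions, spaces mark mismatches or gaps.
    Gap columns in the query count toward the alignment length.
    """
    width = len(query)
    if width == 0:
        raise ValueError("empty alignment row")
    # Trailing spaces of the midline are often lost when text is copied,
    # so pad it back to the width of the query row before counting.
    padded = midline.ljust(width)[:width]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / width


# First eleven columns of the first GenBank alignment on this page.
fragment = midline_identity("MDQTI--LPSS", "MDQTI  +PSS")
print(round(fragment, 2))
```

To approximate the reported value, the rows of one hit would need to be concatenated (query rows together, midline rows together) before calling the function.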


Sequences
CDS sequence
ATGGCTATGAACACAAATACTTTATGTCTAGTTTCAACCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCCGATCCATTGAGTTCCCATCTCCCCAATTTTGT
TAAAACAACATCTTTTCCTTTCACAAAATTCCCCTTTTGCCCATCTCCCTCCCCTCTCCCATTCTCACCTCTAATGGATCAAACAATCCTTCCCTCCTCTTCATCGTCCC
CTTCGGTTTCCTCCGACAACATCTCCCTTGTTTCACAGGAAGATTATAGTAATGATGAAGACAAAGACGAACAAGATGGAAAGAAAGAGTCGTGCGATGAAAGCCTCAAC
AATCTCAAATTCTCAGTAGGGAGAAAATTGAACAAATCTGTAAGTTGTAAAAGCTTGGGGGAGTTGGAACTTGAGGAAGTGAAAGGGTTTATGGATTTAGGGTTTGAATT
TAAGAGAGAAAATTTGAGCCCTCAAATGGTGAAGTTGGTACCTGGTTTACAAAGGCTTAGAACTCAAATAAACAAACAAGGTCCCGAAGAAGAAGACGACGACGATGGTG
ACGGTGACGGTGACGATGATGACGACGGCGAAAAAAATGATGATGATAAGAAGAGAGATATAGCAAGACCATATCTTTCAGAAGCATGGACAATAAAAAGACCAAATTCT
CCTCTTTTAAATCTAAGGATGCCAAAGGTTTCTTCAACCTCTGACATGAAGAAACACCTCAAGTCTTGGGCTAAAACTGTTGCATTTGAAATTCAATAA
mRNA sequence
CATTCCATATTCTTTCATTTCCACCACAACTCTCTCCTCTTCCATGGTTTTTTATATCCCAAACTTTCCTTCTCTTTTTTCTCCCTTTCTTTTACTCTTATTTTCTTTTC
TCCCTCTCTTCCATGGCTATGAACACAAATACTTTATGTCTAGTTTCAACCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCCGATCCATTGAGTTCCCATCT
CCCCAATTTTGTTAAAACAACATCTTTTCCTTTCACAAAATTCCCCTTTTGCCCATCTCCCTCCCCTCTCCCATTCTCACCTCTAATGGATCAAACAATCCTTCCCTCCT
CTTCATCGTCCCCTTCGGTTTCCTCCGACAACATCTCCCTTGTTTCACAGGAAGATTATAGTAATGATGAAGACAAAGACGAACAAGATGGAAAGAAAGAGTCGTGCGAT
GAAAGCCTCAACAATCTCAAATTCTCAGTAGGGAGAAAATTGAACAAATCTGTAAGTTGTAAAAGCTTGGGGGAGTTGGAACTTGAGGAAGTGAAAGGGTTTATGGATTT
AGGGTTTGAATTTAAGAGAGAAAATTTGAGCCCTCAAATGGTGAAGTTGGTACCTGGTTTACAAAGGCTTAGAACTCAAATAAACAAACAAGGTCCCGAAGAAGAAGACG
ACGACGATGGTGACGGTGACGGTGACGATGATGACGACGGCGAAAAAAATGATGATGATAAGAAGAGAGATATAGCAAGACCATATCTTTCAGAAGCATGGACAATAAAA
AGACCAAATTCTCCTCTTTTAAATCTAAGGATGCCAAAGGTTTCTTCAACCTCTGACATGAAGAAACACCTCAAGTCTTGGGCTAAAACTGTTGCATTTGAAATTCAATA
AAAAATACTTTAAGACTTATTTTTTAAGTTTTTATTTCCAAACAATTTCTTTTTCTTTTGAACAATATTCATATACAGTAATAGTATTATGTACAAGCTACCCTTTCACA
ATTTTGAAGATGTTTTTTTGTAATATTATTTTTTTTTAAAGAAAAAACCAATAATTATGCAGACAAGCCAATTTTGAATATAATTTGGGTCCCTAGTCGAATGTTTTTGG
CATGTTTTTTTTTTAAGTTTTGGAGATCGAGTAATCGATTCTTATTCATTGAATTATGTTCAACTCTTACTATTTTGTTATTACAACCTAGAATTGTAAAAAAAAAAAAT
TTTAAAAAAAGAAAAATTACACTGTGTATATAATTTTTTCTTCGTTAAAATATCATTTTGGTCTACATACTTTGAAATTTTTTCAATTCTAGTATCAATTAAAAGTACAA
GAACTAAATTGAACAATTAAAAGTACAATAACTAAATTGAACAATTAAAAGTACATCCATGAAAATTAAACAAATTTTAGACAAATTTTAAGTAGGGACCACAATGGTAT
TTTTAAAACTTTTTTTCTTTCATTTATTTTTAAATAAAAATAAAAAGAGAGTGAAAGTGAAAGGAGAGATTGTAGAGTGTTGTAAGACAAGTGAATAATAAATGAATTGG
GAAAAGTCCAATAAGCAAGTTGAGTTTATGAGTATGTGAAGATTCCCAATGTCACTCTTGATTGAGTTTTTGTCACATCATGACCAAGTGGAAGTTCATTTTTCCTTTGC
TTGTAAATGGGTTCACACTTTTTCAAAATAATCATTTGCATATCTATCTCAACATATATAGAAATTTATTTTAAATGATAAAATTATTTAAAATATTTATAAATATAACC
AAATTTTAATTTATATTTGTGATAGATCAAAATTAAGGATTTACCTAGGTCCATTGCTATCGTAAAAGATGTTTGAGTTTACCACAAATAAAAAGTAAATTTTTGCTATA
TTTGTAATTATTTTCTAGATTTTATTTTTCAAAATTAAAAATATATATACACACATGTATTAAACATGATTGTGGGCCTTAAATGGGCCTGTTTAATTGGAAAGCCCGGG
CCTTTGGAGCAATGGGCCTCATTGGGTGCTGAACCTTTTGTAATTTAATTTTTTGAAAGCTACCTTAATATATAAAAATAATAATAATAATAATAATTATTATTATTATT
AAACTTTTAAAATATTTGATATGCAATGTTGGATAACTTAAGTGATTAAATTTTTATTTAAAAAAAAATAGAAAAACATGAAAATGATTATATACGAAAGTATTTTTTAT
AATGAAAACAAAAGGATGCCTATGAAAAATTATCTACTTTTTATATATGTAATTGTCAGATTTATAAAATAAATACTTTGATTTCTTTGACATATATGAGATTGATTCTT
CAAGCATACGTTTCCGTTTTTGAAAAATAATAAAAGAAAAAAAAAAAATCAATCCAAAGATATGGATGTATTTTTAAGTTTTCTTTTTCAAACTTCTTATCAAGTAGATT
TTAGAATCACCTATTGAGTATTAAGAATCTAGCTCATATGCATCAACTACGTCTCTGGTAATGATTAGTTTTGAAAAGAAACTACGTGTGTTTATAAAATACATACATAC
GCACATACATATATTTTCATTATTGAACTTACAAAATGTTACTAGTCATTCAACTTTTGAATTTGTAATATTTTAGTTTTTTAAATTTTAAAGCGTTTGCTTAAAATAAA
AAATTCCTGACTAGCTTTGATATGCTTGCTAGCACGTTAATGTGAGCACGATTTCGAGCTACATAAATTTAAAGACTACTTTAAGGAAGTAAAAAATAGATTTAAAGAAA
AACTTAAGGTGTGTTTGGAATACATTTTCAAATGTTTAATTTAAAAAATAAGTCATTTTGGAAAAATGAGAATGTTTAGAAACCACGAAAAATAGATTTTGAAATGTATT
TTAAACAATATTTTAAAAAAAAAAACATTTTTTTCTTAAGTCAATTCAAACGAGCTCTTATATATTAAGGACCAAAATAGATATTACAAAAATCCAAGATAAAAAAAAAT
GAAACAAACATGGAGGATAAGAGGCTAAAATAGACAATTTAAGAATCAAAATTAAACAAAGTTAAAAGTTTAAAATTCAAGAAATTCCTTTAATCGACCAAATTTTTAAA
AATATTTACAAATAATTATAAAATATTATAATCTATCGCTCAATAAAATTTTGAATATTTTAAGTTCATTTTCGTCTATATAAAACAATTCTGAGATTTAAATCTATAAA
AGAAATACCAAGTCAAATTTGCCAACGTAGGTTTAGCTAAGTCATAATTAATAAAATAAATAAAAACTAAAAAAAAATAAAAAAAATAAAAATAAAACTCAAAATGAAGT
TAGAGAATTATTGATCCTCGTGTGAAAACGCTACAAATAAAAATAGTTAGATATTACGAATTAAAAACAAACAAATAAATTTATCCATCCACGAGAACGTGCATGTGTTC
AACTATAAATTTATAAATTCAAATGTTTTTTTAATGAAAAATAAATTATAAATTCAAATGATAATAAAACAAACTCATAAGACCAAATATATGGATAAAGTGTATCCTAA
TAAAAGAAATACAGGGGGGAAAACTTTAGCTAATCTTTTTAAGAAAAAAAAAAAAAAAAGAAGGTATTTATATATATAAAAAAAATTATTAAAGAAGTTTATCAATCAAA
TTATGATCCAGTGGACCCTGTACTATTTCTAATAATTAGAGGAAAAAAAAAAACCAATTATAATCTCTGAATTTATTTGAAACAATTTTATATTTCTGTGGTGTTGAATA
TTTCCTTTAATAAATAAATAAATAAACTTCCAATATGATCAAATTGGATAGCTTCCAAGGATGCTGTCCGCGATCCCATCGTCATTTTCAATTATTTTATTATTATTTGT
CTTTATGGAAACTATTTTCAGTTATTGATTGTATTAAAAAAAGAAAAAATTAGAGTTATTGTCCAAAGAAAAAAATATATAACAATTTTTTAAAAATATAATCAAATAAG
TTAAACTATTTATAAATATAGAAAGTTCTTTATCTGCAATAAACTGTGATAGATTGCAGTAGATTTCTATAGACAATAAATTTTTTTCGTAAATAATTTGATATTTTTTC
TATTTATAATAATTTAAATTTTTTAAAATATAAATAAGTCTAAATTATAAATGGGTACGTTTTGCATTTCACCCAAGTCAGGACAACATGCTAGAATTTTCTTTTTTGGA
AATATCTTGATGGTTTAAGCATGTTTTTGTAAAGAAAAATATTTTTGTTATTGTATATTACATTTTTTAAATAAAAATATTTAGTTGCAGTATATTGAAAAGTAATTATG
TTCATTATAAGTATATTTTTTTACTTCATATTTTGAATATTTTCAATGAATGGTTTAAAAGCCATTTTGAATAACATATCTTTAGAAAGTTTTCTTAACG
Protein sequence
MAMNTNTLCLVSTMDRLWYHQIILWSDPLSSHLPNFVKTTSFPFTKFPFCPSPSPLPFSPLMDQTILPSSSSSPSVSSDNISLVSQEDYSNDEDKDEQDGKKESCDESLN
NLKFSVGRKLNKSVSCKSLGELELEEVKGFMDLGFEFKRENLSPQMVKLVPGLQRLRTQINKQGPEEEDDDDGDGDGDDDDDGEKNDDDKKRDIARPYLSEAWTIKRPNS
PLLNLRMPKVSSTSDMKKHLKSWAKTVAFEIQ
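
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and uses only the first 39 bases and the last 12 bases of the CDS as short demonstrations; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are rendered as 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS listed above.
cds_start = "ATGGCTATGAACACAAATACTTTATGTCTAGTTTCAACC"
# Last 4 codons of the CDS, including the stop codon.
cds_end = "GAAATTCAATAA"
print(translate(cds_start), translate(cds_end))
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence gives a quick consistency check on a downloaded record.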