; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC08G149070 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC08G149070
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionRRM domain-containing protein
Genome locationCicolChr08:14911863..14915197
RNA-Seq ExpressionCcUC08G149070
SyntenyCcUC08G149070
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592987.1 hypothetical protein SDJN03_12463, partial [Cucurbita argyrosperma subsp. sororia]3.5e-8167.48Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS  D+Q+A FEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPN+TEPINSSQCALVEMKD KEA SVISVI+QFPFM+SGMPRPVRA PAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRP+KPGRKISF WL KDDPDFEVAKK+K LTQ+HVAE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQ+ALK NYKKYEIVE VM DGTARRLA+ YNMRV+DD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

XP_004138411.1 uncharacterized protein LOC101221287 [Cucumis sativus]3.6e-8670.73Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS KD+QFAMFEEKVKRTVYVDNLS QVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKD KEAKSVI+VI+QFPFM+SGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRPVKPGRKISF WLE DDPDFEVA++IK L++KHVAE  + + +                                               H+A
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQE LKGNYKKYEIV+SVMADGTARRLAKHYNMR+SDD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

XP_008441485.1 PREDICTED: uncharacterized protein LOC103485592 [Cucumis melo]8.1e-8670.33Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS KD+QF+MFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVV+VHFIPNYTEPINSSQCALVEMKD KEAKSVI+VI+QFPFM+SGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
         EMFDDRPVKPGRKISF WLE DDPDFEVA++IK LT+KHVAE  + + +                                               HLA
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQE L+GNYKKYEIV+SVMADGTARRLAK+YNMRVSDD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

XP_023004840.1 uncharacterized protein LOC111498019 [Cucurbita maxima]2.1e-8167.48Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS  D+Q+A FEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPN+TEPINSSQCALVEMKD KEA SVISVISQFPFM+SGMPRPVRA PAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRP+KPGRKISF WL K+DPDFEVAKK+K LTQ+HVAE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQ+ALKGNYKKYEI+E VM DGTARRLA+ YNMRV+DD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

XP_038886330.1 uncharacterized protein LOC120076542 [Benincasa hispida]2.9e-9174.8Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS KD+QFAMFEEKVKRTVYVDNLSP VTEPVLRTALDQFGTVVSVHFIPNYTEPIN+SQCALVEMKDLKEAKSVISVI+QFPFM+SGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKK+KLLT+KHVAE  + +                                                HHLA
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAK YNMRVSDD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

TrEMBL top hitse value%identityAlignment
A0A0A0KAE7 RRM domain-containing protein1.8e-8670.73Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS KD+QFAMFEEKVKRTVYVDNLS QVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKD KEAKSVI+VI+QFPFM+SGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRPVKPGRKISF WLE DDPDFEVA++IK L++KHVAE  + + +                                               H+A
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQE LKGNYKKYEIV+SVMADGTARRLAKHYNMR+SDD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

A0A1S3B3J3 uncharacterized protein LOC1034855923.9e-8670.33Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS KD+QF+MFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVV+VHFIPNYTEPINSSQCALVEMKD KEAKSVI+VI+QFPFM+SGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
         EMFDDRPVKPGRKISF WLE DDPDFEVA++IK LT+KHVAE  + + +                                               HLA
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQE L+GNYKKYEIV+SVMADGTARRLAK+YNMRVSDD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

A0A6J1DB90 uncharacterized protein LOC1110185371.0e-7865.45Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS  ++QFA FEEKVKRTVYVDNLSPQVTEPV+RTALDQFGTVV V FIPNY EP+NSSQCALVEMKDLKEAKSVISVI++FPFMISGMPRPVRARPAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRP+KPG KISF+WL+++DPDF+VAKK+KL T+KH AE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQE LKGNYKK+EIVESVM DGTARRLAK Y+MRV DD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

A0A6J1H6P8 uncharacterized protein LOC1114606151.1e-8066.67Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS  D+Q+  FEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPN+TEPINSSQCALVEMKD KEA SVIS+I+QFPFM+SGMPRPVRA PAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRP+KPGRKISF WL KDDPDFEVAKK+K LTQ+HVAE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQ+ALK NYKKYEIVE VM DGTARRLA+ YNMRV+DD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

A0A6J1KVQ1 uncharacterized protein LOC1114980191.0e-8167.48Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        MGS  D+Q+A FEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPN+TEPINSSQCALVEMKD KEA SVISVISQFPFM+SGMPRPVRA PAE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
        VEMFDDRP+KPGRKISF WL K+DPDFEVAKK+K LTQ+HVAE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD
        EEEKLAKQQQ+ALKGNYKKYEI+E VM DGTARRLA+ YNMRV+DD
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMRVSDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05970.1 RNA-binding (RRM/RBD/RNP motifs) family protein7.1e-4036.78Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        M + ++  +  F E+V+RTVYVD L+P  T PV+ +A +QFGTV  V FIPNY  P       LVEM++ +  ++VIS +SQ PFM++GMPRPVRA  AE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
          MF D+P KPGR + F W++ +DPDF+ A+++K L +KH AE  + +                                                  L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMR
        E EKL+KQQ E    ++KK+E+++ ++ DG A++LA  Y+++
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMR

AT1G05970.2 RNA-binding (RRM/RBD/RNP motifs) family protein3.2e-4036.78Show/hide
Query:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE
        M + ++  +  F E+V+RTVYVD L+P  T PV+ +A +QFGTV  V FIPNY  P       LVEM++ +  ++VIS +SQ PFM++GMPRPVRA  AE
Subjt:  MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAE

Query:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA
          MF D+P KPGR + F W++ +DPDF+ A+++K L +KH AE  + + +                                                L 
Subjt:  VEMFDDRPVKPGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLA

Query:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMR
        E EKL+KQQ E    ++KK+E+++ ++ DG A++LA  Y+++
Subjt:  EEEKLAKQQQEALKGNYKKYEIVESVMADGTARRLAKHYNMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCAACGAAGGATGATCAATTTGCCATGTTTGAGGAGAAGGTAAAACGGACTGTCTATGTTGATAATCTCTCTCCCCAAGTAACTGAGCCTGTTTTGAGAACTGC
TTTAGATCAGTTTGGGACTGTTGTTAGCGTCCATTTTATCCCAAACTACACAGAGCCAATTAATAGCTCTCAATGTGCTCTAGTAGAGATGAAGGATTTGAAGGAGGCGA
AGTCTGTTATCTCCGTGATATCTCAGTTCCCTTTCATGATATCTGGAATGCCGAGACCGGTGAGGGCGCGCCCTGCCGAAGTGGAAATGTTCGATGATCGCCCTGTAAAG
CCTGGTAGGAAGATTAGCTTTACCTGGCTGGAAAAGGATGATCCTGATTTTGAAGTGGCCAAGAAAATTAAGCTTCTTACTCAGAAACATGTGGCCGAAAGGATCTATCC
TATCTCTCGTCTTCCATTTTCGTGCCACGTTGTGCTAAATCAATCCATTAGCATTTGTTTATCTAATAAGAGTATTGGAATCATAATTGAAAGTTTGTTTCATATACTTT
TAAATCTCTCTAGTAGGTCTCCTCAATTTTCCGCTTGTCACCATTTGGCGGAAGAGGAGAAGCTTGCAAAGCAGCAACAAGAAGCACTGAAAGGAAACTACAAGAAATAT
GAGATTGTAGAAAGTGTAATGGCTGATGGAACTGCCCGCAGGTTAGCAAAACATTATAATATGCGAGTTTCAGATGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCAACGAAGGATGATCAATTTGCCATGTTTGAGGAGAAGGTAAAACGGACTGTCTATGTTGATAATCTCTCTCCCCAAGTAACTGAGCCTGTTTTGAGAACTGC
TTTAGATCAGTTTGGGACTGTTGTTAGCGTCCATTTTATCCCAAACTACACAGAGCCAATTAATAGCTCTCAATGTGCTCTAGTAGAGATGAAGGATTTGAAGGAGGCGA
AGTCTGTTATCTCCGTGATATCTCAGTTCCCTTTCATGATATCTGGAATGCCGAGACCGGTGAGGGCGCGCCCTGCCGAAGTGGAAATGTTCGATGATCGCCCTGTAAAG
CCTGGTAGGAAGATTAGCTTTACCTGGCTGGAAAAGGATGATCCTGATTTTGAAGTGGCCAAGAAAATTAAGCTTCTTACTCAGAAACATGTGGCCGAAAGGATCTATCC
TATCTCTCGTCTTCCATTTTCGTGCCACGTTGTGCTAAATCAATCCATTAGCATTTGTTTATCTAATAAGAGTATTGGAATCATAATTGAAAGTTTGTTTCATATACTTT
TAAATCTCTCTAGTAGGTCTCCTCAATTTTCCGCTTGTCACCATTTGGCGGAAGAGGAGAAGCTTGCAAAGCAGCAACAAGAAGCACTGAAAGGAAACTACAAGAAATAT
GAGATTGTAGAAAGTGTAATGGCTGATGGAACTGCCCGCAGGTTAGCAAAACATTATAATATGCGAGTTTCAGATGATTAATTGAAAGCTTTGAAGCTTTGCTTTTGTAA
CTCCTAATGCCCCCTGATTTTTACATGTAAATATGTACAAATGGTCATTGATATGCCCAATATTGATGAATAAGAGACTTCTTTTGTGATTTAGAGGGAGAAAGATAGAT
AATTAGTATAATACTCCTAACAACACTCTTTTGCATTTCAGCCCCCCCCCCCTCCTCCCTCCCTTTTTTTCTTTCTATTTTTTTTTCCTCTCCTAACATTGTATTTTCCC
ATCATGAAGGGTGTTTTATGAAAATTTGAACACTTGTTTGGTAGTGACTTTGGTCATGATATTTTATTTTAAGAGAGTTTCAGAGGGAGAATTCACTGTGTACTTTGAAC
TCAGCATAATGTAAACTTTTAGCTTAAACTTTCTGTTCATTGGAGCAAGTGTCATATATTTTATCTTTCAATTACTGTTGTATTCATCCACTAATTTTAACAAAAGTAGA
GGTGACCATTAGTTGGCTGGCGTCGGTTTTGAGCGAAAACCGCCGATGGCCACTGACAAATTGGTTTCTATTGGTTGGTTTTTGGCAGTTTTAGGGTCAGTGGAGACGTC
GTTTGGCGACCACACGAATGGTTGGTTGGTTTTTATAGGTTTTCAGAGGTGAACTTTGCTGTTGTGGAGATGAGAATAGAAGAAAGGGAGAAGAAAAGACAAATGACAAC
GTTGGTTATAGAGTTATTGGCTGGTGGCTTCGATAGATAGTGTGCACCACGATCTTGGTCATGGAAGACTTCTTGAAGGAAAACTTGAAAAAAAAGAGAAAAAAAGAAAA
AAAATCGGATGAGACAATTAGTCCCTAAGTTTTAAGAATAGGTTTAATGGGTTCCTCAATATTAACAGACTGTTAAATGCAAACAGAAATCTAACTGGACATCCACATGG
CCGCTGATTAGACTTTAAAAATTGTCCAAATAGACTTTTATTTTTAAACCAAATCAAATTTTAGAAAATAAAACCTTAGAATCCATTAAACCCACCTAATTTCTTTTCTC
AAATTGCAACTTCGCAACCTTCAGCCTCTTACCCCTATAAAAATCAATCGATAATCTTCTTGGTCTTCATCACCGGTCATTTCAGCCGGCAAATTCCGCATCTTCGATGA
GTATTTGATTCACGAGGATGAAGCTTCATCTTCGTTGGTAAGTACAAATTAATCCCACTTCCTTCACGGATAGCTTTTACCTTTGGGATGATTGAGGGAACAACTTCACC
AGTGAAACTAGGGTGAAAATTAATTGTGATCTGAAACATCCTTTGGGGATGACTGAGGGTACCACTTCCTTTGGGGCTCTGATGATTCTATTTCAATCGGGAGTTCTAAT
TGAGCGGCTTTATCATCGATGAATTCCGACTACAAATCCATCATGGCAGAAAGATGGCGAATTCCGATTGAGCAGCTTAATAATGAAGTAAAAGCTACAAAGATGGTGAT
GGGAACATACTATGCGGAGACGATCATGGATAAAGATGCTCCCATTGCAGTTTGATTATTTTAATATAGTGTTGGAGAAGCAGAATTTGATTGCAATTTTAATATCTATT
TTAACAATAACAAATGTTGGTTTGGGGATGATTGTGAAATTCAGTATAAAATTAATATTGATTTTATGTT
Protein sequenceShow/hide protein sequence
MGSTKDDQFAMFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSVHFIPNYTEPINSSQCALVEMKDLKEAKSVISVISQFPFMISGMPRPVRARPAEVEMFDDRPVK
PGRKISFTWLEKDDPDFEVAKKIKLLTQKHVAERIYPISRLPFSCHVVLNQSISICLSNKSIGIIIESLFHILLNLSSRSPQFSACHHLAEEEKLAKQQQEALKGNYKKY
EIVESVMADGTARRLAKHYNMRVSDD