; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g06550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g06550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGATA transcription factor-like protein
Genome locationchr9:5106133..5108702
RNA-Seq ExpressionMoc09g06550
SyntenyMoc09g06550
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia]1.9e-9375.93Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    W FSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+DDVSC+GFDGGP P E+  R   R+E+E+DEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVIGWLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSL+R TEIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

XP_022151127.1 uncharacterized protein LOC111019128 [Momordica charantia]2.3e-131100Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

XP_022953997.1 uncharacterized protein LOC111456388 [Cucurbita moschata]4.7e-9274.69Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    W FSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+ DVSC+GFDGGP P E+  R   R+E+++DEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKD+IGWLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSL+R TEIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

XP_022991237.1 uncharacterized protein LOC111487953 [Cucurbita maxima]2.5e-9377.18Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    WAFSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD AKANY  +DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+ DVSC+GFDGGP PEE+  R   R+E+EEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI WLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSLRR TEIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo]2.5e-9376.35Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    W+FSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AKANY+R+DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+ DVSC+GFDGGP P E+  R   R+E+EEDEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVIGWLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSLRR  EIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

TrEMBL top hitse value%identityAlignment
A0A0A0K9G7 Uncharacterized protein1.3e-8468.58Show/hide
Query:  NPTTSTLLSSHSEKTGIMQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHA---VDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGK-
        N  TS   ++  ++TG MQS L AIAP S WAF + Q   LRRG     T RTADPS+HA    DDNDPAV SGEPE+SQ+  EPDNAKANYDR D  K 
Subjt:  NPTTSTLLSSHSEKTGIMQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHA---VDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGK-

Query:  GAKNGPFGPTKAQYGSSPRLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITR
        G   GPFG   AQ+ SSPRLE+T VGQ SKPITQQKR H   IDDVSC+G  GGP  + K+ R T  +EEEED R+YYKHHKASPLAEIEFADTRKPITR
Subjt:  GAKNGPFGPTKAQYGSSPRLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITR

Query:  ATDGTAYDGGGKDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        ATDGTAYDG    VIGWLPEQVDT +DSLRR TEIWK+NA+RGDPDAPQSRVLRALRGE+F
Subjt:  ATDGTAYDGGGKDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

A0A5D3D3D5 Uncharacterized protein8.7e-8470.92Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDD--NDPAVPSGEPEKSQEVSEPDNAKANYD-REDSGKGAKNGPFGPTKAQYGSSP
        MQSRL AIAP S WA  + QL  LRRG     TGRTADPSVHA DD  NDP+V SGEPE+SQ+  EPDNAKANY+ R+D  +G  NGPFGP+KAQ+ SSP
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDD--NDPAVPSGEPEKSQEVSEPDNAKANYD-REDSGKGAKNGPFGPTKAQYGSSP

Query:  RLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRT---GREEEEEDE-----REYYKHHKASPLAEIEFADTRKPITRATDGTAYDGG
        RLE+T VGQ SKPITQQKR H   IDDVSC+G  GGP  E K+ R T    +EEEEE+E     R+YYKHHKASPLAEIEF DTRKPITRATDGTA  G 
Subjt:  RLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRT---GREEEEEDE-----REYYKHHKASPLAEIEFADTRKPITRATDGTAYDGG

Query:  GKDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        GK VIGWLPEQVDT +DSLRR TEIWK+NA+RGDPDAPQSRVLRALRGE F
Subjt:  GKDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

A0A6J1DCP1 uncharacterized protein LOC1110191281.1e-131100Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

A0A6J1GPT4 uncharacterized protein LOC1114563882.3e-9274.69Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    W FSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+ DVSC+GFDGGP P E+  R   R+E+++DEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKD+IGWLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSL+R TEIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

A0A6J1JL82 uncharacterized protein LOC1114879531.2e-9377.18Show/hide
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        M SRLTAIA    WAFSLAQ  RLRR GL    T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD AKANY  +DS +G  NGPF P KAQY SSPRL
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRR-GLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE
        E+T V Q SKPITQQKR H TV+ DVSC+GFDGGP PEE+  R   R+E+EEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI WLPE
Subjt:  ESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPE

Query:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF
        Q DT +DSLRR TEIWK+NA+RGDPDAPQSRVLRALRGEQF
Subjt:  QVDTAEDSLRRGTEIWKRNAVRGDPDAPQSRVLRALRGEQF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02700.1 unknown protein7.4e-5149.8Show/hide
Query:  IMQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDN-DPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPR
        +MQSRL A A  ++         RL  G +   +GRTADP +HA +D  DPA+   +PE   +V+ P  A      +      +  P  P K+   ++ +
Subjt:  IMQSRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDN-DPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPR

Query:  LESTGVGQPSKPITQQKRPHGTV----IDDVSCVGFDGGPAP----EEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGG
        LEST VG PS+P  QQKR + T     +D VSC G DG P P    E ++ RR  RE+E E ++E+YKHHKASPL+EIEFADTRKPIT+ATDGTAY   G
Subjt:  LESTGVGQPSKPITQQKRPHGTV----IDDVSCVGFDGGPAP----EEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGG

Query:  KDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDA-PQSRVLRALRGEQF
        KDVIGWLPEQ+DTAE+SL + T I+KRNA RGDP+  P SR+LR +RGE F
Subjt:  KDVIGWLPEQVDTAEDSLRRGTEIWKRNAVRGDPDA-PQSRVLRALRGEQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGTTTGTGCACCTTTGAGGATGGAATCAAAGTTCAGTTGGAATCGGATTGCTTGGAAGCCATGAACATCATTAATAATTTCTCTGTGGAGCTAACTGAG
ACCTCTTTGTTGGCGGAAGATATTCGGAAGGTGGCGGATTCCATGCCAATTGAAAGATTTCACCACATTTTGAGGAAGGCGAACAGAGCAGCTCACAGCTTAGTG
AGTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTCGGA
CCTCGACCAGGTTCACCTCGGCCCTCATACTTAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCCATAATTCCCCAACATTCGGCGCATCTGCAACCATACCA
CATTGCCACACGACAAGACACAAAGTTTCAAAAACAAAAACTGGAATTCCCCAGAAGTTTTTAAAGGCAAAGTACACGTGGAATGCTGCTGTCTATGCCACCTGC
TCATCTGACACGTGTCATATTCCAGAAACTCCCCCTCTTTTCCTTTTTAATCCCACAACTTCAACTCTGCTCTCCAGTCACAGTGAAAAAACTGGAATTATGCAA
TCCAGATTGACGGCGATAGCGCCGAATTCGAAGTGGGCCTTCTCTCTGGCCCAATTACACCGTCTCCGGCGAGGCCTGGCCAGGCCGGAGACTGGTCGGACGGCT
GACCCCTCCGTTCATGCCGTCGACGACAACGATCCCGCCGTTCCTTCCGGTGAACCCGAAAAATCACAGGAAGTTTCAGAACCGGATAATGCCAAAGCCAATTAC
GACAGAGAGGATTCCGGGAAAGGGGCAAAAAATGGGCCGTTTGGGCCAACGAAGGCCCAATACGGTTCCTCCCCGCGTTTAGAGAGCACGGGTGTGGGCCAGCCC
TCGAAGCCCATCACCCAGCAGAAGAGGCCCCACGGGACGGTGATCGACGACGTGAGCTGCGTCGGGTTCGACGGCGGGCCGGCGCCGGAGGAGAAGGACGGCCGA
CGGACCGGCAGAGAAGAGGAGGAGGAAGACGAGAGAGAGTACTACAAGCACCACAAGGCATCGCCGTTGGCGGAGATCGAGTTCGCGGATACGCGGAAGCCGATA
ACACGAGCGACGGACGGGACGGCCTACGACGGCGGCGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGGTGGACACGGCGGAGGATTCACTCCGGCGAGGGACG
GAGATTTGGAAACGGAACGCCGTCCGCGGGGACCCCGATGCTCCCCAATCCAGGGTTCTTAGGGCTTTGCGTGGGGAACAGTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACGTTTGTGCACCTTTGAGGATGGAATCAAAGTTCAGTTGGAATCGGATTGCTTGGAAGCCATGAACATCATTAATAATTTCTCTGTGGAGCTAACTGAG
ACCTCTTTGTTGGCGGAAGATATTCGGAAGGTGGCGGATTCCATGCCAATTGAAAGATTTCACCACATTTTGAGGAAGGCGAACAGAGCAGCTCACAGCTTAGTG
AGTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTCGGA
CCTCGACCAGGTTCACCTCGGCCCTCATACTTAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCCATAATTCCCCAACATTCGGCGCATCTGCAACCATACCA
CATTGCCACACGACAAGACACAAAGTTTCAAAAACAAAAACTGGAATTCCCCAGAAGTTTTTAAAGGCAAAGTACACGTGGAATGCTGCTGTCTATGCCACCTGC
TCATCTGACACGTGTCATATTCCAGAAACTCCCCCTCTTTTCCTTTTTAATCCCACAACTTCAACTCTGCTCTCCAGTCACAGTGAAAAAACTGGAATTATGCAA
TCCAGATTGACGGCGATAGCGCCGAATTCGAAGTGGGCCTTCTCTCTGGCCCAATTACACCGTCTCCGGCGAGGCCTGGCCAGGCCGGAGACTGGTCGGACGGCT
GACCCCTCCGTTCATGCCGTCGACGACAACGATCCCGCCGTTCCTTCCGGTGAACCCGAAAAATCACAGGAAGTTTCAGAACCGGATAATGCCAAAGCCAATTAC
GACAGAGAGGATTCCGGGAAAGGGGCAAAAAATGGGCCGTTTGGGCCAACGAAGGCCCAATACGGTTCCTCCCCGCGTTTAGAGAGCACGGGTGTGGGCCAGCCC
TCGAAGCCCATCACCCAGCAGAAGAGGCCCCACGGGACGGTGATCGACGACGTGAGCTGCGTCGGGTTCGACGGCGGGCCGGCGCCGGAGGAGAAGGACGGCCGA
CGGACCGGCAGAGAAGAGGAGGAGGAAGACGAGAGAGAGTACTACAAGCACCACAAGGCATCGCCGTTGGCGGAGATCGAGTTCGCGGATACGCGGAAGCCGATA
ACACGAGCGACGGACGGGACGGCCTACGACGGCGGCGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGGTGGACACGGCGGAGGATTCACTCCGGCGAGGGACG
GAGATTTGGAAACGGAACGCCGTCCGCGGGGACCCCGATGCTCCCCAATCCAGGGTTCTTAGGGCTTTGCGTGGGGAACAGTTTTAA
Protein sequenceShow/hide protein sequence
MKRLCTFEDGIKVQLESDCLEAMNIINNFSVELTETSLLAEDIRKVADSMPIERFHHILRKANRAAHSLVSVYARICTTVYFRLQLELGLRTDLNTWADLHKKVG
PRPGSPRPSYLASVNASGGDLGHNSPTFGASATIPHCHTTRHKVSKTKTGIPQKFLKAKYTWNAAVYATCSSDTCHIPETPPLFLFNPTTSTLLSSHSEKTGIMQ
SRLTAIAPNSKWAFSLAQLHRLRRGLARPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLESTGVGQP
SKPITQQKRPHGTVIDDVSCVGFDGGPAPEEKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQVDTAEDSLRRGT
EIWKRNAVRGDPDAPQSRVLRALRGEQF