; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0382 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0382
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionNAC domain-containing protein
Genome locationMC06:3052551..3054448
RNA-Seq ExpressionMC06g0382
SyntenyMC06g0382
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134590.1 uncharacterized protein LOC111006819 [Momordica charantia]4.43e-172100Show/hide
Query:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV

Query:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
        LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
Subjt:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD

Query:  SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
        SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
Subjt:  SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

XP_022933859.1 uncharacterized protein LOC111441147 [Cucurbita moschata]4.14e-13884.43Show/hide
Query:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL
        MKMKRKDLDQ NDEFSDFSLSSPA KIRRLDVGLPPIIEEE PPEI+ L+E+ +MP N+VA   GLRIEELPDASSV+A SACAMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL

Query:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS
        FKPVNT+FLQSSPLSVSVDSDIISGF+S+ILRENR   RLK GEDDDEMGTEN+NLAVVPWVPR+QVPA TNM V QEEAPQ+MEAEE+  ATM+IEDD+
Subjt:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS

Query:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
         SSQQ PYGYGGMDGANAIHQWHQQHCMIPQLPQQTS+PITWFR
Subjt:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

XP_022968758.1 uncharacterized protein LOC111467899 [Cucurbita maxima]3.78e-13482.38Show/hide
Query:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL
        MKMKRKDLDQ NDEFSDFSLSSPA KIRRLDVGLPPIIEEE PPEIA L+E+ ++P N+VA   G+RIEELPDASSV+A SACAMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL

Query:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS
        FKPVNT+FLQS PLSVSVDSDIISGF+S+ILRENR   RLK GEDDDEMGTEN+NLAVVPWVPR+QVPA TNM V QEEAPQ+MEAEE+  ATM+IEDD+
Subjt:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS

Query:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
         SSQQ PYGYGGMDGANAIHQW Q  CMIPQLPQQTS+PITWFR
Subjt:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

XP_023531776.1 uncharacterized protein LOC111793929 [Cucurbita pepo subsp. pepo]1.96e-13684.02Show/hide
Query:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL
        MKMKRKDLDQ NDEFSDFSLSSPA KIRRLDVGLPPIIEEE PPEIA L+E+ +MP N+VA   GLRIEELPDASSV+A SA AMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL

Query:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS
        FKPVNT+FLQSSPLSVSVDSDIISGF+S+ILRENR   RLK GE+DDEMGTEN+NLAVVPWVPR+QVPA TNM V QEEAPQ+MEAEE+  ATM+IEDD+
Subjt:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS

Query:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
         SSQQ PYGYGGMDGANAIHQWHQQHCMIPQLPQQTS+PITWFR
Subjt:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

XP_038880770.1 uncharacterized protein LOC120072359 [Benincasa hispida]7.16e-14284.55Show/hide
Query:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLS++PL+PENFV     LRIEELP+ SSVSASA AMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV

Query:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
        LFKPVN+ FLQSSPLSVSVDSDIISGF+SE LRENR   R++  EDD+EMG EN+NLAVVPWVPR+QVPAPT+MDVPQEEAPQ+MEAEE+  ATMEIE+D
Subjt:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD

Query:  SFSS-QQPYGY-GGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
        + SS QQ YGY GGMDGANAIHQWHQQHCMIPQLPQQTS+PITWFR
Subjt:  SFSS-QQPYGY-GGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

TrEMBL top hitse value%identityAlignment
A0A1S3BX94 uncharacterized protein LOC1034942112.54e-13379.44Show/hide
Query:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
        MKMKMKRKDLDQIND+FSDFSLSSPARKIRRLDVGLPPIIEEEEPPEI+VLS+QPL+PE+F     G+RIEEL DASSVS S  AMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV

Query:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
        LFKP+NT+F QSSPLSVSVDSDIISGF+SE LREN    R+K GEDD++M TEN+NLAVVPWVPR+QVP  + M+VPQEEAPQ+MEAEE+  ATMEIE++
Subjt:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD

Query:  SFS---SQQPYGYGGMDGANAIHQWH-QQHCMIPQLPQQTSAPITWFR
          +   SQQ YGYGGMDGANAIHQWH QQHCMIPQLPQQTS+PITWFR
Subjt:  SFS---SQQPYGYGGMDGANAIHQWH-QQHCMIPQLPQQTSAPITWFR

A0A6J1BYN8 uncharacterized protein LOC1110068192.14e-172100Show/hide
Query:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
        MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
Subjt:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV

Query:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
        LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD
Subjt:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDD

Query:  SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
        SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
Subjt:  SFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

A0A6J1E5E7 uncharacterized protein LOC1114296501.87e-13280Show/hide
Query:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV
        MKMKMKRKDLDQ NDEFS+FSLSSPARKIRRLDVGLPPIIEEEEPPEIAVL  +PLMPENFV    GLRIEELPDASSVSASACA  DRPF DN ERAIV
Subjt:  MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIV

Query:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEG-ATMEIED
        LF   NT FLQSSP SVSVD DIISG +S+I+REN    RLK GE+D+EMGTEN+NLAVVPWVPR+QVPA T MDV QEEAPQ+MEAEE  G ATME+ED
Subjt:  LFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEG-ATMEIED

Query:  DSFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
        ++ SSQ+ YGYGGMDGA+ IHQWHQQHCMI QLPQQTS+PITWFR
Subjt:  DSFSSQQPYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

A0A6J1F0X3 uncharacterized protein LOC1114411472.00e-13884.43Show/hide
Query:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL
        MKMKRKDLDQ NDEFSDFSLSSPA KIRRLDVGLPPIIEEE PPEI+ L+E+ +MP N+VA   GLRIEELPDASSV+A SACAMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL

Query:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS
        FKPVNT+FLQSSPLSVSVDSDIISGF+S+ILRENR   RLK GEDDDEMGTEN+NLAVVPWVPR+QVPA TNM V QEEAPQ+MEAEE+  ATM+IEDD+
Subjt:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS

Query:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
         SSQQ PYGYGGMDGANAIHQWHQQHCMIPQLPQQTS+PITWFR
Subjt:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

A0A6J1HZ13 uncharacterized protein LOC1114678991.83e-13482.38Show/hide
Query:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL
        MKMKRKDLDQ NDEFSDFSLSSPA KIRRLDVGLPPIIEEE PPEIA L+E+ ++P N+VA   G+RIEELPDASSV+A SACAMEDRP CDNQERAIVL
Subjt:  MKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSA-SACAMEDRPFCDNQERAIVL

Query:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS
        FKPVNT+FLQS PLSVSVDSDIISGF+S+ILRENR   RLK GEDDDEMGTEN+NLAVVPWVPR+QVPA TNM V QEEAPQ+MEAEE+  ATM+IEDD+
Subjt:  FKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDS

Query:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR
         SSQQ PYGYGGMDGANAIHQW Q  CMIPQLPQQTS+PITWFR
Subjt:  FSSQQ-PYGYGGMDGANAIHQWHQQHCMIPQLPQQTSAPITWFR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G35320.1 unknown protein4.8e-2837.64Show/hide
Query:  KMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEE---PPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERA
        ++ MKRKD+D++ND+FSDFSLSSPARKIRRLDV LPPI+EEEE   P +  V  E  L P                                   N ERA
Subjt:  KMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEE---PPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERA

Query:  IVLFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTE--NRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEE------I
        IVLFKP++  + Q S  +V VD  +ISGF++  LR+   A       DD++   E  N+  AVV W P     + +     Q    ++ E +E      +
Subjt:  IVLFKPVNTTFLQSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTE--NRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEE------I

Query:  EGATMEIEDDSFSSQQPYGYGGMDGAN------AIHQWHQ-QHCMIPQLPQ--QTSAPITWFR
        + A+ EIE+D+ S+   +   G            +H W Q Q+CMIPQLPQ   T  PITWFR
Subjt:  EGATMEIEDDSFSSQQPYGYGGMDGAN------AIHQWHQ-QHCMIPQLPQ--QTSAPITWFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGAAGATGAAGAGGAAAGATCTCGATCAAATTAACGATGAGTTCTCCGATTTCTCTCTCTCCTCGCCTGCCAGGAAGATTCGCCGTTTGGATGTCGGTTTGCC
GCCTATTATTGAAGAAGAAGAACCGCCGGAAATTGCTGTTTTAAGCGAGCAGCCATTGATGCCTGAGAATTTTGTTGCAAGTCGGGGCGGTCTGAGGATTGAGGAATTGC
CAGATGCTTCCTCTGTCTCTGCTTCAGCTTGTGCTATGGAGGATCGTCCTTTCTGCGACAACCAAGAGAGGGCTATTGTTCTCTTTAAGCCTGTGAACACGACTTTCTTG
CAGTCATCTCCTCTTTCGGTCTCTGTTGATTCGGACATTATATCTGGCTTCAGAAGCGAAATTCTCCGTGAAAACCGTAATGCTAGCAGGCTGAAACTGGGTGAAGACGA
CGACGAAATGGGGACAGAGAACAGGAATTTAGCTGTTGTTCCATGGGTTCCACGGATGCAGGTTCCTGCTCCTACAAACATGGATGTGCCCCAAGAGGAAGCTCCACAGA
TGATGGAAGCTGAGGAGATTGAAGGGGCAACAATGGAGATTGAAGATGACAGCTTCAGCAGTCAACAACCTTATGGCTATGGTGGAATGGATGGCGCTAATGCTATACAT
CAATGGCACCAACAGCACTGCATGATCCCACAGCTGCCCCAACAAACGTCCGCACCCATAACATGGTTCCGTTGA
mRNA sequenceShow/hide mRNA sequence
GTCAGACTCAGGAAGGAAGCGTGAGAGTATCTGATAATCCCTCTGCCTCCAATTGTAAGACGCCACAAACAAATTCTGATTAACCGCTATTATAGCAGGTATATTATATA
TATAAGTTTATGTAACGTAAAACTATATGAATATTAAAAATTAAAGAAAAAGAATTTTTATTTATTATTTATTATTATTCTTTTAGAAGAATTGGGCTTCTTCTTCGTCT
TCTGTATCAGTGGGGCTTCCAGAGAGGTCTAGAGAAAGCCAGAAACAGAAACCCTTTTTCCTTCTCTCTCATTGGGCTTCGTTCTAACAATTTCTCCTGTTTTTTCCTAC
TCTTGTTTATATAAAGATACGATTCCATTTGTTCTTCGCTGAATTTGGTCGGAGGGGAATCAGATTTTTTCAACCACTACAGATTTGGTGGAGGAGTTTCTGTTTGATCG
GACATGAAGATGAAGATGAAGAGGAAAGATCTCGATCAAATTAACGATGAGTTCTCCGATTTCTCTCTCTCCTCGCCTGCCAGGAAGATTCGCCGTTTGGATGTCGGTTT
GCCGCCTATTATTGAAGAAGAAGAACCGCCGGAAATTGCTGTTTTAAGCGAGCAGCCATTGATGCCTGAGAATTTTGTTGCAAGTCGGGGCGGTCTGAGGATTGAGGAAT
TGCCAGATGCTTCCTCTGTCTCTGCTTCAGCTTGTGCTATGGAGGATCGTCCTTTCTGCGACAACCAAGAGAGGGCTATTGTTCTCTTTAAGCCTGTGAACACGACTTTC
TTGCAGTCATCTCCTCTTTCGGTCTCTGTTGATTCGGACATTATATCTGGCTTCAGAAGCGAAATTCTCCGTGAAAACCGTAATGCTAGCAGGCTGAAACTGGGTGAAGA
CGACGACGAAATGGGGACAGAGAACAGGAATTTAGCTGTTGTTCCATGGGTTCCACGGATGCAGGTTCCTGCTCCTACAAACATGGATGTGCCCCAAGAGGAAGCTCCAC
AGATGATGGAAGCTGAGGAGATTGAAGGGGCAACAATGGAGATTGAAGATGACAGCTTCAGCAGTCAACAACCTTATGGCTATGGTGGAATGGATGGCGCTAATGCTATA
CATCAATGGCACCAACAGCACTGCATGATCCCACAGCTGCCCCAACAAACGTCCGCACCCATAACATGGTTCCGTTGATGGTATGTACATGGTGAAGAATGCCATCTTTT
GATCTCATGGGGGGCGCTAGGAAATAGGCTAATTTGCAGCAAAATTGACTTCTAAACATCATATGGGTATAAATATAATGCCTTTTTGTTCCTCTTGCTCTTGTGTTCAT
CATCATGTGGCAGCTTTTCTTATCTGAGGAGTTGCTGTGTTGGAGAATTGTGAGCTCTTGTACAAATCTTTCTCCCCTTCAAAACTTCCCAGCTCAGCTGGATTCTAATG
AATCAGTATTTTCCACTGGCAAATGGCATTATGATATCTTGTGTGAGAAAAGTTCCAAATTTTGATGAGAG
Protein sequenceShow/hide protein sequence
MKMKMKRKDLDQINDEFSDFSLSSPARKIRRLDVGLPPIIEEEEPPEIAVLSEQPLMPENFVASRGGLRIEELPDASSVSASACAMEDRPFCDNQERAIVLFKPVNTTFL
QSSPLSVSVDSDIISGFRSEILRENRNASRLKLGEDDDEMGTENRNLAVVPWVPRMQVPAPTNMDVPQEEAPQMMEAEEIEGATMEIEDDSFSSQQPYGYGGMDGANAIH
QWHQQHCMIPQLPQQTSAPITWFR