; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:13326119..13329031
RNA-Seq ExpressionMoc08g17610
SyntenyMoc08g17610
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]1.2e-6956.49Show/hide
Query:  LNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSA
        +N + LA D  + IR YA P F + N  I  P I+A +FELKP MFQMLQTV  F G+ ++DPH HLR F++V++SFKI+GV++E + LKLFP+SLRD A
Subjt:  LNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSA

Query:  GAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLS
         +WL++L   S+T+ NDLAEKFL +YFPP++NA+ RS+I +FQQL  ES S++WE+FK LL+KCPHHGIP CIQ+ET+ NGLN A+++V+DASAN A+LS
Subjt:  GAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLS

Query:  KLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK
        K Y EAF+ILE I+ N +QWS ++A    TS+  AG  +
Subjt:  KLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK

XP_017216983.1 PREDICTED: uncharacterized protein LOC108194534 [Daucus carota subsp. sativus]8.6e-7148.84Show/hide
Query:  MNDFPTLEFEFNPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEAN
        M++   + F F+P IERTF RRR  QR  ++ ++ + +  +  ++              L  P  A F+  D ++ IR YA P F + N  I  P I+A 
Subjt:  MNDFPTLEFEFNPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEAN

Query:  RFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRS
        +FELKP MFQMLQT+  F G+ ++DPH HLR FM++++SFK +GVT++A+ LKLFPY +RD A  WL+SL A S+T+ NDL EKFL +YFPP+ NA+LR+
Subjt:  RFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRS

Query:  KINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGA
        +IN+FQQ   ESL ++WE+FK L++KCPHHGI  CIQ+ET+ NGLN  T++V+DASAN ALLSK Y +A++ILE I+ N +QWS S+A    T K  AG 
Subjt:  KINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGA

Query:  F
        +
Subjt:  F

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]1.0e-7146.61Show/hide
Query:  FEFNPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAM
        F F+P+IERTF RRR  QR  ++ ++ MD+        V      +V R A        F+  D ++ IR YA P F + N  I  P I+A +FELKP M
Subjt:  FEFNPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAM

Query:  FQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQL
        FQMLQT+  F G+ ++DPH HLR FM++++SFK +GV ++A+ LKLFPYS+RD A  WL+SL A S+T+ NDL EKFL +YFPP+ NA+LR++IN+FQQ 
Subjt:  FQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQL

Query:  HGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFKAKNQQT
          ESL ++WE+FK LL+KCPHHGI  CIQ+ET+ NGLN  T++V+DASAN ALLSK Y +A++ILE I+   +QWS S+A    T K  AG +   +  +
Subjt:  HGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFKAKNQQT

Query:  APKHPETMTLEDMFKAYMVKHESYMAKNDTIVQSLAASL
              +M        +M+K+ S M  N +  QSL++ +
Subjt:  APKHPETMTLEDMFKAYMVKHESYMAKNDTIVQSLAASL

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]2.5e-7060.27Show/hide
Query:  LAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLD
        +A D +Q IR YA P F + N  I  P I+A +FELKP MFQMLQTV  F G+ ++DPH HLR FM+V++SFK+ GVT++A+ LKLFPYSLRD A AWL+
Subjt:  LAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLD

Query:  SLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAE
        SL + S+T+  +LAE+FLM+YFPP+KNA+LR +I +FQQ   ESL E+WE+FK LL+KCPHHGIP CIQ+ET+ NGLN  T++V+DASAN ALL+K Y E
Subjt:  SLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAE

Query:  AFDILERISRNKHQWSKSK
        A+DI+ERIS N +QW  ++
Subjt:  AFDILERISRNKHQWSKSK

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]1.1e-7050Show/hide
Query:  NPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQM
        +P+IERTFR+RR  Q+ ++R  +             +G +  V G H  A   N + LA D  + IR YA P F + N  I  P I+A  FELKP MFQM
Subjt:  NPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQM

Query:  LQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGE
        LQTV  F G  ++DPH H+R F++V++SFK++GV++EA+ LKLFP+SLRD A AWL++L   S+T+ NDLAEKFL +YFPP++NA+ RS+I +FQQL  E
Subjt:  LQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGE

Query:  SLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK--AKNQQTA
        + S++WE+FK LL+KCPHHGIP CIQ+ET+ NGLN A ++V+DA AN A+LSK Y EAF+ILERI+ N +QWS ++A    TS+  AG  +  A    TA
Subjt:  SLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK--AKNQQTA

Query:  PKHPETMTLEDM
             T  L++M
Subjt:  PKHPETMTLEDM

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333941.7e-6452.23Show/hide
Query:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG
        NA+ LA D E+ IRAYA P   + N  I  P ++A  FELKP MFQMLQT+  FHGL S+DPH HL+ F+ V++SF+ + V K+ + L LFPYSLRD A 
Subjt:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG

Query:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK
        +WL++L   +I S N L EKFL++YFPP++NA  R++I  FQQ   ++LSE+WE+FK +L+KCPHHG+P CIQ+ET+ NGLN AT+ V+DASAN A+LSK
Subjt:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK

Query:  LYAEAFDILERISRNKHQWS--KSKAASTTTSKSDAGAFKAKNQQTA
         Y EA++ILERI+ N  QW+  +S     T    +  A  + N Q A
Subjt:  LYAEAFDILERISRNKHQWS--KSKAASTTTSKSDAGAFKAKNQQTA

A0A6J1EQ90 uncharacterized protein LOC1114364112.1e-6244.44Show/hide
Query:  MNDFPTLEFEFNPKIERTFRRR------RANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIAD
        MN    LEF  +P+IERTFRRR         Q +Q+ E     N +      +   +R+           N + LA D E+ IRAYA P   + N  I  
Subjt:  MNDFPTLEFEFNPKIERTFRRR------RANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIAD

Query:  PIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQV-------ANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLM
        P I+   FELKP MFQMLQT+  FHGL  +DPH HL+ F+ V       ++SF+ +GV K+ + L LFPY LRD A +WL++L   +I S N LAE FL+
Subjt:  PIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQV-------ANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLM

Query:  QYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWS--K
        +YFPP++NA  +++I  FQQ   E+LSE+ E+FK +L+KCPHHG+P CIQ+ET+ NGLN  T+ V+DASAN A+LSK Y EA++ILERI+ N  QW+  +
Subjt:  QYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWS--K

Query:  SKAASTTTSKSDAGAFKAKNQQTA
        S     T    +  A  + N Q A
Subjt:  SKAASTTTSKSDAGAFKAKNQQTA

A0A6J1G7Q6 uncharacterized protein LOC1114515987.1e-6351.82Show/hide
Query:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG
        NA+ +A D E+ IRAYA P   + N  I  P ++A  FELKP MFQMLQT+  FHGL+SKDPH HL+ F+ V++SF+ +GV K+ + L  F YSLRD A 
Subjt:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG

Query:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK
        +WL+ L    I S N LAEKFL +YFPP+++A  R++I  FQ+   E+LSE+WE+FK  L+KCPHHG+P CIQIET+ NGLN AT+ V+DASAN  +LSK
Subjt:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK

Query:  LYAEAFDILERISRNKHQW--SKSKAASTTTSKSDAGAFKAKNQQTA
         Y EA++ILERI+ N  QW   +S     T    +  A  + N Q A
Subjt:  LYAEAFDILERISRNKHQW--SKSKAASTTTSKSDAGAFKAKNQQTA

A0A6J1H7E4 uncharacterized protein LOC1114611684.0e-6653.44Show/hide
Query:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG
        NA+ LA D E+ IRAYA P   + N  I  P ++A  FELKP MFQMLQT+  FHGL S+DPH HL+ F+ V++SF+ +GV K+ + L LFPYSLRD A 
Subjt:  NAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAG

Query:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK
        +WL++L   +I S N LAEKFL++YFPP++NA  R++I  FQQ   E+LSE+WE+FK +L+KCPHHG+P CIQ+ET+ NGLN AT+ V+DASAN A+LSK
Subjt:  AWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSK

Query:  LYAEAFDILERISRNKHQWS--KSKAASTTTSKSDAGAFKAKNQQTA
         Y EA++ILERI+ N  QW+  +S     T    +  A  + N Q A
Subjt:  LYAEAFDILERISRNKHQWS--KSKAASTTTSKSDAGAFKAKNQQTA

U5CUI2 Retrotrans_gag domain-containing protein6.0e-7056.49Show/hide
Query:  LNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSA
        +N + LA D  + IR YA P F + N  I  P I+A +FELKP MFQMLQTV  F G+ ++DPH HLR F++V++SFKI+GV++E + LKLFP+SLRD A
Subjt:  LNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELKPAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSA

Query:  GAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLS
         +WL++L   S+T+ NDLAEKFL +YFPP++NA+ RS+I +FQQL  ES S++WE+FK LL+KCPHHGIP CIQ+ET+ NGLN A+++V+DASAN A+LS
Subjt:  GAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHGESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLS

Query:  KLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK
        K Y EAF+ILE I+ N +QWS ++A    TS+  AG  +
Subjt:  KLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATTTTCCTACTTTGGAATTTGAGTTTAACCCAAAAATTGAACGAACTTTTAGAAGGAGAAGGGCCAATCAAAGAGTACAACGTAGAGAAGAAATTAAA
ATGGATAACCCACAACATCCACCAGAGTTGAGAGTAGAGGGAGTACAAAGGGTTGTAGTTGGAAGACATGCTCTAGCACCTCCTCTGAATGCTGTTTTTCTCGCA
TATGATAGTGAGCAGGAAATTAGAGCTTATGCGACCCCAACATTCTATGACTTCAATCATATGATTGCAGATCCTATTATTGAAGCCAATAGATTTGAGCTGAAG
CCCGCAATGTTTCAGATGCTCCAGACCGTATGGTATTTTCATGGGCTCGCATCAAAAGATCCCCATCGTCACCTACGATATTTCATGCAAGTGGCAAACTCATTT
AAGATAGAAGGAGTAACTAAGGAGGCTATGTGGCTGAAACTATTTCCTTATTCCTTGAGGGACAGTGCTGGAGCATGGTTGGATTCTTTACTTGCAGTGTCAATC
ACTTCATTGAATGATTTAGCAGAGAAGTTCCTAATGCAATACTTCCCACCATCGAAAAATGCGGAGCTTAGGAGCAAGATAAACAATTTTCAACAACTCCATGGA
GAGTCCTTGAGCGAGTCATGGGAGAAATTTAAAGGACTGCTCCAAAAGTGCCCTCATCATGGAATCCCACGGTGCATTCAAATAGAGACGTATGATAATGGTTTG
AATGAGGCCACGCAGTTGGTGATAGATGCTTCAGCGAATGAAGCATTATTATCAAAGTTGTATGCTGAAGCCTTCGATATTTTGGAAAGAATCTCGCGCAATAAG
CACCAATGGTCAAAGTCCAAGGCTGCTTCAACGACTACATCTAAGTCAGATGCTGGAGCATTTAAAGCTAAGAATCAACAGACCGCTCCAAAGCATCCAGAGACT
ATGACCCTAGAGGATATGTTTAAGGCATACATGGTAAAGCACGAGTCTTATATGGCGAAGAATGATACTATTGTTCAAAGTCTAGCTGCATCCTTGGGAAATATA
GAGGGGCGAATAGATGAAACATTAATAGAGACACTGAGCTCAGAAGCTATGTTTAAGCATCCGGGTGGTAGTGGATTAGCTATACTGGAGCATACTGTAAGCAGG
GATGCTGGAGCTCAAGAAGGTTTTACTGGTGGAGATAAAGTGCACATCATTGTAGAAGAGCCACCCGACATCAAGCCTACTCTAAAAGGTCGTGCATCAAAGACG
AGAAAATCGAAAAGGGAGTTAGAATCATCATCCAACAAAGGAAAAAATGCAAAACCACAAGTCACCACTTCCAAGAAGAACAAAAAGGTAGAAAAGGGAACTTCT
GTTTCCTCAAATGAGGGGAAGAGATATGCTACATCTATTTTGCAGGAGCCTCCCTCAATGGCAAAGAGAGGGAAAATCCAAAGACGAAGATCAGCCTATGGGCGA
CTCGTTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGATTTTCCTACTTTGGAATTTGAGTTTAACCCAAAAATTGAACGAACTTTTAGAAGGAGAAGGGCCAATCAAAGAGTACAACGTAGAGAAGAAATTAAA
ATGGATAACCCACAACATCCACCAGAGTTGAGAGTAGAGGGAGTACAAAGGGTTGTAGTTGGAAGACATGCTCTAGCACCTCCTCTGAATGCTGTTTTTCTCGCA
TATGATAGTGAGCAGGAAATTAGAGCTTATGCGACCCCAACATTCTATGACTTCAATCATATGATTGCAGATCCTATTATTGAAGCCAATAGATTTGAGCTGAAG
CCCGCAATGTTTCAGATGCTCCAGACCGTATGGTATTTTCATGGGCTCGCATCAAAAGATCCCCATCGTCACCTACGATATTTCATGCAAGTGGCAAACTCATTT
AAGATAGAAGGAGTAACTAAGGAGGCTATGTGGCTGAAACTATTTCCTTATTCCTTGAGGGACAGTGCTGGAGCATGGTTGGATTCTTTACTTGCAGTGTCAATC
ACTTCATTGAATGATTTAGCAGAGAAGTTCCTAATGCAATACTTCCCACCATCGAAAAATGCGGAGCTTAGGAGCAAGATAAACAATTTTCAACAACTCCATGGA
GAGTCCTTGAGCGAGTCATGGGAGAAATTTAAAGGACTGCTCCAAAAGTGCCCTCATCATGGAATCCCACGGTGCATTCAAATAGAGACGTATGATAATGGTTTG
AATGAGGCCACGCAGTTGGTGATAGATGCTTCAGCGAATGAAGCATTATTATCAAAGTTGTATGCTGAAGCCTTCGATATTTTGGAAAGAATCTCGCGCAATAAG
CACCAATGGTCAAAGTCCAAGGCTGCTTCAACGACTACATCTAAGTCAGATGCTGGAGCATTTAAAGCTAAGAATCAACAGACCGCTCCAAAGCATCCAGAGACT
ATGACCCTAGAGGATATGTTTAAGGCATACATGGTAAAGCACGAGTCTTATATGGCGAAGAATGATACTATTGTTCAAAGTCTAGCTGCATCCTTGGGAAATATA
GAGGGGCGAATAGATGAAACATTAATAGAGACACTGAGCTCAGAAGCTATGTTTAAGCATCCGGGTGGTAGTGGATTAGCTATACTGGAGCATACTGTAAGCAGG
GATGCTGGAGCTCAAGAAGGTTTTACTGGTGGAGATAAAGTGCACATCATTGTAGAAGAGCCACCCGACATCAAGCCTACTCTAAAAGGTCGTGCATCAAAGACG
AGAAAATCGAAAAGGGAGTTAGAATCATCATCCAACAAAGGAAAAAATGCAAAACCACAAGTCACCACTTCCAAGAAGAACAAAAAGGTAGAAAAGGGAACTTCT
GTTTCCTCAAATGAGGGGAAGAGATATGCTACATCTATTTTGCAGGAGCCTCCCTCAATGGCAAAGAGAGGGAAAATCCAAAGACGAAGATCAGCCTATGGGCGA
CTCGTTGGATGA
Protein sequenceShow/hide protein sequence
MNDFPTLEFEFNPKIERTFRRRRANQRVQRREEIKMDNPQHPPELRVEGVQRVVVGRHALAPPLNAVFLAYDSEQEIRAYATPTFYDFNHMIADPIIEANRFELK
PAMFQMLQTVWYFHGLASKDPHRHLRYFMQVANSFKIEGVTKEAMWLKLFPYSLRDSAGAWLDSLLAVSITSLNDLAEKFLMQYFPPSKNAELRSKINNFQQLHG
ESLSESWEKFKGLLQKCPHHGIPRCIQIETYDNGLNEATQLVIDASANEALLSKLYAEAFDILERISRNKHQWSKSKAASTTTSKSDAGAFKAKNQQTAPKHPET
MTLEDMFKAYMVKHESYMAKNDTIVQSLAASLGNIEGRIDETLIETLSSEAMFKHPGGSGLAILEHTVSRDAGAQEGFTGGDKVHIIVEEPPDIKPTLKGRASKT
RKSKRELESSSNKGKNAKPQVTTSKKNKKVEKGTSVSSNEGKRYATSILQEPPSMAKRGKIQRRRSAYGRLVG