; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:13948998..13950497
RNA-Seq ExpressionMoc03g20530
SyntenyMoc03g20530
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046553.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-6745.48Show/hide
Query:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTE---------NMDNSSSRRSTNKIEELTKEVN-TFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSR
        D+ PF++ ESHF D KFY+K+++V E + TE         N     + + TNK + L  + N          +  + K    +KEV  +  +LRYI LSR
Subjt:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTE---------NMDNSSSRRSTNKIEELTKEVN-TFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSR

Query:  RKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKL
        RKK ESPF E SKNL   + E+LK+NFT PLTKI K E KK +   ++  LPE+ T +G DPKAYKL+AKAGYDFT+ TE  S++IFD++ ELS T+KKL
Subjt:  RKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKL

Query:  LKEGYNIPASKAGL---------------------------------------------------------DDISEEDVEEAPSSLEDGSQSTVDELKEV
         K+GY+IP S+AG+                                                          DI EED E AP  LEDG QST+DELKEV
Subjt:  LKEGYNIPASKAGL---------------------------------------------------------DDISEEDVEEAPSSLEDGSQSTVDELKEV

Query:  NLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        NL+T EEPRPTFIS+ LS + ENEY++LL +Y+D+FAWSYKE+  LDPKV VH LAIK  +RPVK
Subjt:  NLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-5738.75Show/hide
Query:  FSEVESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE
        F++      D KFY KND   E     +P  N +++   +S     E  K   TF +  +EASTS++K+     E   +  ILRY+ LSRRKK ESPF+E
Subjt:  FSEVESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE

Query:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPAS
        + + LK  D+E+LK++FT PLTKI+KQEIK   DL ++ +LP++ TKDG DPKAYK +AKAGYDFT+HTEF SL+I +  + LS+T+KKLL+EG+ IP S
Subjt:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPAS

Query:  KAGLD-----------------------------------------------------------------------------------------------
        + GL                                                                                                
Subjt:  KAGLD-----------------------------------------------------------------------------------------------

Query:  --------------------------------------DISEEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRD
                                              +I EED E+AP SLEDG QS VD+LKEVNL T EEP PTFIS+SLS + E +YMSLLT Y+D
Subjt:  --------------------------------------DISEEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRD

Query:  IFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        IFAWSYKEM   DPKV VHHLAIK GYRP+K
Subjt:  IFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

TYJ99655.1 uncharacterized protein E5676_scaffold562G00360 [Cucumis melo var. makuwa]9.0e-5848.17Show/hide
Query:  ESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKN
        ESHF + KFY KND   E     +P  N +++   +S     E  K   TF +   E STS++K+     E   +  IL Y+ +SRRKK ESPF+E  K 
Subjt:  ESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKN

Query:  LKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL
        LK  D+E+LK++FT PLTKI+KQ+IK   DL  + +LP+  TKDG + KAYKL+AK GYDF +HTEF SL++  ++ ELS+ +KKLL+EG+ IP S+ GL
Subjt:  LKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL

Query:  D-------------------------------DIS----------EEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLT
        +                               DI+          EED E+ P SLEDG QSTVDELKEVNL T EEPRPTFIS+SL    E +YMSLLT
Subjt:  D-------------------------------DIS----------EEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLT

Query:  SYRDIFAWSYKEMLRLDPKVVVHHLAIK
         YRDIFA SYKEM  L PKV VHHL IK
Subjt:  SYRDIFAWSYKEMLRLDPKVVVHHLAIK

TYK06279.1 uncharacterized protein E5676_scaffold157G00630 [Cucumis melo var. makuwa]1.9e-6352.26Show/hide
Query:  MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQ
        M T    N  +  ++ +  E T E+          +  + K    +KEV  + T+LRYI LSRR+K ESPF E SKNL   + E+LK+NFT PLTKI K 
Subjt:  MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQ

Query:  EIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL----------------DDISEED
        E KK +   ++  LPE+ T +G DPKAYKL+AKAGYDFT+ TE  S++IFD++ ELS T+KKL K+GY+IP S+A +                 DI EED
Subjt:  EIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL----------------DDISEED

Query:  VEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
         + AP SLEDG QST+DELKEVNL T EEPRPTFIS+ LS + ENEY++LL +Y+D+FAWSYKEM  LDPKVVVH LAIK  + PVK
Subjt:  VEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

TYK16284.1 uncharacterized protein E5676_scaffold21G00550 [Cucumis melo var. makuwa]2.6e-5746.85Show/hide
Query:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE
        D+NPFSE ESHF D KFY KND   E +P                          E    EASTS++K+     E   +  ILRYI LS  KK ESPF+E
Subjt:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE

Query:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTE-FTSLRIFDDKRELSATRKKLL--------
          + LK  D+++LK++FT  LTKI+KQEIK   DL ++  LP++ TKDG DP AYKL+ KAGYDFT+HTE     R+ +  R +   ++K++        
Subjt:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTE-FTSLRIFDDKRELSATRKKLL--------

Query:  -------KEGYNI------------PASKAGL-DDISEEDVE-----EAP-SSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSY
               KEG ++            P S+  +   I   DVE     E P  SL+DG QST+DELKE+NL T EEPRPTFIS+SLS + E++YMSLLT Y
Subjt:  -------KEGYNI------------PASKAGL-DDISEEDVE-----EAP-SSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSY

Query:  RDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        +DIFAWSYKEM  LDPKV +HHLAIK GYRP+K
Subjt:  RDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

TrEMBL top hitse value%identityAlignment
A0A5A7TSN4 Ty3-gypsy retrotransposon protein1.3e-6745.48Show/hide
Query:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTE---------NMDNSSSRRSTNKIEELTKEVN-TFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSR
        D+ PF++ ESHF D KFY+K+++V E + TE         N     + + TNK + L  + N          +  + K    +KEV  +  +LRYI LSR
Subjt:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTE---------NMDNSSSRRSTNKIEELTKEVN-TFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSR

Query:  RKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKL
        RKK ESPF E SKNL   + E+LK+NFT PLTKI K E KK +   ++  LPE+ T +G DPKAYKL+AKAGYDFT+ TE  S++IFD++ ELS T+KKL
Subjt:  RKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKL

Query:  LKEGYNIPASKAGL---------------------------------------------------------DDISEEDVEEAPSSLEDGSQSTVDELKEV
         K+GY+IP S+AG+                                                          DI EED E AP  LEDG QST+DELKEV
Subjt:  LKEGYNIPASKAGL---------------------------------------------------------DDISEEDVEEAPSSLEDGSQSTVDELKEV

Query:  NLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        NL+T EEPRPTFIS+ LS + ENEY++LL +Y+D+FAWSYKE+  LDPKV VH LAIK  +RPVK
Subjt:  NLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

A0A5D3BMW4 Uncharacterized protein4.3e-5848.17Show/hide
Query:  ESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKN
        ESHF + KFY KND   E     +P  N +++   +S     E  K   TF +   E STS++K+     E   +  IL Y+ +SRRKK ESPF+E  K 
Subjt:  ESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKN

Query:  LKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL
        LK  D+E+LK++FT PLTKI+KQ+IK   DL  + +LP+  TKDG + KAYKL+AK GYDF +HTEF SL++  ++ ELS+ +KKLL+EG+ IP S+ GL
Subjt:  LKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL

Query:  D-------------------------------DIS----------EEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLT
        +                               DI+          EED E+ P SLEDG QSTVDELKEVNL T EEPRPTFIS+SL    E +YMSLLT
Subjt:  D-------------------------------DIS----------EEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLT

Query:  SYRDIFAWSYKEMLRLDPKVVVHHLAIK
         YRDIFA SYKEM  L PKV VHHL IK
Subjt:  SYRDIFAWSYKEMLRLDPKVVVHHLAIK

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.3e-5738.75Show/hide
Query:  FSEVESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE
        F++      D KFY KND   E     +P  N +++   +S     E  K   TF +  +EASTS++K+     E   +  ILRY+ LSRRKK ESPF+E
Subjt:  FSEVESHFTDPKFYSKNDEVEET----MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE

Query:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPAS
        + + LK  D+E+LK++FT PLTKI+KQEIK   DL ++ +LP++ TKDG DPKAYK +AKAGYDFT+HTEF SL+I +  + LS+T+KKLL+EG+ IP S
Subjt:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPAS

Query:  KAGLD-----------------------------------------------------------------------------------------------
        + GL                                                                                                
Subjt:  KAGLD-----------------------------------------------------------------------------------------------

Query:  --------------------------------------DISEEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRD
                                              +I EED E+AP SLEDG QS VD+LKEVNL T EEP PTFIS+SLS + E +YMSLLT Y+D
Subjt:  --------------------------------------DISEEDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRD

Query:  IFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        IFAWSYKEM   DPKV VHHLAIK GYRP+K
Subjt:  IFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

A0A5D3C7G5 Uncharacterized protein9.0e-6452.26Show/hide
Query:  MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQ
        M T    N  +  ++ +  E T E+          +  + K    +KEV  + T+LRYI LSRR+K ESPF E SKNL   + E+LK+NFT PLTKI K 
Subjt:  MPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIEYSKNLKSDDVEMLKDNFTLPLTKISKQ

Query:  EIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL----------------DDISEED
        E KK +   ++  LPE+ T +G DPKAYKL+AKAGYDFT+ TE  S++IFD++ ELS T+KKL K+GY+IP S+A +                 DI EED
Subjt:  EIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGL----------------DDISEED

Query:  VEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
         + AP SLEDG QST+DELKEVNL T EEPRPTFIS+ LS + ENEY++LL +Y+D+FAWSYKEM  LDPKVVVH LAIK  + PVK
Subjt:  VEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

A0A5D3CZ99 Reverse transcriptase domain-containing protein1.3e-5746.85Show/hide
Query:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE
        D+NPFSE ESHF D KFY KND   E +P                          E    EASTS++K+     E   +  ILRYI LS  KK ESPF+E
Subjt:  DTNPFSEVESHFTDPKFYSKNDEVEETMPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFIE

Query:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTE-FTSLRIFDDKRELSATRKKLL--------
          + LK  D+++LK++FT  LTKI+KQEIK   DL ++  LP++ TKDG DP AYKL+ KAGYDFT+HTE     R+ +  R +   ++K++        
Subjt:  YSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTE-FTSLRIFDDKRELSATRKKLL--------

Query:  -------KEGYNI------------PASKAGL-DDISEEDVE-----EAP-SSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSY
               KEG ++            P S+  +   I   DVE     E P  SL+DG QST+DELKE+NL T EEPRPTFIS+SLS + E++YMSLLT Y
Subjt:  -------KEGYNI------------PASKAGL-DDISEEDVE-----EAP-SSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSY

Query:  RDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK
        +DIFAWSYKEM  LDPKV +HHLAIK GYRP+K
Subjt:  RDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGTATTGGAAAATACGGGTTGTTACACTGATACTAATCCATTCTCGGAGGTAGAATCTCATTTCACAGACCCTAAGTTTTATTCGAAGAATGATGAAGTAGAAGA
AACTATGCCGACAGAAAATATGGACAACTCTAGCTCAAGACGGTCAACAAACAAGATAGAAGAATTGACAAAAGAAGTTAATACTTTTGAAGCCCTTCCAAATGAAGCAT
CTACTAGTTCTTCGAAAACAGGAGCTTTTAAGAAAGAAGTCCCACAGAGCTCTACCATCTTGCGGTACATCCTTTTGTCTCGACGGAAAAAGAGTGAATCACCATTCATT
GAGTATTCCAAGAATTTGAAGTCCGATGATGTAGAAATGCTGAAGGATAATTTCACTTTACCTCTTACTAAAATATCAAAGCAAGAGATTAAGAAGTCCAAAGATCTCCA
AATGAAACAAGCTCTTCCTGAGAAATGTACAAAGGATGGCTGCGATCCTAAAGCATACAAACTTGTAGCGAAAGCAGGCTATGACTTCACATCTCACACTGAGTTTACAA
GCTTGAGAATTTTCGATGACAAGCGGGAACTTTCTGCAACACGGAAAAAGCTTTTGAAGGAAGGTTATAACATACCTGCATCAAAGGCTGGACTTGATGATATATCGGAA
GAAGATGTCGAAGAAGCACCATCATCACTAGAGGATGGCAGTCAATCGACTGTTGACGAACTTAAAGAGGTGAACCTCAACACAGCAGAAGAACCACGTCCAACCTTTAT
AAGCTCTTCACTTAGTCGCGATGCAGAAAATGAATACATGAGTCTGTTGACTTCATACAGAGACATATTTGCTTGGTCTTATAAAGAAATGCTAAGACTCGACCCAAAGG
TTGTAGTTCATCATCTTGCTATTAAACAGGGATATCGACCAGTAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGTATTGGAAAATACGGGTTGTTACACTGATACTAATCCATTCTCGGAGGTAGAATCTCATTTCACAGACCCTAAGTTTTATTCGAAGAATGATGAAGTAGAAGA
AACTATGCCGACAGAAAATATGGACAACTCTAGCTCAAGACGGTCAACAAACAAGATAGAAGAATTGACAAAAGAAGTTAATACTTTTGAAGCCCTTCCAAATGAAGCAT
CTACTAGTTCTTCGAAAACAGGAGCTTTTAAGAAAGAAGTCCCACAGAGCTCTACCATCTTGCGGTACATCCTTTTGTCTCGACGGAAAAAGAGTGAATCACCATTCATT
GAGTATTCCAAGAATTTGAAGTCCGATGATGTAGAAATGCTGAAGGATAATTTCACTTTACCTCTTACTAAAATATCAAAGCAAGAGATTAAGAAGTCCAAAGATCTCCA
AATGAAACAAGCTCTTCCTGAGAAATGTACAAAGGATGGCTGCGATCCTAAAGCATACAAACTTGTAGCGAAAGCAGGCTATGACTTCACATCTCACACTGAGTTTACAA
GCTTGAGAATTTTCGATGACAAGCGGGAACTTTCTGCAACACGGAAAAAGCTTTTGAAGGAAGGTTATAACATACCTGCATCAAAGGCTGGACTTGATGATATATCGGAA
GAAGATGTCGAAGAAGCACCATCATCACTAGAGGATGGCAGTCAATCGACTGTTGACGAACTTAAAGAGGTGAACCTCAACACAGCAGAAGAACCACGTCCAACCTTTAT
AAGCTCTTCACTTAGTCGCGATGCAGAAAATGAATACATGAGTCTGTTGACTTCATACAGAGACATATTTGCTTGGTCTTATAAAGAAATGCTAAGACTCGACCCAAAGG
TTGTAGTTCATCATCTTGCTATTAAACAGGGATATCGACCAGTAAAATAG
Protein sequenceShow/hide protein sequence
MTVLENTGCYTDTNPFSEVESHFTDPKFYSKNDEVEETMPTENMDNSSSRRSTNKIEELTKEVNTFEALPNEASTSSSKTGAFKKEVPQSSTILRYILLSRRKKSESPFI
EYSKNLKSDDVEMLKDNFTLPLTKISKQEIKKSKDLQMKQALPEKCTKDGCDPKAYKLVAKAGYDFTSHTEFTSLRIFDDKRELSATRKKLLKEGYNIPASKAGLDDISE
EDVEEAPSSLEDGSQSTVDELKEVNLNTAEEPRPTFISSSLSRDAENEYMSLLTSYRDIFAWSYKEMLRLDPKVVVHHLAIKQGYRPVK