; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g05720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g05720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr3:4225930..4232566
RNA-Seq ExpressionMoc03g05720
SyntenyMoc03g05720
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0066080.1 MuDRA-like transposase [Cucumis melo var. makuwa]6.5e-2429.53Show/hide
Query:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK
        R+ V YGG  +E    YEGGV+ G+ V + IT+ DL +  Y    +DP +F+I I CIY+   + E P F + +D  L+FY+    +P +V LY+S +P 
Subjt:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK

Query:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPST-
            + V + D  S+S  + +Q                     NL    P+  D L++NEVD     E++ GL   D     +  +      YH +    
Subjt:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPST-

Query:  TTSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKLDVRSCHLFPRPELC----TGPCRL-SPR---------------YDKIWCAKDVALSLLMGSLKD
        T     +      +  +  L   I  + ++     L++ S   +   EL      G  RL  PR               Y+K W A++ A   + GSL++
Subjt:  TTSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKLDVRSCHLFPRPELC----TGPCRL-SPR---------------YDKIWCAKDVALSLLMGSLKD

Query:  SYTLLRKYGEALKAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL
        SY LL +YGEALK  N G      YL  +G  +W+     G RY+ MT+N+AES+N++L
Subjt:  SYTLLRKYGEALKAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]6.5e-4033.64Show/hide
Query:  GKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK-------
        G+ NE GTVYEGGV+GGL+VDE ITY DLV+A +R T I+PD FNIV++CIYKF+ QY VP F+IFDD SL FYL G   PSQV LYVS+ PK       
Subjt:  GKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK-------

Query:  -----------ETYG-------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDEND
                   ET+        QNV S  P   + S ++ V   S V  +TPLTDNVVPCNLG+DE  H+ +      D   + + EY ++ DD ED++D
Subjt:  -----------ETYG-------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDEND

Query:  -QVVTPQDVIYHPRPSTTTSQAPLY-FSPSSLTPRVAL------ALDIYPQGLNAPLQKLDVRSC-----------------------------------
         Q +  +D           +  P+Y   PS  TP V +      A     +G++A L+++D   C                                   
Subjt:  -QVVTPQDVIYHPRPSTTTSQAPLY-FSPSSLTPRVAL------ALDIYPQGLNAPLQKLDVRSC-----------------------------------

Query:  ------------------------------------------HLFPRP--------------------------------ELCTGPCR---LSPRYDKIW
                                                  H+  R                                 ++ T   R   ++ RY+K W
Subjt:  ------------------------------------------HLFPRP--------------------------------ELCTGPCR---LSPRYDKIW

Query:  CAKDVALSLLMGSLKDSYTLLRKYGEALKAVNVG
         A++VALSLLMGS K+SYT L KYG ALKA NVG
Subjt:  CAKDVALSLLMGSLKDSYTLLRKYGEALKAVNVG

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]2.0e-0452.54Show/hide
Query:  GFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVLV-------TCLASDCTS
        GF  V  YLE IG +KW   +Q G+RY+QMTSN+AES+NAVLV       T L  +C S
Subjt:  GFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVLV-------TCLASDCTS

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]5.9e-2529.68Show/hide
Query:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK
        R+ V YGG  +E    YEGGV+ G+ V + IT+ DL +  Y    +DP +F+I I CIY+   + E P F + +D  L+FY+    +P +V LY+S +P 
Subjt:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK

Query:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPSTT
            + V + D  S+S  + +Q                     NL    P+  D L++NEVD     E++ GL         D ++     I+    S  
Subjt:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPSTT

Query:  TSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKL-----DVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL
        +   P  +    +   +    DI  Q     LQK+       +      +P       R    ++  Y+K W A++ A   + GSL++SY LL +YGEAL
Subjt:  TSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKL-----DVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL

Query:  KAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL
        K  N G      YL  +G  +W+     G RY+ MT+N+AES+N++L
Subjt:  KAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL

XP_022156802.1 uncharacterized protein LOC111023635 [Momordica charantia]5.7e-2028.26Show/hide
Query:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ
        M RLFVIYGGK NE GTVYEGG +GGLDVDE ITY +LV+A +  T ID DQF+++++C+Y                                       
Subjt:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ

Query:  PKETYGQNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYD-ELYENEVDYEIENEIEYGL--------LEDD-------------
                                      + +ITPL DNV+ CNL +DE    + +LYENEV+YE +++ EY          +EDD             
Subjt:  PKETYGQNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYD-ELYENEVDYEIENEIEYGL--------LEDD-------------

Query:  -TEDENDQ--------------------VVTPQDVIYHPRPSTTT-----------------SQAPLYFSPSSLTPRVALALDIYPQ-------------
         TED  D+                      T ++V+  P  +  T                 S+  L F  S L  ++     +                
Subjt:  -TEDENDQ--------------------VVTPQDVIYHPRPSTTT-----------------SQAPLYFSPSSLTPRVALALDIYPQ-------------

Query:  ---GLNA--------------------------------------PLQKLDVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSY
           GL A                                       L K +V       RP+      R    ++ RY+K W AK+VAL+LLMG  K SY
Subjt:  ---GLNA--------------------------------------PLQKLDVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSY

Query:  TLLRKYGEALKAVN
        TLLRKYGEALKAVN
Subjt:  TLLRKYGEALKAVN

XP_022156834.1 uncharacterized protein LOC111023667 [Momordica charantia]2.9e-5639.41Show/hide
Query:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ
        M RLFVIYGGK NE GT+YEGGV+GGLDVDE ITY +LV+A +  T IDPDQF+++++C+Y+F  +YEVP + IFDD SL+FYLNG  DPSQV LYV++ 
Subjt:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ

Query:  PKETYG-----------------------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLED
        PK +YG                       QN   F  +ISI SPL  V   SF+PQITPL DNV+PCNL +DE  +Y +LYENEV+Y+ +++ EY   E 
Subjt:  PKETYG-----------------------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLED

Query:  DTED-------------ENDQVVTPQDVI--------YHPRPSTTTSQAPLYFSPSSLTPRVALAL----------DIYPQGLNAPLQKL----------
         TED             E D+  T +DV+         H   S T S    Y +   +  R    +          +I  +G+     +L          
Subjt:  DTED-------------ENDQVVTPQDVI--------YHPRPSTTTSQAPLYFSPSSLTPRVALAL----------DIYPQGLNAPLQKL----------

Query:  ------------DVRSC--------HLFPR--------------------PELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL
                    DV  C        H   R                    P+      R    ++ RY+K W AK+VAL+LL+GS K SYTLL KYGEAL
Subjt:  ------------DVRSC--------HLFPR--------------------PELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL

Query:  KAVNVG
        K VN G
Subjt:  KAVNVG

TrEMBL top hitse value%identityAlignment
A0A5A7VFZ7 MuDRA-like transposase3.1e-2429.53Show/hide
Query:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK
        R+ V YGG  +E    YEGGV+ G+ V + IT+ DL +  Y    +DP +F+I I CIY+   + E P F + +D  L+FY+    +P +V LY+S +P 
Subjt:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK

Query:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPST-
            + V + D  S+S  + +Q                     NL    P+  D L++NEVD     E++ GL   D     +  +      YH +    
Subjt:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPST-

Query:  TTSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKLDVRSCHLFPRPELC----TGPCRL-SPR---------------YDKIWCAKDVALSLLMGSLKD
        T     +      +  +  L   I  + ++     L++ S   +   EL      G  RL  PR               Y+K W A++ A   + GSL++
Subjt:  TTSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKLDVRSCHLFPRPELC----TGPCRL-SPR---------------YDKIWCAKDVALSLLMGSLKD

Query:  SYTLLRKYGEALKAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL
        SY LL +YGEALK  N G      YL  +G  +W+     G RY+ MT+N+AES+N++L
Subjt:  SYTLLRKYGEALKAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL

A0A6J1DLB0 uncharacterized protein LOC1110219693.1e-4033.64Show/hide
Query:  GKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK-------
        G+ NE GTVYEGGV+GGL+VDE ITY DLV+A +R T I+PD FNIV++CIYKF+ QY VP F+IFDD SL FYL G   PSQV LYVS+ PK       
Subjt:  GKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK-------

Query:  -----------ETYG-------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDEND
                   ET+        QNV S  P   + S ++ V   S V  +TPLTDNVVPCNLG+DE  H+ +      D   + + EY ++ DD ED++D
Subjt:  -----------ETYG-------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDEND

Query:  -QVVTPQDVIYHPRPSTTTSQAPLY-FSPSSLTPRVAL------ALDIYPQGLNAPLQKLDVRSC-----------------------------------
         Q +  +D           +  P+Y   PS  TP V +      A     +G++A L+++D   C                                   
Subjt:  -QVVTPQDVIYHPRPSTTTSQAPLY-FSPSSLTPRVAL------ALDIYPQGLNAPLQKLDVRSC-----------------------------------

Query:  ------------------------------------------HLFPRP--------------------------------ELCTGPCR---LSPRYDKIW
                                                  H+  R                                 ++ T   R   ++ RY+K W
Subjt:  ------------------------------------------HLFPRP--------------------------------ELCTGPCR---LSPRYDKIW

Query:  CAKDVALSLLMGSLKDSYTLLRKYGEALKAVNVG
         A++VALSLLMGS K+SYT L KYG ALKA NVG
Subjt:  CAKDVALSLLMGSLKDSYTLLRKYGEALKAVNVG

A0A6J1DLB0 uncharacterized protein LOC1110219699.5e-0552.54Show/hide
Query:  GFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVLV-------TCLASDCTS
        GF  V  YLE IG +KW   +Q G+RY+QMTSN+AES+NAVLV       T L  +C S
Subjt:  GFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVLV-------TCLASDCTS

A0A6J1DLB0 uncharacterized protein LOC1110219692.8e-2529.68Show/hide
Query:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK
        R+ V YGG  +E    YEGGV+ G+ V + IT+ DL +  Y    +DP +F+I I CIY+   + E P F + +D  L+FY+    +P +V LY+S +P 
Subjt:  RLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPK

Query:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPSTT
            + V + D  S+S  + +Q                     NL    P+  D L++NEVD     E++ GL         D ++     I+    S  
Subjt:  ETYGQNVASFD-PSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPSTT

Query:  TSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKL-----DVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL
        +   P  +    +   +    DI  Q     LQK+       +      +P       R    ++  Y+K W A++ A   + GSL++SY LL +YGEAL
Subjt:  TSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKL-----DVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL

Query:  KAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL
        K  N G      YL  +G  +W+     G RY+ MT+N+AES+N++L
Subjt:  KAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVL

A0A6J1DSY0 uncharacterized protein LOC1110236352.8e-2028.26Show/hide
Query:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ
        M RLFVIYGGK NE GTVYEGG +GGLDVDE ITY +LV+A +  T ID DQF+++++C+Y                                       
Subjt:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ

Query:  PKETYGQNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYD-ELYENEVDYEIENEIEYGL--------LEDD-------------
                                      + +ITPL DNV+ CNL +DE    + +LYENEV+YE +++ EY          +EDD             
Subjt:  PKETYGQNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYD-ELYENEVDYEIENEIEYGL--------LEDD-------------

Query:  -TEDENDQ--------------------VVTPQDVIYHPRPSTTT-----------------SQAPLYFSPSSLTPRVALALDIYPQ-------------
         TED  D+                      T ++V+  P  +  T                 S+  L F  S L  ++     +                
Subjt:  -TEDENDQ--------------------VVTPQDVIYHPRPSTTT-----------------SQAPLYFSPSSLTPRVALALDIYPQ-------------

Query:  ---GLNA--------------------------------------PLQKLDVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSY
           GL A                                       L K +V       RP+      R    ++ RY+K W AK+VAL+LLMG  K SY
Subjt:  ---GLNA--------------------------------------PLQKLDVRSCHLFPRPELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSY

Query:  TLLRKYGEALKAVN
        TLLRKYGEALKAVN
Subjt:  TLLRKYGEALKAVN

A0A6J1DUS4 uncharacterized protein LOC1110236671.4e-5639.41Show/hide
Query:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ
        M RLFVIYGGK NE GT+YEGGV+GGLDVDE ITY +LV+A +  T IDPDQF+++++C+Y+F  +YEVP + IFDD SL+FYLNG  DPSQV LYV++ 
Subjt:  MHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNIVIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQ

Query:  PKETYG-----------------------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLED
        PK +YG                       QN   F  +ISI SPL  V   SF+PQITPL DNV+PCNL +DE  +Y +LYENEV+Y+ +++ EY   E 
Subjt:  PKETYG-----------------------QNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYEIENEIEYGLLED

Query:  DTED-------------ENDQVVTPQDVI--------YHPRPSTTTSQAPLYFSPSSLTPRVALAL----------DIYPQGLNAPLQKL----------
         TED             E D+  T +DV+         H   S T S    Y +   +  R    +          +I  +G+     +L          
Subjt:  DTED-------------ENDQVVTPQDVI--------YHPRPSTTTSQAPLYFSPSSLTPRVALAL----------DIYPQGLNAPLQKL----------

Query:  ------------DVRSC--------HLFPR--------------------PELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL
                    DV  C        H   R                    P+      R    ++ RY+K W AK+VAL+LL+GS K SYTLL KYGEAL
Subjt:  ------------DVRSC--------HLFPR--------------------PELCTGPCR----LSPRYDKIWCAKDVALSLLMGSLKDSYTLLRKYGEAL

Query:  KAVNVG
        K VN G
Subjt:  KAVNVG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAACAGTGTGTTCCCGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTAT
AGGTCGGATTCCAAGTTTAGTTCGAGGCCAGAAATCCTCGTACCTGATCGGGGAAATGCATCGTCTATTTGTAATCTATGGTGGTAAGTCGAATGAGGTAGGAACTGTTT
ATGAAGGTGGGGTCATAGGAGGTTTAGACGTCGACGAAATGATAACGTATGATGACCTAGTAAATGCATTTTACAGGCGTACGATAATTGATCCTGATCAGTTCAATATT
GTTATAGAATGCATATATAAATTTCAAATGCAATATGAAGTCCCTAAGTTCTTTATTTTCGATGATATTAGCCTCAGATTTTATTTGAATGGCACTCTAGACCCTTCTCA
AGTGTCGTTGTATGTATCTATCCAACCGAAAGAAACATATGGTCAGAATGTTGCATCATTCGATCCTTCAATTTCTATCCAGTCCCCGTTACAAGGTGTTTCTGCCCTCT
CCTTTGTCCCACAAATAACCCCCTTGACTGACAACGTCGTCCCATGCAACCTAGGCAATGATGAACCACTACATTATGATGAATTGTATGAGAACGAGGTTGATTATGAG
ATTGAGAATGAGATTGAGTATGGGCTTCTTGAAGATGACACTGAAGACGAGAATGATCAAGTTGTAACACCCCAGGATGTTATATACCATCCTAGGCCTAGCACCACTAC
TTCGCAAGCACCACTCTACTTTAGTCCTTCAAGCCTAACTCCTCGAGTCGCATTGGCCCTTGACATTTATCCCCAAGGCCTCAATGCTCCTTTACAGAAACTTGATGTTC
GGTCTTGCCATTTATTCCCAAGGCCTGAACTGTGCACCGGTCCTTGCCGTTTATCCCCAAGGTATGACAAGATATGGTGCGCAAAAGACGTAGCATTGAGTCTTTTGATG
GGATCACTAAAAGACTCCTATACTCTTTTACGTAAGTATGGAGAGGCTTTGAAAGCTGTGAATGTTGGATTTCTAGAAGTTCAAGGATACTTAGAAGGAATCGGTTTTGA
GAAATGGACACATGCATTTCAACTGGGATTGAGGTACGACCAAATGACTTCTAATGTTGCCGAGTCTGTGAACGCGGTTCTTGTCACGTGCCTTGCCAGTGATTGCACTT
CTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAACAGTGTGTTCCCGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTAT
AGGTCGGATTCCAAGTTTAGTTCGAGGCCAGAAATCCTCGTACCTGATCGGGGAAATGCATCGTCTATTTGTAATCTATGGTGGTAAGTCGAATGAGGTAGGAACTGTTT
ATGAAGGTGGGGTCATAGGAGGTTTAGACGTCGACGAAATGATAACGTATGATGACCTAGTAAATGCATTTTACAGGCGTACGATAATTGATCCTGATCAGTTCAATATT
GTTATAGAATGCATATATAAATTTCAAATGCAATATGAAGTCCCTAAGTTCTTTATTTTCGATGATATTAGCCTCAGATTTTATTTGAATGGCACTCTAGACCCTTCTCA
AGTGTCGTTGTATGTATCTATCCAACCGAAAGAAACATATGGTCAGAATGTTGCATCATTCGATCCTTCAATTTCTATCCAGTCCCCGTTACAAGGTGTTTCTGCCCTCT
CCTTTGTCCCACAAATAACCCCCTTGACTGACAACGTCGTCCCATGCAACCTAGGCAATGATGAACCACTACATTATGATGAATTGTATGAGAACGAGGTTGATTATGAG
ATTGAGAATGAGATTGAGTATGGGCTTCTTGAAGATGACACTGAAGACGAGAATGATCAAGTTGTAACACCCCAGGATGTTATATACCATCCTAGGCCTAGCACCACTAC
TTCGCAAGCACCACTCTACTTTAGTCCTTCAAGCCTAACTCCTCGAGTCGCATTGGCCCTTGACATTTATCCCCAAGGCCTCAATGCTCCTTTACAGAAACTTGATGTTC
GGTCTTGCCATTTATTCCCAAGGCCTGAACTGTGCACCGGTCCTTGCCGTTTATCCCCAAGGTATGACAAGATATGGTGCGCAAAAGACGTAGCATTGAGTCTTTTGATG
GGATCACTAAAAGACTCCTATACTCTTTTACGTAAGTATGGAGAGGCTTTGAAAGCTGTGAATGTTGGATTTCTAGAAGTTCAAGGATACTTAGAAGGAATCGGTTTTGA
GAAATGGACACATGCATTTCAACTGGGATTGAGGTACGACCAAATGACTTCTAATGTTGCCGAGTCTGTGAACGCGGTTCTTGTCACGTGCCTTGCCAGTGATTGCACTT
CTTGA
Protein sequenceShow/hide protein sequence
MHNSVFPIVARTRPPDRPEHLGGPAQKGEHSDDQVSIGRIPSLVRGQKSSYLIGEMHRLFVIYGGKSNEVGTVYEGGVIGGLDVDEMITYDDLVNAFYRRTIIDPDQFNI
VIECIYKFQMQYEVPKFFIFDDISLRFYLNGTLDPSQVSLYVSIQPKETYGQNVASFDPSISIQSPLQGVSALSFVPQITPLTDNVVPCNLGNDEPLHYDELYENEVDYE
IENEIEYGLLEDDTEDENDQVVTPQDVIYHPRPSTTTSQAPLYFSPSSLTPRVALALDIYPQGLNAPLQKLDVRSCHLFPRPELCTGPCRLSPRYDKIWCAKDVALSLLM
GSLKDSYTLLRKYGEALKAVNVGFLEVQGYLEGIGFEKWTHAFQLGLRYDQMTSNVAESVNAVLVTCLASDCTS