CuGenDBv2

Moc08g27760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g27760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DUF659 domain-containing protein
Genome location: chr8:20059382..20060686
RNA-Seq Expression: Moc08g27760
Synteny: Moc08g27760
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
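As a rough illustration of how the genome location string above could be handled programmatically, the sketch below splits `chr8:20059382..20060686` into chromosome, start, and end, and computes the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr8:20059382..20060686")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1305 bp on chr8.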


Homology
GenBank top hits (e-value, % identity, alignment)
KAG5034983.1 hypothetical protein JHK87_009893 [Glycine soja] (e-value 1.4e-62, identity 50.53%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S +WT +K +K+PKGK A + V+MPSFWNSVVYTLK   PLV+VLRLVD   KPAM YIY+AMD+AKE I   FN  E+KY+ ++ I DKRW+CQLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD
        RPLHA  ++LNP  FYDN + +  +   HSKK NRLE KRL+DLV++KY+Q LK+RY  +D++DPI L+ ID  NEWL+  ++++D  +  N+LVF+DDD
Subjt:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD

Query:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD
         L W NV +ASGV E     R Y R K K   T    A T+   +K   VV S     + +++  +E+ ED+  EEN+DL+ +++
Subjt:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD

KAG5066293.1 hypothetical protein JHK86_010024 [Glycine max] (e-value 1.1e-62, identity 50.53%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S +WT +K +K+PKGK A + V+MPSFWNSVVYTLK   PLV+VLRLVDG  KPAM YIY+AMD+AKE I   FN  E+KY+ ++ I DKRW+CQLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD
        RPLHA  ++LNP  FYDN + +  +   HSKK NRLE KRL+DLV++KY+Q LK+RY  +D++DPI L+ ID  NEWL+  ++++D  +  N+LVF+DDD
Subjt:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD

Query:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD
         L W  V +ASGV E     R Y R K K   T    A T+   +K   VV S     + +++  +E+ ED+  EEN+DL+ +++
Subjt:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD

RWR92244.1 hypothetical protein CKAN_02145300 [Cinnamomum micranthum f. kanehirae] (e-value 6.5e-63, identity 45.59%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S++WTTSKWAK+ +GK  A+ ++MPSFWN+VVY LK SGPL+ VLRLVDG  KP M YIY+AMDRAKEAI + F G E +Y  I+EI D+RWD QLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDN--------------------------KERIMQD-------------------PETHSKKGNRLEQKRLNDLVYIKYSQALKE
        RPLHA  Y+LNP  FY N                          +++I Q+                      +SKK NRL+QKRLNDLV++KY++ALK 
Subjt:  RPLHAVRYYLNPSLFYDN--------------------------KERIMQD-------------------PETHSKKGNRLEQKRLNDLVYIKYSQALKE

Query:  RYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDD
        RY ++D LDPI+L +IDESNEWL+G ++ E   E++E VFDDD L WG+VA ASGV E  ++TR  + S+ ++        P  +R    +Q+V  ++ +
Subjt:  RYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDD

Query:  DTEEEELEKEDCEDV-QDEENLDLDEDDD
        +T+EE+++     D  QD + L +D +DD
Subjt:  DTEEEELEKEDCEDV-QDEENLDLDEDDD

XP_022143395.1 uncharacterized protein LOC111013272 [Momordica charantia] (e-value 4.1e-65, identity 91.77%)
Query:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS
        HSKK N+LEQKRLNDLVYIKY+QALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQ ENELVFDDDDLTW +VAEASGVREPIRQTRTYARSKGKS
Subjt:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS

Query:  PATPATPAPTTSRVRKSVQVVVSDDDDDT--EEEELEKEDCEDVQDEENLDLDEDDDV
           PATPAPTTSRVRKSVQVVVSDDDDDT  EEEELEKEDCEDVQDEENLDLDEDD++
Subjt:  PATPATPAPTTSRVRKSVQVVVSDDDDDT--EEEELEKEDCEDVQDEENLDLDEDDDV

XP_022157603.1 uncharacterized protein LOC111024254 [Momordica charantia] (e-value 6.3e-66, identity 91.82%)
Query:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS
        HSKK N LEQKRLNDLVYIKY+QALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWG+VAEASGVREPIRQTRTY RSKGKS
Subjt:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS

Query:  PATPATPAPTTSRVRKSVQVVVSDDDDDT---EEEELEKEDCEDVQDEENLDLDEDDDV
           PATPAPTTSRVRKSVQVVVSDDDDDT   EEEELEKEDCEDVQDEENLDLDEDD++
Subjt:  PATPATPAPTTSRVRKSVQVVVSDDDDDT---EEEELEKEDCEDVQDEENLDLDEDDDV

TrEMBL top hits (e-value, % identity, alignment)
A0A0R0K8D4 Uncharacterized protein (e-value 5.4e-63, identity 50.53%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S +WT +K +K+PKGK A + V+MPSFWNSVVYTLK   PLV+VLRLVDG  KPAM YIY+AMD+AKE I   FN  E+KY+ ++ I DKRW+CQLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD
        RPLHA  ++LNP  FYDN + +  +   HSKK NRLE KRL+DLV++KY+Q LK+RY  +D++DPI L+ ID  NEWL+  ++++D  +  N+LVF+DDD
Subjt:  RPLHAVRYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEE-NELVFDDDD

Query:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD
         L W  V +ASGV E     R Y R K K   T    A T+   +K   VV S     + +++  +E+ ED+  EEN+DL+ +++
Subjt:  -LTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD

A0A443PN90 DUF659 domain-containing protein (e-value 3.1e-63, identity 45.59%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S++WTTSKWAK+ +GK  A+ ++MPSFWN+VVY LK SGPL+ VLRLVDG  KP M YIY+AMDRAKEAI + F G E +Y  I+EI D+RWD QLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDN--------------------------KERIMQD-------------------PETHSKKGNRLEQKRLNDLVYIKYSQALKE
        RPLHA  Y+LNP  FY N                          +++I Q+                      +SKK NRL+QKRLNDLV++KY++ALK 
Subjt:  RPLHAVRYYLNPSLFYDN--------------------------KERIMQD-------------------PETHSKKGNRLEQKRLNDLVYIKYSQALKE

Query:  RYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDD
        RY ++D LDPI+L +IDESNEWL+G ++ E   E++E VFDDD L WG+VA ASGV E  ++TR  + S+ ++        P  +R    +Q+V  ++ +
Subjt:  RYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDD

Query:  DTEEEELEKEDCEDV-QDEENLDLDEDDD
        +T+EE+++     D  QD + L +D +DD
Subjt:  DTEEEELEKEDCEDV-QDEENLDLDEDDD

A0A445M0R9 DUF659 domain-containing protein (e-value 9.1e-63, identity 49%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S +W  SK +K+PKGK A E V+MPSFWN VVY LKA GPLV VLRLVD   KPAM +IY+AMDRAKE I+  F+  E +Y  I  I DKRWDCQLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNK----------------ERIMQDPE-THSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLE
         PLHA  YYLN   FY N                 +++ +D +  HSKK +RLE ++L DLVY+KY+QAL +R+   D +DPI L+ ID+SNEWL+G LE
Subjt:  RPLHAVRYYLNPSLFYDNK----------------ERIMQDPE-THSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLE

Query:  EEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD
         E  +  N+LVFDDDDL W +VAEA+G  EP++ T    RS+       A  APT  + +K V+V   ++D+  EE      + +  Q   +LDL+ED+D
Subjt:  EEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDD

A0A6J1CNP5 uncharacterized protein LOC111013272 (e-value 2.0e-65, identity 91.77%)
Query:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS
        HSKK N+LEQKRLNDLVYIKY+QALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQ ENELVFDDDDLTW +VAEASGVREPIRQTRTYARSKGKS
Subjt:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS

Query:  PATPATPAPTTSRVRKSVQVVVSDDDDDT--EEEELEKEDCEDVQDEENLDLDEDDDV
           PATPAPTTSRVRKSVQVVVSDDDDDT  EEEELEKEDCEDVQDEENLDLDEDD++
Subjt:  PATPATPAPTTSRVRKSVQVVVSDDDDDT--EEEELEKEDCEDVQDEENLDLDEDDDV

A0A6J1DTS8 uncharacterized protein LOC111024254 (e-value 3.0e-66, identity 91.82%)
Query:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS
        HSKK N LEQKRLNDLVYIKY+QALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWG+VAEASGVREPIRQTRTY RSKGKS
Subjt:  HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGVREPIRQTRTYARSKGKS

Query:  PATPATPAPTTSRVRKSVQVVVSDDDDDT---EEEELEKEDCEDVQDEENLDLDEDDDV
           PATPAPTTSRVRKSVQVVVSDDDDDT   EEEELEKEDCEDVQDEENLDLDEDD++
Subjt:  PATPATPAPTTSRVRKSVQVVVSDDDDDT---EEEELEKEDCEDVQDEENLDLDEDDDV

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G13020.1 hAT transposon superfamily protein (e-value 8.8e-18, identity 37.21%)
Query:  ISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG--NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLHR
        ++  + +S W K+ +GK  +  V   SFW +V   LK + PL   LRL     N   + YIY  +D  K +IK  FN  +  Y  +W++ D  W+  LH 
Subjt:  ISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG--NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLHR

Query:  PLHAVRYYLNPSLFYDNKERIMQDPETHS
        PLHA  YYLNP+ FY     +  DPE  S
Subjt:  PLHAVRYYLNPSLFYDNKERIMQDPETHS

AT3G13030.1 hAT transposon superfamily protein (e-value 1.7e-16, identity 36.21%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVD-GNKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        MF S  W   +          +  V   SFW +V   LK + PL+  L L    N   + Y+Y  MD  KE+I   FN     Y+P+W++ D  W+  LH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVD-GNKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFY
         PLHA  Y+LNP+ FY
Subjt:  RPLHAVRYYLNPSLFY

AT4G15020.1 hAT transposon superfamily (e-value 9.8e-17, identity 36.8%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDGNK-PAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        M  S +W    ++++P G      +   +FW +V      + PL+R LR+V   K PAM Y+Y A+ RAK+AIK+     E  Y   W+I D+ W+ Q H
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDGNK-PAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKERIMQD
         PL A  ++LNP LFY+  E I  +
Subjt:  RPLHAVRYYLNPSLFYDNKERIMQD

AT4G15020.2 hAT transposon superfamily (e-value 9.8e-17, identity 36.8%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDGNK-PAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        M  S +W    ++++P G      +   +FW +V      + PL+R LR+V   K PAM Y+Y A+ RAK+AIK+     E  Y   W+I D+ W+ Q H
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDGNK-PAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKERIMQD
         PL A  ++LNP LFY+  E I  +
Subjt:  RPLHAVRYYLNPSLFYDNKERIMQD

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related (e-value 3.2e-44, identity 33.07%)
Query:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH
        M  S +W  SKW K+  G          SFW +V++ LK  GPL++VLR+VDG  KP M YIY AMD+AKE I   F   E  Y+  +EI D+RWD QLH
Subjt:  MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDG-NKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLH

Query:  RPLHAVRYYLNPSLFYDNKE----------------RIMQDPET--------------------------------------------------------
        RPLHA  YYLNP   Y   +                R++   ET                                                        
Subjt:  RPLHAVRYYLNPSLFYDNKE----------------RIMQDPET--------------------------------------------------------

Query:  -----------------HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEEN-ELVFDDDDLTWGNVAEASG
                         H+K+ NRL Q RLND++++KY++AL+ RY   D  DPI L+ ID+ NEWL G +EE     EN +LVF++DDLTW  V EA+G
Subjt:  -----------------HSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEEN-ELVFDDDDLTWGNVAEASG

Query:  VREPIRQTRTYARS----KGKSPATPATPAPTTSRVRK----SVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDE
          +P   TR+ A S    KGK  A+ +     +   RK    S  + + D+D++ E++ L  E  + + D+++ + D+
Subjt:  VREPIRQTRTYARS----KGKSPATPATPAPTTSRVRK----SVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDE
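The alignments above follow the usual BLAST-style layout, where the middle line shows a letter at identical positions, `+` at conservative substitutions, and a space at mismatches or gaps. As a minimal sketch of how a percent identity can be derived from such a middle line, the example below counts identical positions in the short second block of the AT3G13030.1 alignment. The function name `midline_stats` is hypothetical, and the result covers this one block only; the percentage reported for each hit is presumably computed over the full alignment.

```python
def midline_stats(midline, width):
    """Count identities and positives in a BLAST-style middle line.

    Hypothetical helper. `width` is the number of alignment columns,
    taken here from the length of the query line.
    """
    # Pad or trim so that trailing spaces lost in copying do not shift counts.
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)  # letters mark identical residues
    positives = identical + padded.count("+")       # '+' marks conservative changes
    return {
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / width,
    }


query = "RPLHAVRYYLNPSLFY"
midline = " PLHA  Y+LNP+ FY"
stats = midline_stats(midline, len(query))
print(stats)
```

For this 16-column block the sketch counts 10 identical positions, or 62.5% identity within the block.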


Sequences
CDS sequence
ATGTTTATTTCTAAGAAATGGACCACCTCTAAATGGGCGAAGGACCCGAAGGGGAAGGGAGCAGCTGAAACTGTTATGATGCCATCTTTTTGGAATTCAGTTGTG
TATACTTTGAAAGCATCGGGCCCTTTAGTTCGTGTGTTGAGGCTAGTAGATGGTAATAAACCGGCTATGTGTTATATTTACAAAGCTATGGATAGGGCAAAGGAG
GCTATTAAGAGTGGTTTTAATGGAACTGAGGCGAAGTATAGACCAATTTGGGAGATTTATGATAAAAGGTGGGATTGCCAACTTCATCGGCCTTTACATGCAGTC
AGATATTATTTGAATCCATCTCTTTTCTATGACAACAAAGAGAGAATTATGCAAGATCCTGAGACTCATTCCAAAAAGGGGAACAGGTTAGAGCAAAAGCGTCTT
AATGACTTGGTTTATATAAAATATAGCCAAGCACTTAAAGAGCGCTACACTCTACAAGACCAACTTGATCCAATTACTTTGGATCATATTGATGAAAGTAATGAA
TGGTTAATCGGAACACTCGAAGAAGAAGATGGTCAAGAAGAAAATGAGTTGGTCTTTGATGATGACGACCTCACATGGGGAAATGTAGCCGAGGCCAGTGGTGTT
AGAGAGCCCATAAGGCAAACTAGAACATATGCAAGATCTAAAGGAAAGTCGCCCGCTACTCCCGCTACTCCCGCTCCCACAACTTCAAGAGTTCGAAAATCAGTG
CAAGTGGTGGTTAGTGATGATGATGATGACACCGAAGAAGAAGAATTGGAAAAGGAAGATTGTGAGGACGTCCAAGATGAAGAGAATTTAGATTTAGATGAGGAT
GATGATGTTTGA
mRNA sequence
ATGTTTATTTCTAAGAAATGGACCACCTCTAAATGGGCGAAGGACCCGAAGGGGAAGGGAGCAGCTGAAACTGTTATGATGCCATCTTTTTGGAATTCAGTTGTG
TATACTTTGAAAGCATCGGGCCCTTTAGTTCGTGTGTTGAGGCTAGTAGATGGTAATAAACCGGCTATGTGTTATATTTACAAAGCTATGGATAGGGCAAAGGAG
GCTATTAAGAGTGGTTTTAATGGAACTGAGGCGAAGTATAGACCAATTTGGGAGATTTATGATAAAAGGTGGGATTGCCAACTTCATCGGCCTTTACATGCAGTC
AGATATTATTTGAATCCATCTCTTTTCTATGACAACAAAGAGAGAATTATGCAAGATCCTGAGACTCATTCCAAAAAGGGGAACAGGTTAGAGCAAAAGCGTCTT
AATGACTTGGTTTATATAAAATATAGCCAAGCACTTAAAGAGCGCTACACTCTACAAGACCAACTTGATCCAATTACTTTGGATCATATTGATGAAAGTAATGAA
TGGTTAATCGGAACACTCGAAGAAGAAGATGGTCAAGAAGAAAATGAGTTGGTCTTTGATGATGACGACCTCACATGGGGAAATGTAGCCGAGGCCAGTGGTGTT
AGAGAGCCCATAAGGCAAACTAGAACATATGCAAGATCTAAAGGAAAGTCGCCCGCTACTCCCGCTACTCCCGCTCCCACAACTTCAAGAGTTCGAAAATCAGTG
CAAGTGGTGGTTAGTGATGATGATGATGACACCGAAGAAGAAGAATTGGAAAAGGAAGATTGTGAGGACGTCCAAGATGAAGAGAATTTAGATTTAGATGAGGAT
GATGATGTTTGA
Protein sequence
MFISKKWTTSKWAKDPKGKGAAETVMMPSFWNSVVYTLKASGPLVRVLRLVDGNKPAMCYIYKAMDRAKEAIKSGFNGTEAKYRPIWEIYDKRWDCQLHRPLHAV
RYYLNPSLFYDNKERIMQDPETHSKKGNRLEQKRLNDLVYIKYSQALKERYTLQDQLDPITLDHIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGNVAEASGV
REPIRQTRTYARSKGKSPATPATPAPTTSRVRKSVQVVVSDDDDDTEEEELEKEDCEDVQDEENLDLDEDDDV
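One way to check that the CDS and protein sequences above agree is to translate the CDS and compare the result with the protein. The sketch below is an illustration under stated assumptions: `translate` and `CODON_TABLE` are hypothetical names, the standard nuclear genetic code is assumed, and only the first 60 nucleotides and the final 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nt and last 12 nt of the CDS listed above.
cds_start = "ATGTTTATTTCTAAGAAATGGACCACCTCTAAATGGGCGAAGGACCCGAAGGGGAAGGGA"
cds_end = "GATGATGTTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MFISKKWTTSKWAKDPKGKG` and `DDV`, which match the beginning and end of the protein sequence listed above.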