; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1397 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1397
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionMethyltransf_11 domain-containing protein
Genome locationMC05:18139249..18140676
RNA-Seq ExpressionMC05g1397
SyntenyMC05g1397
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR013216 - Methyltransferase type 11
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017356.1 hypothetical protein SDJN02_19221, partial [Cucurbita argyrosperma subsp. argyrosperma]2.11e-28681.8Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+LKA+KWQI HG+LARRVV+R FLLALA+STVPLLH+LTG DFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALR M F DVIGVG+HRFFSLRRKQFVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G + S+PNNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF H EP  S +CR+LTRNKPLIPKMEPLVK + VGFD++LAYLPKLV+ASNG+K
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN+DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPYID+EFDFLSWFKETVQHA+
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDE+FLSCRD GVDE+ E+K RDCMDL+KDLR++GVYVHQWFLEAPS +K+
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

XP_022934953.1 uncharacterized protein LOC111441966 [Cucurbita moschata]2.17e-28382.01Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+LKA+KWQI HG+LARRVVVR FLLALA+STVPLLH+LTG DFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALR M F DVIGVG+HRFFSLRRKQFVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G + S+ NNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF H EP  S +CR+LTRNKPLIPKMEPLVK +S+  D++LAYLPKLV+ASNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN+DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPYID+EFDFLSWFKETVQHAD
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDE+FLSCRD GVDE+ E+K RDCMDLFKDLR++GVYVHQWFLEAPS +K+
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

XP_022983363.1 uncharacterized protein LOC111481971 [Cucurbita maxima]5.18e-28782.22Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+L+A+KWQI HG+LARRVV+R FLLALA+STVPL+H+LTGADFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALRDM F DVIGVGQHRFFSLRRK FVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G SDS+PNNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF HLEP  S  CR+LTRNKPLIPKMEPLVK + VGFD++LAYLPKLV+ASNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPY+D+EFDFLSWFKETVQHA+
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDE+FLSC D GVDE+ E+K RDCMDL+KDLR++GVYVHQWFL+APS +KI
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

XP_023528754.1 uncharacterized protein LOC111791594 [Cucurbita pepo subsp. pepo]1.81e-28782.22Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+LKA+KWQI HG+LARRVV+R FLLALA+STVPLLH+LTG DFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALRDM F DVIGVGQHRFFSLRRK FVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G + S+PNNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF HLEP  S +CR+LTRNKPLIPKMEPLVK + VGFD++LAYLPKLV+ASNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN+DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPYID+EFDFLSWFKETVQHA+
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFES VICWVDE+FLSCRD GVDE+ E+K RDCMDLFKDLR++GVYVHQWFL+APS +K+
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

XP_038904424.1 uncharacterized protein LOC120090789 [Benincasa hispida]5.37e-28882.43Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        MNLKA+KWQIVHG LARRVVVRIF LALA+STVPLLH+LTGADFG IPS IFRDC V+S    AK SRGSYLFQGHFLNPIWVPF A+HCE+ MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ LNH+AK LC+GEGS SAVLALRD+ F+DVIGVGQHRFFSLRRKQ VYELDFKD SFDFVFSRD+D+YSVPALLVLEIERVLRPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
          S SMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKL+EF HLEPH+SS+CR+LTRNKPLIPK+EP VK + V FD++L+YLPKLVD SNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGD-DEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIG GKR+N TN DWFP SYP+DRRDFNVY VDHDMSALAT+IH PGVTFVYHPGLAGTDQTT+NDG  D+E EEPYID+EFDFLSWFKETVQH+D
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGD-DEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRD+  +E+ ELK R C+DL+KDLR++GVYVHQWFL+A    KI
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

TrEMBL top hitse value%identityAlignment
A0A0A0LDG5 Methyltransf_11 domain-containing protein4.22e-28380.87Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        MNLKA+KWQI+HG+LARR+VVRIFLLALA+S VPLLH+  GADFG IPS IFRDC V+  D  AKVSRGSY+FQGHFLN IWVPFVA+HCEE+ NLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ LNH+AKSLC+GEGS SAVLALRD+ F+DVIGVGQHRFFSLRRKQFVYELDFK   FDFVFSRD+D+YSVPALLVLEIERVLRPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLE-PHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGE
          S+SMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKK +E+ HLE P LSS+CR+LTRNKPLIPK+EPLVK+R VGFD++L+YLPK VD S+G+
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLE-PHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGE

Query:  KLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDND-GDDEEGEEPYIDNEFDFLSWFKETVQHA
        +L+YVNIGTGKRLN TN DWFP SYP+ RRDFNVY VDHDMS+LATHIH PGVTFVYHP LAGTDQTTD+D   D+E EEPYID+EFDFLSWFKETVQH+
Subjt:  KLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDND-GDDEEGEEPYIDNEFDFLSWFKETVQHA

Query:  DFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESE--LKRDCMDLFKDLRDNGVYVHQWFLEA-PSFIKI
        DFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRD GVDEE     KR+C DL+KDLR++GVYVHQWFL+A PS +KI
Subjt:  DFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESE--LKRDCMDLFKDLRDNGVYVHQWFLEA-PSFIKI

A0A6J1CWW4 uncharacterized protein LOC1110150318.20e-28099.74Show/hide
Query:  MNLTINVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPG
        MNLTINVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPG
Subjt:  MNLTINVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPG

Query:  GIGAVIVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLV
        GIGAVIVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLV
Subjt:  GIGAVIVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLV

Query:  DASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKE
        DASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYID+EFDFLSWFKE
Subjt:  DASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKE

Query:  TVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        TVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
Subjt:  TVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

A0A6J1F958 uncharacterized protein LOC1114419661.05e-28382.01Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+LKA+KWQI HG+LARRVVVR FLLALA+STVPLLH+LTG DFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALR M F DVIGVG+HRFFSLRRKQFVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G + S+ NNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF H EP  S +CR+LTRNKPLIPKMEPLVK +S+  D++LAYLPKLV+ASNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN+DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPYID+EFDFLSWFKETVQHAD
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDE+FLSCRD GVDE+ E+K RDCMDLFKDLR++GVYVHQWFLEAPS +K+
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

A0A6J1H8W6 uncharacterized protein LOC1114611312.34e-28280.42Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSN--DASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI
        M+LK MKWQIVHG+LARRV+VR FL ALA+S VPLLH+LTGA+FGEIP  IFRDC V+ +  +++AK SRGSYLFQGHFLNP+WVPFV +HCEEHMNLT 
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSN--DASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI

Query:  NVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAV
        NVV ELMEK+ LNH  KSLC+GE S+SAVLALRDM F+DV+GVGQ  FFSLRRKQFVY+LDFKDRSFDFVFSRD+D++S PALLVLEIERVLRPGGIGAV
Subjt:  NVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAV

Query:  IVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNG
        IVG + S+PNNLIRAATPVSSLLKTS+VMHV HVNN  LVVFKKKL+EFG LEP  SS+CR+LTRNKPLIPKMEPLVK + VGFD++++YLPKLVDASN 
Subjt:  IVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNG

Query:  EKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGD-DEEGEEPYIDN-EFDFLSWFKETVQ
        EK VYVNIGTGKRL+ TN DWFP SYP+DRRDFNVY VDHDMSALATHIHKPG+TFVYHPGLAGTDQTTDND D + E EEPY+DN +FDFLSWFKETVQ
Subjt:  EKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGD-DEEGEEPYIDN-EFDFLSWFKETVQ

Query:  HADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        H+DFVVLKMDAGKEELKFLSDLFESGVIC VDE+FLSCRD GVDE+ ELKRDCMDLFKDLR++GVYVHQWFL+APS +K+
Subjt:  HADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

A0A6J1J7J6 uncharacterized protein LOC1114819712.51e-28782.22Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV
        M+L+A+KWQI HG+LARRVV+R FLLALA+STVPL+H+LTGADFG     IFRDCVV+S D  A+VSRGSY+FQGHFLNPIW PFVA+HCEE+MNLT NV
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINV

Query:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV
        V ELMEK+ L+HSAKSLC+GEGS SAVLALRDM F DVIGVGQHRFFSLRRK FVYELDFKD+ FDFVFSRD+D+YSVPALLVLEIERV+RPGGIGAVIV
Subjt:  VTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIV

Query:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK
        G SDS+PNNLIRAA PVSSLLK STVMHVGHVNNLTLVVFKKKL+EF HLEP  S  CR+LTRNKPLIPKMEPLVK + VGFD++LAYLPKLV+ASNGEK
Subjt:  GMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEK

Query:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD
        LVYVNIGTGKRLN TN DWFP SYP+DRRDFNVY VD+DMS+LA HIH PGVTFVYHPGLAGTD+TTDNDG DDE+ EEPY+D+EFDFLSWFKETVQHA+
Subjt:  LVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDG-DDEEGEEPYIDNEFDFLSWFKETVQHAD

Query:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI
        FVVLKMDAGKEELKFLSDLFESGVICWVDE+FLSC D GVDE+ E+K RDCMDL+KDLR++GVYVHQWFL+APS +KI
Subjt:  FVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELK-RDCMDLFKDLRDNGVYVHQWFLEAPSFIKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G24480.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein2.3e-1032.89Show/hide
Query:  ELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIVGM
        +L  +  L+  +K LC+G      V AL+ +   D +G+    +  L  K   +   F D +FDF FS   D    P   V EIER LRPGG+  + V +
Subjt:  ELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIVGM

Query:  ---SDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNL---TLVVFKKKLD
           SD    N + +   +  L + S V+HV +V+     T VVF+KK D
Subjt:  ---SDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNL---TLVVFKKKLD

AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1)1.6e-9641.46Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEI-----PSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMN
        M  + +K +++  S  RRV++R  ++  A S V +L  L GA  GE      P  +    V  +       S    LF   FL P+W    +  C++++ 
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEI-----PSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMN

Query:  LTINVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGI
        LT  VV EL     L++ +K+LCIG  S SAVLA+     +DV        F+ + ++F  EL ++D SF FVFS D++  +VPA LV EIER+L+PGG 
Subjt:  LTINVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGI

Query:  GAVIVG-MSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKK------KLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAY
        GA++VG  S S  N L+R+ +PVSSLLK S+V+HV  +    LVVFK+      +LD+  H  P   + C ++  N+P I  +EPL+ ++   F+RR+ Y
Subjt:  GAVIVG-MSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKK------KLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAY

Query:  LPKLVDASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYI-DNEFDF
        LP+ +D S+ ++LVY++IG    +     +WF  SYPIDR+ FN Y V H+ S L +++  PGVTF+YHPGLA T  T  N GD    EEP++ D+ FDF
Subjt:  LPKLVDASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYI-DNEFDF

Query:  LSWFKETVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF
        L+WFKET   ADFVVLKM+    ELKFLS+L ++G IC VDE+FL C             DC  + K LR++GV+VHQW+
Subjt:  LSWFKETVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF

AT5G01710.1 methyltransferases6.0e-2723.06Show/hide
Query:  SLARRVVVRIFLLALAISTVPLLHV--LTGAD-------FGEIPSAI-FRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINVVTE
        S  R ++VR+ L  + I  V   +V  +TG         F  +P  + F      S  ++    R +      +    W+  V  +         ++  +
Subjt:  SLARRVVVRIFLLALAISTVPLLHV--LTGAD-------FGEIPSAI-FRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINVVTE

Query:  LMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRD--VDKYSVPALLVLEIERVLRPGGIGAVIVG
        L+   +L+  +K+LC+       V +LR++   + +G+ +     L  +   + + F+D +FDFVFS    + K         EI R L+P G   V VG
Subjt:  LMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRD--VDKYSVPALLVLEIERVLRPGGIGAVIVG

Query:  MSDSMPNNLIRAATPVSSLLKTSTV----MHVGHVNNLTLVVFKKKLDEFGHLE---PHLSSQCRALTRNKPLIPKMEPLVKQRSV-------GFDRRLA
         +D+   N          L+K   +      + H+    +  +  ++D  GH          +C      + LI   EPL+++  +          + + 
Subjt:  MSDSMPNNLIRAATPVSSLLKTSTV----MHVGHVNNLTLVVFKKKLDEFGHLE---PHLSSQCRALTRNKPLIPKMEPLVKQRSV-------GFDRRLA

Query:  YLPKLVDASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMS-----------------------ALATHI-HKPGVTF---VYHPGL
        Y+P +VD     + VYV++G  +    +   WF   YP   + F+V+ ++ D +                        L+  I H PG          G+
Subjt:  YLPKLVDASNGEKLVYVNIGTGKRLNDTNKDWFPSSYPIDRRDFNVYLVDHDMS-----------------------ALATHI-HKPGVTF---VYHPGL

Query:  AGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKETVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCR---------DDGVDEESELKRDCMD
                +D  D  GE   I   FDF  W K++V+  DFVV+KMD    E   +  L ++G IC +DE+FL C               + ++    C++
Subjt:  AGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKETVQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCR---------DDGVDEESELKRDCMD

Query:  LFKDLRDNGVYVHQWF
        LF  LR  GV VHQW+
Subjt:  LFKDLRDNGVYVHQWF

AT5G03190.1 conserved peptide upstream open reading frame 472.0e-8639.96Show/hide
Query:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRD-CVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI-
        M +K +K  I  GS  R  + R  ++A A+S VPLL +          + +F D   V   D    V  G  LF    + P W     +  E++  + I 
Subjt:  MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRD-CVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI-

Query:  NVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFK-DRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGA
        ++V ELM  + L++ AK LCIG+GS SAV   ++M F+ V GV +H  FS   ++ V EL+   D+SFDFV   DVD  + PALLVLE+ERVL+PGG GA
Subjt:  NVVTELMEKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFK-DRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGA

Query:  VIVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHL--EPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDA
        V+V        N  R    V+S LK S ++ V +++  T++VFK+ + E  +   +  L   C+++  N+P    MEPL++Q+   F + +AYLPK +D 
Subjt:  VIVGMSDSMPNNLIRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHL--EPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDA

Query:  SNGEKLVYVNIGTGKRLN-DTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKET
        S  + LVY++IG  + ++ +   +WF   YP+D + FNVY VDH+ S + +++ KPGVTFVYHP LA  + T       E+ E    D  FDFL+WF+ET
Subjt:  SNGEKLVYVNIGTGKRLN-DTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKET

Query:  VQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF
         ++ADFVVLKM+  + E+KFL+ L E+GVIC+VDE+FL C        S  K DC+++ + LR  GV+VHQW+
Subjt:  VQHADFVVLKMDAGKEELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF

AT5G03190.2 conserved peptide upstream open reading frame 471.2e-8339.96Show/hide
Query:  ARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRD-CVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI-NVVTELMEKQFLNHS
        +R  + R  ++A A+S VPLL +          + +F D   V   D    V  G  LF    + P W     +  E++  + I ++V ELM  + L++ 
Subjt:  ARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRD-CVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTI-NVVTELMEKQFLNHS

Query:  AKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFK-DRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIVGMSDSMPNNLIR
        AK LCIG+GS SAV   ++M F+ V GV +H  FS   ++ V EL+   D+SFDFV   DVD  + PALLVLE+ERVL+PGG GAV+V        N  R
Subjt:  AKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFK-DRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIVGMSDSMPNNLIR

Query:  AATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHL--EPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEKLVYVNIGTGK
            V+S LK S ++ V +++  T++VFK+ + E  +   +  L   C+++  N+P    MEPL++Q+   F + +AYLPK +D S  + LVY++IG  +
Subjt:  AATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHL--EPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEKLVYVNIGTGK

Query:  RLN-DTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKETVQHADFVVLKMDAGK
         ++ +   +WF   YP+D + FNVY VDH+ S + +++ KPGVTFVYHP LA  + T       E+ E    D  FDFL+WF+ET ++ADFVVLKM+  +
Subjt:  RLN-DTNKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKETVQHADFVVLKMDAGK

Query:  EELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF
         E+KFL+ L E+GVIC+VDE+FL C        S  K DC+++ + LR  GV+VHQW+
Subjt:  EELKFLSDLFESGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTGAAGGCTATGAAATGGCAGATCGTTCATGGATCACTCGCGAGGCGTGTGGTTGTTCGAATTTTCTTGCTCGCCTTGGCGATTTCGACCGTTCCTTTG
CTGCATGTATTGACGGGTGCCGATTTCGGAGAAATCCCGTCGGCGATCTTTCGTGATTGTGTTGTGAGGTCGAATGATGCTAGTGCGAAAGTTTCCCGAGGATCT
TACTTGTTTCAGGGTCACTTCCTGAATCCAATTTGGGTGCCGTTTGTAGCGCTGCACTGTGAGGAACATATGAATCTGACGATTAATGTCGTTACCGAGTTAATG
GAGAAGCAGTTCTTGAATCACAGTGCAAAATCTTTATGTATTGGAGAGGGATCTGCCTCCGCCGTATTGGCACTGAGAGACATGGAATTTACCGATGTTATCGGT
GTTGGTCAACACCGATTCTTCTCGCTAAGGAGAAAACAATTTGTTTACGAACTGGACTTTAAGGATCGATCATTTGATTTTGTTTTCTCTAGAGATGTAGACAAA
TATTCAGTGCCTGCACTTCTGGTGCTTGAGATCGAGCGTGTGCTTAGACCTGGCGGAATTGGGGCTGTCATTGTGGGAATGAGTGATTCAATGCCCAATAATTTG
ATTAGAGCTGCAACCCCAGTGTCATCTTTACTGAAAACTTCCACTGTGATGCACGTCGGCCATGTTAATAACTTAACTCTGGTTGTTTTCAAGAAGAAACTCGAC
GAGTTTGGGCATTTGGAGCCTCACTTGTCGTCTCAATGCCGCGCTCTCACAAGAAACAAACCTTTAATTCCAAAAATGGAGCCTCTTGTGAAGCAAAGATCTGTG
GGATTTGACAGAAGGCTGGCTTATTTGCCAAAGCTTGTGGATGCTTCCAATGGGGAAAAGTTGGTCTATGTTAATATTGGTACAGGGAAGCGCCTGAATGACACC
AACAAAGATTGGTTTCCATCTTCTTATCCCATAGATCGCAGAGATTTTAATGTTTATCTTGTTGATCATGACATGTCTGCTCTCGCCACCCACATACATAAGCCA
GGAGTCACGTTTGTTTATCATCCTGGCCTGGCTGGAACTGATCAAACTACAGACAACGACGGCGATGACGAAGAAGGAGAAGAACCGTACATCGACAACGAGTTC
GATTTTCTGTCTTGGTTCAAAGAAACCGTGCAGCATGCTGATTTCGTCGTCTTGAAGATGGATGCAGGGAAGGAGGAACTGAAGTTCCTATCAGATTTGTTCGAA
AGCGGAGTAATTTGCTGGGTGGACGAGGTGTTTCTGAGCTGCAGAGATGATGGGGTTGATGAAGAGAGTGAGCTGAAGAGAGACTGTATGGATCTGTTCAAGGAT
TTGCGAGACAATGGTGTCTATGTCCATCAATGGTTTCTGGAAGCTCCTTCCTTCATCAAAATA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTGAAGGCTATGAAATGGCAGATCGTTCATGGATCACTCGCGAGGCGTGTGGTTGTTCGAATTTTCTTGCTCGCCTTGGCGATTTCGACCGTTCCTTTG
CTGCATGTATTGACGGGTGCCGATTTCGGAGAAATCCCGTCGGCGATCTTTCGTGATTGTGTTGTGAGGTCGAATGATGCTAGTGCGAAAGTTTCCCGAGGATCT
TACTTGTTTCAGGGTCACTTCCTGAATCCAATTTGGGTGCCGTTTGTAGCGCTGCACTGTGAGGAACATATGAATCTGACGATTAATGTCGTTACCGAGTTAATG
GAGAAGCAGTTCTTGAATCACAGTGCAAAATCTTTATGTATTGGAGAGGGATCTGCCTCCGCCGTATTGGCACTGAGAGACATGGAATTTACCGATGTTATCGGT
GTTGGTCAACACCGATTCTTCTCGCTAAGGAGAAAACAATTTGTTTACGAACTGGACTTTAAGGATCGATCATTTGATTTTGTTTTCTCTAGAGATGTAGACAAA
TATTCAGTGCCTGCACTTCTGGTGCTTGAGATCGAGCGTGTGCTTAGACCTGGCGGAATTGGGGCTGTCATTGTGGGAATGAGTGATTCAATGCCCAATAATTTG
ATTAGAGCTGCAACCCCAGTGTCATCTTTACTGAAAACTTCCACTGTGATGCACGTCGGCCATGTTAATAACTTAACTCTGGTTGTTTTCAAGAAGAAACTCGAC
GAGTTTGGGCATTTGGAGCCTCACTTGTCGTCTCAATGCCGCGCTCTCACAAGAAACAAACCTTTAATTCCAAAAATGGAGCCTCTTGTGAAGCAAAGATCTGTG
GGATTTGACAGAAGGCTGGCTTATTTGCCAAAGCTTGTGGATGCTTCCAATGGGGAAAAGTTGGTCTATGTTAATATTGGTACAGGGAAGCGCCTGAATGACACC
AACAAAGATTGGTTTCCATCTTCTTATCCCATAGATCGCAGAGATTTTAATGTTTATCTTGTTGATCATGACATGTCTGCTCTCGCCACCCACATACATAAGCCA
GGAGTCACGTTTGTTTATCATCCTGGCCTGGCTGGAACTGATCAAACTACAGACAACGACGGCGATGACGAAGAAGGAGAAGAACCGTACATCGACAACGAGTTC
GATTTTCTGTCTTGGTTCAAAGAAACCGTGCAGCATGCTGATTTCGTCGTCTTGAAGATGGATGCAGGGAAGGAGGAACTGAAGTTCCTATCAGATTTGTTCGAA
AGCGGAGTAATTTGCTGGGTGGACGAGGTGTTTCTGAGCTGCAGAGATGATGGGGTTGATGAAGAGAGTGAGCTGAAGAGAGACTGTATGGATCTGTTCAAGGAT
TTGCGAGACAATGGTGTCTATGTCCATCAATGGTTTCTGGAAGCTCCTTCCTTCATCAAAATA
Protein sequenceShow/hide protein sequence
MNLKAMKWQIVHGSLARRVVVRIFLLALAISTVPLLHVLTGADFGEIPSAIFRDCVVRSNDASAKVSRGSYLFQGHFLNPIWVPFVALHCEEHMNLTINVVTELM
EKQFLNHSAKSLCIGEGSASAVLALRDMEFTDVIGVGQHRFFSLRRKQFVYELDFKDRSFDFVFSRDVDKYSVPALLVLEIERVLRPGGIGAVIVGMSDSMPNNL
IRAATPVSSLLKTSTVMHVGHVNNLTLVVFKKKLDEFGHLEPHLSSQCRALTRNKPLIPKMEPLVKQRSVGFDRRLAYLPKLVDASNGEKLVYVNIGTGKRLNDT
NKDWFPSSYPIDRRDFNVYLVDHDMSALATHIHKPGVTFVYHPGLAGTDQTTDNDGDDEEGEEPYIDNEFDFLSWFKETVQHADFVVLKMDAGKEELKFLSDLFE
SGVICWVDEVFLSCRDDGVDEESELKRDCMDLFKDLRDNGVYVHQWFLEAPSFIKI