; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G016860 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G016860
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiondual-specificity RNA methyltransferase RlmN
Genome locationchr08:24598595..24601913
RNA-Seq ExpressionLsi08G016860
SyntenyLsi08G016860
Gene Ontology termsGO:0030488 - tRNA methylation (biological process)
GO:0070475 - rRNA base methylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0008173 - RNA methyltransferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domainsIPR007197 - Radical SAM
IPR013785 - Aldolase-type TIM barrel
IPR040072 - Methyltransferase (Class A)


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598204.1 hypothetical protein SDJN03_07982, partial [Cucurbita argyrosperma subsp. sororia]1.1e-16473.14Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLI+FLSS GR FTT S ASPTPLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT +VGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

KAG7029190.1 rlmN, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-17478.04Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLI+FLSS GR FTT S ASPTPLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  ----------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRKRQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIV
                              GLNKDFKKMLI+ AEFRALSLREILPSSDGTRKRQD+ LRFKSSGLC+EL               MGLKRHLTAAEIV
Subjt:  ----------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRKRQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIV

Query:  EQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYK
        EQAVF RRLLT +VGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATTDEVRNWIMPINRKYK
Subjt:  EQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYK

Query:  LGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAAC
        LGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLTV LRLSRGDDQMAAC
Subjt:  LGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAAC

Query:  GQLGKPGTVQAPLLRVPDQFQMAMKLCP
        GQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  GQLGKPGTVQAPLLRVPDQFQMAMKLCP

XP_022961770.1 uncharacterized protein LOC111462441 isoform X1 [Cucurbita moschata]4.7e-16573.36Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLISFLSS GR FTT S ASPTPLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT +VGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

XP_022996535.1 uncharacterized protein LOC111491756 isoform X1 [Cucurbita maxima]1.1e-16473.36Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLISFLSS GR FTT S ASP PLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT EVGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

XP_023546952.1 uncharacterized protein LOC111805899 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-16573.59Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLISFLSS GR FTT S ASPTPLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT EVGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

TrEMBL top hitse value%identityAlignment
A0A0A0L605 Radical_SAM domain-containing protein2.6e-14573.89Show/hide
Query:  CLFLTVREEKEILNSDSDSKMLLK------------------------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK-
        CL   + EEKEILN+D +SKMLLK                                          GLNKDFKKMLI+NAEFRALSLREILPS DGTRK 
Subjt:  CLFLTVREEKEILNSDSDSKMLLK------------------------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK-

Query:  --RQDHRLRFKS---------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVH
            +  L  ++         + +C    +       F  + RMGLKRHLTAAEIVEQAVF RRLLTSEVGLITNVVFMGMGEPLHNIDNV+KA NIMVH
Subjt:  --RQDHRLRFKS---------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVH

Query:  EQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLV
        EQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKH YKVLFEYVMLAGVNDSIEDAKRIVDLV
Subjt:  EQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLV

Query:  QDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLC
        Q IPCKINLISFNPHCGSQFRPTCKEKMI FRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPG VQAPLLRVPD+FQMAMKLC
Subjt:  QDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLC

A0A6J1DXZ7 uncharacterized protein LOC111025565 isoform X21.9e-15169.53Show/hide
Query:  MASSIL--APWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK---------------
        MASSIL  A  HF  LSVPS+RLS P HLISF+ S  RP +T SCASPTPLFA  DSS DF +     +++EIL   +DSK+LLK               
Subjt:  MASSIL--APWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK---------------

Query:  ---------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFN
                                   GLNK FKKMLI+NAEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F 
Subjt:  ---------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFN

Query:  SSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNA
         + RMGLKRHLTAAEIVEQAVF RRLLTSEVG ITNVVFMGMGEPLHNIDNV KAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL EC+CALAVSLNA
Subjt:  SSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNA

Query:  TTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAG
        TTDEVRNWIMPINRKYKLGLLLQTLREEL  KHNYKV FEYVMLAGVND +EDAKR+VDLVQ IPCKINLISFNPH GSQFRPT +EKMIEFRNVLAEAG
Subjt:  TTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAG

Query:  LTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKL
        LTVFLRLSRGDDQMAACGQLGKPGT+QAPLLRVP+QFQMAMKL
Subjt:  LTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKL

A0A6J1DYZ7 uncharacterized protein LOC111025565 isoform X15.9e-15369.35Show/hide
Query:  MASSIL--APWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTV----REEKEILNSDSDSKMLLK-----------
        MASSIL  A  HF  LSVPS+RLS P HLISF+ S  RP +T SCASPTPLFA  DSS DFC F  +     +++EIL   +DSK+LLK           
Subjt:  MASSIL--APWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTV----REEKEILNSDSDSKMLLK-----------

Query:  -------------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQ
                                       GLNK FKKMLI+NAEFRALSLREILPSSDGTRK     +  L  ++         + +C    +     
Subjt:  -------------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQ

Query:  ASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAV
          F  + RMGLKRHLTAAEIVEQAVF RRLLTSEVG ITNVVFMGMGEPLHNIDNV KAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL EC+CALAV
Subjt:  ASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAV

Query:  SLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVL
        SLNATTDEVRNWIMPINRKYKLGLLLQTLREEL  KHNYKV FEYVMLAGVND +EDAKR+VDLVQ IPCKINLISFNPH GSQFRPT +EKMIEFRNVL
Subjt:  SLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVL

Query:  AEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKL
        AEAGLTVFLRLSRGDDQMAACGQLGKPGT+QAPLLRVP+QFQMAMKL
Subjt:  AEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKL

A0A6J1HD79 uncharacterized protein LOC111462441 isoform X12.3e-16573.36Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLISFLSS GR FTT S ASPTPLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT +VGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

A0A6J1K720 uncharacterized protein LOC111491756 isoform X15.1e-16573.36Show/hide
Query:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------
        MASSILAP HF+SLS+PSARL PPLHLISFLSS GR FTT S ASP PLFA  DSSPDF    ++ E+KEIL+  +DSKMLLK                 
Subjt:  MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLK-----------------

Query:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS
                                 GLNKDFKKMLI+ AEFRALSLREILPSSDGTRK     +  L  ++         + +C    +       F  +
Subjt:  -------------------------GLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSS

Query:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT
         RMGLKRHLTAAEIVEQAVF RRLLT EVGLITNVVFMGMGEPLHNIDNV+KAANIMV EQGLHFSPRKVTVSTSGLVPQLKRFL++CNCALAVSLNATT
Subjt:  IRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATT

Query:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT
        DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQ IPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLA AGLT
Subjt:  DEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLT

Query:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP
        V LRLSRGDDQMAACGQLGKPGT+QAPLLRVPDQFQMAMKL P
Subjt:  VFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP

SwissProt top hitse value%identityAlignment
A1B4Z8 Dual-specificity RNA methyltransferase RlmN8.1e-5140.13Show/hide
Query:  LKGLNKDFKKMLIDNAEFRALSLREILP---SSDGTRKRQDHRLRFKS--------------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQA
        +  L KD++ +L +N E   ++L EI+    S+DGTRK   + LR                   LC    +      SF  +    L R+LTA EIV Q 
Subjt:  LKGLNKDFKKMLIDNAEFRALSLREILP---SSDGTRKRQDHRLRFKS--------------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQA

Query:  VFVRRLL---------TSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMP
        +  R  L           E  L++NVV MGMGEPL+N DNV  A  +++  +G+  S R++T+STSG+VP++ +   E  C LAVS +ATTDE R+ ++P
Subjt:  VFVRRLL---------TSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMP

Query:  INRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGD
        +NRK+ +  LL  LRE  R  ++ ++ FEYVML GVNDS EDA+R+V L++ IP K+NLI FN   GS +R +  E++  F +++ +AG    +R  RG+
Subjt:  INRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGD

Query:  DQMAACGQL
        D MAACGQL
Subjt:  DQMAACGQL

Q2Y6F3 Dual-specificity RNA methyltransferase RlmN4.0e-5045.14Show/hide
Query:  LCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLL----TSEVGL---ITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTS
        LC    +      +F S+ R G  R+LT AEI+ Q  +  + L    TSE G    ITN+V MGMGEPL N +NV+ + ++M+ +     S R+VTVSTS
Subjt:  LCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLL----TSEVGL---ITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTS

Query:  GLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHC
        G++P + R    C  ALAVSL+A  D +R+ ++PINRKY +  LL      L+      + FEYVML GVNDS+  A+ +V LV+DIPCK+NLI FNP  
Subjt:  GLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHC

Query:  GSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVP
         S FR +    +  FR+VL EAGL   +R +RGDD  AACGQL   G V     RVP
Subjt:  GSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVP

Q39S71 Dual-specificity RNA methyltransferase RlmN1.9e-5243.65Show/hide
Query:  KEILNSDSDSKMLLKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS--------SGLCHELSILLHWQASFNSSIRMGLKRHLTA
        K +   D+ S   +  L KD ++ L + A    L    +  S DGTRK   R +     +S        + LC    +       F  +    L R+LTA
Subjt:  KEILNSDSDSKMLLKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS--------SGLCHELSILLHWQASFNSSIRMGLKRHLTA

Query:  AEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPIN
         EIV Q   VRR +      + N+VFMGMGEPL N+DNV+KA  I++H+ GL FS R+VTVSTSGLVP+++R   E    LAVSLNATTDEVR+ IMP+N
Subjt:  AEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPIN

Query:  RKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQ
        R+Y L LLL   R         K+  EYVM+ G+NDS+EDAKR+V L+ DI  KINLI FN H G  F+   +  +  F + L     TV  R SRG D 
Subjt:  RKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQ

Query:  MAACGQL
         AACGQL
Subjt:  MAACGQL

Q74E53 Dual-specificity RNA methyltransferase RlmN1.3e-5344.33Show/hide
Query:  LNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS--------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTS
        L K+ ++ L + A    LS   +  S DGTRK   R D     +S        + LC    +       F  +    L R+LT AEIV Q   V+R +  
Subjt:  LNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS--------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTS

Query:  EVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREEL
            + N+VFMGMGEPL N+DNV++A  IM+H+ GL FS R++TVST+GLVP+++R        LAVSLNATTDE+R+ IMPINRKY L +LL   R   
Subjt:  EVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREEL

Query:  RCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQL
              K+  EYV+L GVND+++DAKR+V L+ DIP KINLI FN H G  FR   ++ +  F   L +   TV  R SRG D  AACGQL
Subjt:  RCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQL

Q7NS85 Dual-specificity RNA methyltransferase RlmN3.1e-5041.18Show/hide
Query:  LNKDFKKMLIDNAEFRALSLREILPSSDGTRK----------------RQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVR
        L K  +  L + AE R  SL     SSDGTRK                 +D R       LC    +    + +F S+ R G  R+L+ AEI+ Q  +  
Subjt:  LNKDFKKMLIDNAEFRALSLREILPSSDGTRK----------------RQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVR

Query:  RLL---TSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLL
        + +        +++NVV MGMGEPL N DNV+ A  IM+ + G   S R+VT+STSGLVPQ+ R   EC  ALAVSL+A  D +R+ I+PIN+KY L  L
Subjt:  RLL---TSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLL

Query:  LQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLG
        +   R  L       + FEYVML GVND  E A+++++LV+D+PCK NLI FNP   S +  +    +  FR +L E G  V +R +RGDD  AACGQL 
Subjt:  LQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLG

Query:  KPGTVQ
          G VQ
Subjt:  KPGTVQ

Arabidopsis top hitse value%identityAlignment
AT1G60230.1 Radical SAM superfamily protein4.8e-11567.82Show/hide
Query:  LKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRR
        L+GLNKD K+M+ ++AEF ALS ++I  +SDGTRK     D  L  ++         + +C    +       F  + RMGLKR+LT AEIVEQAV+ RR
Subjt:  LKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRK---RQDHRLRFKS---------SGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRR

Query:  LLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTL
        LL+ EVG ITNVVFMGMGEP HNIDNV+KAANIMV E GLHFSPRKVTVSTSGLVPQLKRFL E NCALAVSLNATTDEVRNWIMPINRKYKL LLL+TL
Subjt:  LLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTL

Query:  REELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGT
        RE L  +H YKVLFEYVMLAGVNDS++DA+R+V+LVQ IPCKINLI FNPH GSQF  T ++KMI+FRNVLAE G TV +R SRG+DQMAACGQLG  G 
Subjt:  REELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCKEKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGT

Query:  VQAPLLRVPDQFQMAMK
        VQAP++RVP+QF+ A+K
Subjt:  VQAPLLRVPDQFQMAMK

AT2G39670.1 Radical SAM superfamily protein1.8e-2933.74Show/hide
Query:  QASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL-NECNCAL
        + SF ++ + G  R+L   EI+EQ + +  +    V   TNVVFMGMGEP+ N+ +VL A   +   + +    R +T+ST G+   +K+   ++    L
Subjt:  QASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL-NECNCAL

Query:  AVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQD--IPCKINLISFNPHCGSQFRPTCKEKMIEF
        AVSL+A    +R  I+P  + Y L  +++  R+  + + N +V FEY +LAGVND +E A  + +L+++      +NLI +NP  GS+++   K+ ++ F
Subjt:  AVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQD--IPCKINLISFNPHCGSQFRPTCKEKMIEF

Query:  RNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPD
           L    +T  +R +RG D  AACGQL +    ++PLL   D
Subjt:  RNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPD

AT2G39670.2 Radical SAM superfamily protein1.8e-2933.74Show/hide
Query:  QASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL-NECNCAL
        + SF ++ + G  R+L   EI+EQ + +  +    V   TNVVFMGMGEP+ N+ +VL A   +   + +    R +T+ST G+   +K+   ++    L
Subjt:  QASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFL-NECNCAL

Query:  AVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQD--IPCKINLISFNPHCGSQFRPTCKEKMIEF
        AVSL+A    +R  I+P  + Y L  +++  R+  + + N +V FEY +LAGVND +E A  + +L+++      +NLI +NP  GS+++   K+ ++ F
Subjt:  AVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQD--IPCKINLISFNPHCGSQFRPTCKEKMIEF

Query:  RNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPD
           L    +T  +R +RG D  AACGQL +    ++PLL   D
Subjt:  RNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPD

AT3G19630.1 Radical SAM superfamily protein6.6e-3233.02Show/hide
Query:  FLTVREEKEILNSDSD---SKMLLKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRKRQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAA
        F T+      L   SD   SK+L+K  N  F + ++   + R L +    P   G R           S LC    +      +F ++  MG K +LT+ 
Subjt:  FLTVREEKEILNSDSD---SKMLLKGLNKDFKKMLIDNAEFRALSLREILPSSDGTRKRQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAA

Query:  EIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNEC-NCALAVSLNATTDEVRNWIMPIN
        EIVEQ V   R+       I N+VFMGMGEPL+N + V++A  +M++ Q    SP+++T+ST G+V  + +  N+    +LAVSL+A   E+R  IMP  
Subjt:  EIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVSTSGLVPQLKRFLNEC-NCALAVSLNATTDEVRNWIMPIN

Query:  RKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNP-HCGSQFRPTCKEKMIEFRNVLAEA-GLTVFLRLSRGD
        R + L  L+  L +  +     K+  EY+ML GVND  + A  + +L++     INLI FNP    SQF  +  + +  F+ +L E   +   +R   G 
Subjt:  RKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNP-HCGSQFRPTCKEKMIEFRNVLAEA-GLTVFLRLSRGD

Query:  DQMAACGQL--------GKPGTVQ
        D   ACGQL          PGTV+
Subjt:  DQMAACGQL--------GKPGTVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCCTCCATCCTTGCCCCTTGGCACTTCCATTCCCTTTCTGTCCCTTCTGCACGCCTATCGCCACCATTGCACCTAATTTCTTTCCTTTCTTCCGTTGGTCGCCC
GTTTACGACTACCTCTTGCGCATCACCTACGCCGTTATTTGCTCCTTACGATTCTTCACCTGATTTCTGTCTGTTCCTGACTGTGCGTGAGGAAAAAGAAATCTTGAACA
GTGATTCCGACTCCAAGATGCTTCTAAAGGGTTTAAACAAAGATTTTAAGAAAATGTTGATTGACAACGCTGAGTTCAGGGCGCTGTCTTTGAGGGAAATTCTCCCTTCA
TCTGATGGAACGAGGAAGAGGCAGGACCACCGTTTGCGTTTCAAGTCAAGTGGGCTGTGCCATGAACTGTCAATTCTGCTACACTGGCAGGCAAGCTTCAATTCTTCAAT
TCGGATGGGTCTGAAGAGACATCTGACTGCTGCTGAGATAGTAGAACAGGCAGTTTTTGTGAGGCGTTTGCTTACTAGTGAAGTGGGTTTAATTACTAATGTCGTGTTTA
TGGGAATGGGAGAGCCGCTTCACAACATTGACAATGTCCTTAAAGCTGCAAATATAATGGTTCATGAACAAGGCCTTCATTTCAGTCCTCGCAAGGTCACTGTTTCAACC
AGTGGACTTGTTCCCCAGCTCAAACGTTTCCTGAATGAATGTAACTGTGCTTTAGCTGTTAGTTTGAATGCAACTACTGATGAGGTTAGGAATTGGATCATGCCAATAAA
CCGGAAGTATAAGTTAGGCTTGCTTCTTCAGACTTTACGTGAGGAACTTCGCTGCAAACACAATTACAAGGTTCTTTTTGAATATGTGATGCTTGCTGGGGTTAATGACA
GCATTGAAGATGCGAAGAGGATCGTTGATCTTGTCCAGGATATTCCATGCAAGATTAACCTTATTTCATTTAATCCACATTGTGGGTCTCAATTTAGACCTACCTGCAAG
GAGAAGATGATTGAGTTTCGAAATGTTTTGGCTGAAGCTGGGTTGACTGTTTTCTTGCGACTAAGTAGAGGTGATGACCAGATGGCTGCCTGTGGTCAGTTAGGCAAACC
TGGTACTGTTCAAGCTCCTTTACTCCGCGTACCAGATCAATTCCAAATGGCAATGAAATTGTGTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCCTCCATCCTTGCCCCTTGGCACTTCCATTCCCTTTCTGTCCCTTCTGCACGCCTATCGCCACCATTGCACCTAATTTCTTTCCTTTCTTCCGTTGGTCGCCC
GTTTACGACTACCTCTTGCGCATCACCTACGCCGTTATTTGCTCCTTACGATTCTTCACCTGATTTCTGTCTGTTCCTGACTGTGCGTGAGGAAAAAGAAATCTTGAACA
GTGATTCCGACTCCAAGATGCTTCTAAAGGGTTTAAACAAAGATTTTAAGAAAATGTTGATTGACAACGCTGAGTTCAGGGCGCTGTCTTTGAGGGAAATTCTCCCTTCA
TCTGATGGAACGAGGAAGAGGCAGGACCACCGTTTGCGTTTCAAGTCAAGTGGGCTGTGCCATGAACTGTCAATTCTGCTACACTGGCAGGCAAGCTTCAATTCTTCAAT
TCGGATGGGTCTGAAGAGACATCTGACTGCTGCTGAGATAGTAGAACAGGCAGTTTTTGTGAGGCGTTTGCTTACTAGTGAAGTGGGTTTAATTACTAATGTCGTGTTTA
TGGGAATGGGAGAGCCGCTTCACAACATTGACAATGTCCTTAAAGCTGCAAATATAATGGTTCATGAACAAGGCCTTCATTTCAGTCCTCGCAAGGTCACTGTTTCAACC
AGTGGACTTGTTCCCCAGCTCAAACGTTTCCTGAATGAATGTAACTGTGCTTTAGCTGTTAGTTTGAATGCAACTACTGATGAGGTTAGGAATTGGATCATGCCAATAAA
CCGGAAGTATAAGTTAGGCTTGCTTCTTCAGACTTTACGTGAGGAACTTCGCTGCAAACACAATTACAAGGTTCTTTTTGAATATGTGATGCTTGCTGGGGTTAATGACA
GCATTGAAGATGCGAAGAGGATCGTTGATCTTGTCCAGGATATTCCATGCAAGATTAACCTTATTTCATTTAATCCACATTGTGGGTCTCAATTTAGACCTACCTGCAAG
GAGAAGATGATTGAGTTTCGAAATGTTTTGGCTGAAGCTGGGTTGACTGTTTTCTTGCGACTAAGTAGAGGTGATGACCAGATGGCTGCCTGTGGTCAGTTAGGCAAACC
TGGTACTGTTCAAGCTCCTTTACTCCGCGTACCAGATCAATTCCAAATGGCAATGAAATTGTGTCCCTAGTTTCTACGACAAGGTACAAAAACATCGACATCACTCCATC
GACGTGGCATAAAATGGTGTGCCTACTTACATGCGATGGTTAGAATTCATGCCCGTCTTCAGTTTTCAAGTCTTCTCAGAGGGAAGCTACGTATTCACCCCAGATCGAGA
GATTCATCCCTCGTGTGTTCCTAAATTTATTGTCATTCTCCAGTTTTGCTTTTCAACTTGTAACCTTACTTTTGGGATAAATTTTCTAATTGGTATATAGAACCAGTTGT
ATTTCAGTGATGTGGGCGTGTGTGGTCGACTAACTTGAACTTAACATGATGCAGTAGTTGAAAAAATAGATACTTTTCTGGGGTCTGTACATTCATTAGAAAACAATACC
TGCTTTATGAACAAATTTAAAGGAGTAACAAGGAGTGGCCGGATTTTGAG
Protein sequenceShow/hide protein sequence
MASSILAPWHFHSLSVPSARLSPPLHLISFLSSVGRPFTTTSCASPTPLFAPYDSSPDFCLFLTVREEKEILNSDSDSKMLLKGLNKDFKKMLIDNAEFRALSLREILPS
SDGTRKRQDHRLRFKSSGLCHELSILLHWQASFNSSIRMGLKRHLTAAEIVEQAVFVRRLLTSEVGLITNVVFMGMGEPLHNIDNVLKAANIMVHEQGLHFSPRKVTVST
SGLVPQLKRFLNECNCALAVSLNATTDEVRNWIMPINRKYKLGLLLQTLREELRCKHNYKVLFEYVMLAGVNDSIEDAKRIVDLVQDIPCKINLISFNPHCGSQFRPTCK
EKMIEFRNVLAEAGLTVFLRLSRGDDQMAACGQLGKPGTVQAPLLRVPDQFQMAMKLCP