; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0002889 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0002889
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionNUFIP1 domain-containing protein
Genome locationchr10:1362409..1367218
RNA-Seq ExpressionPI0002889
SyntenyPI0002889
Gene Ontology termsGO:0000492 - box C/D snoRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140922.1 uncharacterized protein LOC101213190 [Cucumis sativus]2.6e-23577.8Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSL NSGNGFQNQA FCNPNP  NN HGNPVPTMPPPMFQPGLMMNLQNPLMGLPNN LGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNM QVP +TSYGGPNQQAVPMPFQNPGFST Q FGVNQGM PVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNGSNSI NNSAHRNFMRNSKKGFQKNQTHH+KNEKKKF FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP
        QK KGFHNERRNKF G N TDQ                                         QTN TLVDKEA+LLRQELKEILAKQAELGVEVAEIPP
Subjt:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP

Query:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN
        EYLSYSEKHDNRKQRG  STLGEEA+ ASI KE SQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALRF  MN
Subjt:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN

Query:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        SFFKEWPNKPLKFPSV VKENEGETNVVDETSLSTGNFNLQETNNNSLVEN+GSHDID+DNEND +D+N DEKLKGDGIQVLE+EGEIID
Subjt:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

XP_008456637.1 PREDICTED: uncharacterized protein LOC103496534 isoform X2 [Cucumis melo]3.1e-24178.98Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSLANSGNGFQNQA FCNPNPH NN  GNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQAVPMPFQNPGFST QPFGVNQGMHPVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP
        QK KGFHNERRNKF G NSTDQ                                         QTN TLV++EAQLLRQELKEILAKQAELGVEVAEIPP
Subjt:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP

Query:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN
        EYLSYSEKHDNRKQRG PSTLGEEA GASI KEKSQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALRFM MN
Subjt:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN

Query:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        SFFKEWPNKPLKFPSV VKENEGETNVVDET LSTGNFNLQETNNNSLVENNG+HDI++DNEND ED++NDEKLKGDG QVLE+EGEIID
Subjt:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

XP_016902016.1 PREDICTED: uncharacterized protein LOC103496534 isoform X1 [Cucumis melo]1.2e-24078.32Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSLANSGNGFQNQA FCNPNPH NN  GNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQAVPMPFQNPGFST QPFGVNQGMHPVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ----------------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEV
        QK KGFHNERRNKF G NSTDQ                                              QTN TLV++EAQLLRQELKEILAKQAELGVEV
Subjt:  QKGKGFHNERRNKFGGANSTDQ----------------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEV

Query:  AEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALR
        AEIPPEYLSYSEKHDNRKQRG PSTLGEEA GASI KEKSQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALR
Subjt:  AEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALR

Query:  FMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        FM MNSFFKEWPNKPLKFPSV VKENEGETNVVDET LSTGNFNLQETNNNSLVENNG+HDI++DNEND ED++NDEKLKGDG QVLE+EGEIID
Subjt:  FMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

XP_023550423.1 uncharacterized protein LOC111808573 [Cucurbita pepo subsp. pepo]1.3e-19967.67Show/hide
Query:  QFLPNSSLANSGNGF--------QNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS-------------
        Q LPNSSLANS NG         QNQA FCNPN HLNN HGNPVP MPPPMFQPGLMMNLQNPLM LPNNPLGASPFA  HMGFANS             
Subjt:  QFLPNSSLANSGNGF--------QNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS-------------

Query:  --------------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNM-SQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHP
                      +QFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPG M+NM SQVP H SYG PNQQAVPMPFQNP  ST+QPFGVNQ MHP
Subjt:  --------------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNM-SQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHP

Query:  VNQNPQNFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNE
        VNQNPQNFIPQAMG               GNSTMP NS TQPQQARNLQS AF G+QGNSSISDGGNGSNS  NN AHRNF RNS KGFQK+Q HHMKNE
Subjt:  VNQNPQNFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNE

Query:  KKKFEFPGGQKGKGFHNERRNKFGGANS-----------------------------------------TDQQTNYTLVDKEAQLLRQELKEILAKQAEL
        KKKF  PGG KGKGFHNERRNKFGGANS                                         T++Q++ T+VDKEAQLLRQELKEILAKQAEL
Subjt:  KKKFEFPGGQKGKGFHNERRNKFGGANS-----------------------------------------TDQQTNYTLVDKEAQLLRQELKEILAKQAEL

Query:  GVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLL
        GVEVAEIP EYLSYSEK D+ K+ GD ST+GEEA+GAS GKEK++NR NKR R +KKN  RKK K +KHLSN+ PLKKREPTL QKLL+ADV++DK  +L
Subjt:  GVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLL

Query:  QALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVL-EKEGEIID
        QALRFM MNSFF EWPNKPL FPSV+VKE+  E  VVDE SLSTG+FNLQET N+S+VEN GSH I +DNE+D+++++NDEK KGDGI +L E+EGEIID
Subjt:  QALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVL-EKEGEIID

XP_038885674.1 uncharacterized protein LOC120075982 [Benincasa hispida]2.9e-22674.2Show/hide
Query:  GELQFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS------------------
        G  Q LPN+SLANSGNGFQNQA FCNPNPHLNN HGNPVP MPPPMFQPGLMMNLQNPLMGLPNNPL ASPFA  H+GFANS                  
Subjt:  GELQFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS------------------

Query:  ---------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQ
                 +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQA+PMPFQNPGFST+QPFGVNQ MHPVNQNPQ
Subjt:  ---------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQ

Query:  NFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEF
        NF PQAMG               GNSTM +NSSTQPQQARNLQS AF G+QGNSSISDGGNGSNS  NN AHRNF RNSKKGFQKNQ HHMKNEKKKF F
Subjt:  NFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEF

Query:  PGGQKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAE
        PGGQKGKGFHNERRNKFG ANSTDQ                                         QT+ TLVDKEAQLLR+ELKEILAKQAELGVEVAE
Subjt:  PGGQKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAE

Query:  IPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFM
        IPPEYLSYSEKH+NRK R DPSTLGEE KGAS+GKEKS+NR NKRGR +KKN  RKKGK EKHLSN+P LKKREPTLLQKLL+ADVR++K QLLQALRFM
Subjt:  IPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFM

Query:  AMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
         MNSFFKEWPNKPLKFPSVMVKEN  E N+VDE SLS GNFNLQET NNS+VENNGSH+ID+DNEND++D +N+EK KGD IQVLE+EGEIID
Subjt:  AMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

TrEMBL top hitse value%identityAlignment
A0A0A0KE87 NUFIP1 domain-containing protein1.2e-23577.8Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSL NSGNGFQNQA FCNPNP  NN HGNPVPTMPPPMFQPGLMMNLQNPLMGLPNN LGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNM QVP +TSYGGPNQQAVPMPFQNPGFST Q FGVNQGM PVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNGSNSI NNSAHRNFMRNSKKGFQKNQTHH+KNEKKKF FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP
        QK KGFHNERRNKF G N TDQ                                         QTN TLVDKEA+LLRQELKEILAKQAELGVEVAEIPP
Subjt:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP

Query:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN
        EYLSYSEKHDNRKQRG  STLGEEA+ ASI KE SQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALRF  MN
Subjt:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN

Query:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        SFFKEWPNKPLKFPSV VKENEGETNVVDETSLSTGNFNLQETNNNSLVEN+GSHDID+DNEND +D+N DEKLKGDGIQVLE+EGEIID
Subjt:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

A0A1S3C3B2 uncharacterized protein LOC103496534 isoform X21.5e-24178.98Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSLANSGNGFQNQA FCNPNPH NN  GNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQAVPMPFQNPGFST QPFGVNQGMHPVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP
        QK KGFHNERRNKF G NSTDQ                                         QTN TLV++EAQLLRQELKEILAKQAELGVEVAEIPP
Subjt:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP

Query:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN
        EYLSYSEKHDNRKQRG PSTLGEEA GASI KEKSQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALRFM MN
Subjt:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN

Query:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        SFFKEWPNKPLKFPSV VKENEGETNVVDET LSTGNFNLQETNNNSLVENNG+HDI++DNEND ED++NDEKLKGDG QVLE+EGEIID
Subjt:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

A0A1S4E1B3 uncharacterized protein LOC103496534 isoform X15.8e-24178.32Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSLANSGNGFQNQA FCNPNPH NN  GNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQAVPMPFQNPGFST QPFGVNQGMHPVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ----------------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEV
        QK KGFHNERRNKF G NSTDQ                                              QTN TLV++EAQLLRQELKEILAKQAELGVEV
Subjt:  QKGKGFHNERRNKFGGANSTDQ----------------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEV

Query:  AEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALR
        AEIPPEYLSYSEKHDNRKQRG PSTLGEEA GASI KEKSQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALR
Subjt:  AEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALR

Query:  FMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        FM MNSFFKEWPNKPLKFPSV VKENEGETNVVDET LSTGNFNLQETNNNSLVENNG+HDI++DNEND ED++NDEKLKGDG QVLE+EGEIID
Subjt:  FMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

A0A5A7SKM0 Putative basic-leucine zipper transcription factor F isoform X11.5e-24178.98Show/hide
Query:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------
        Q +PNSSLANSGNGFQNQA FCNPNPH NN  GNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFA  HMGFANS                     
Subjt:  QFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS---------------------

Query:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI
              +QFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPG MMNMSQVP HTSYGGPNQQAVPMPFQNPGFST QPFGVNQGMHPVNQNPQNFI
Subjt:  ------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFI

Query:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG
        PQAMG               GNSTMPINSSTQPQQARNLQS AFAGTQGNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGG
Subjt:  PQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGG

Query:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP
        QK KGFHNERRNKF G NSTDQ                                         QTN TLV++EAQLLRQELKEILAKQAELGVEVAEIPP
Subjt:  QKGKGFHNERRNKFGGANSTDQ-----------------------------------------QTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPP

Query:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN
        EYLSYSEKHDNRKQRG PSTLGEEA GASI KEKSQNRLNKRGR KKKN PRKKGKFEKHLSN+PPLKKREPTLLQKLLKADVRKDK QLLQALRFM MN
Subjt:  EYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMN

Query:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID
        SFFKEWPNKPLKFPSV VKENEGETNVVDET LSTGNFNLQETNNNSLVENNG+HDI++DNEND ED++NDEKLKGDG QVLE+EGEIID
Subjt:  SFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID

A0A6J1FNJ1 uncharacterized protein LOC1114455226.1e-19867.33Show/hide
Query:  QFLPNSSLANSGNGF--------QNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS-------------
        Q LPNSSLANS NG         QNQA FCNPN HLNN HGNPVP MPPPMFQPGLMMNLQNPLM LPNNPLGASPFA  HMGFANS             
Subjt:  QFLPNSSLANSGNGF--------QNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANS-------------

Query:  --------------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNM-SQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHP
                      +QFFGQNMPNLVQQL QNMGL NGQFCLPFQNMNQHVIPG M+NM SQVP H SYG PNQQAVPMPFQNP  ST+QPFGVNQ MHP
Subjt:  --------------SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQHVIPGHMMNM-SQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHP

Query:  VNQNPQNFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNE
        VNQNPQNFIPQAMG               GNSTMP NS TQPQQARNLQS AF G+QGNSSISDGGNGSNS  NN AHRNF RNS KGFQK+Q HHMKNE
Subjt:  VNQNPQNFIPQAMG---------------GNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNE

Query:  KKKFEFPGGQKGKGFHNERRNKFGGANS-----------------------------------------TDQQTNYTLVDKEAQLLRQELKEILAKQAEL
        KKKF  PGG KGKGFHNERRNKFG ANS                                         T++QT+ TLVDKEAQLLRQELKEILAKQAEL
Subjt:  KKKFEFPGGQKGKGFHNERRNKFGGANS-----------------------------------------TDQQTNYTLVDKEAQLLRQELKEILAKQAEL

Query:  GVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLL
        GVEVAEIP EYLSYSEK D+ K+ GD ST+GEEA+GAS GKEK++NR NKR R +KKN  RKK K +KHLSN+ PLKKREPTL QKLL+ADV++DK  +L
Subjt:  GVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLL

Query:  QALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVL-EKEGEIID
        QALRFM MNSFF EWPNKPL FPSV+VKE+  E  VVDE SLSTG+FNLQET N+S+VEN GS  I +DNE+D+++++N+EK +GDGI +L E+EGEIID
Subjt:  QALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVL-EKEGEIID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G18440.1 CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).5.7e-2330.24Show/hide
Query:  FCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMG----LPNNPLGASPFALRHMGFANS--SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQ
        F N     +N + NP+      M  P +M    NP+MG    +PN P+        H  F N+   Q     MPN + QL  N+ L N QF +   N+  
Subjt:  FCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMG----LPNNPLGASPFALRHMGFANS--SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQ

Query:  HVIPGHMM------NMSQVPFHTSYGGPNQQAVPMPFQN-----PGFSTVQPFGVN----QGMHPVNQNPQNFIPQAMGGNSTMPINSSTQPQQARNLQS
        H +P            S  P   S+       VP P QN     PGFS  +P G +     G  P   + +N  P+    N   P     +PQ       
Subjt:  HVIPGHMM------NMSQVPFHTSYGGPNQQAVPMPFQN-----PGFSTVQPFGVN----QGMHPVNQNPQNFIPQAMGGNSTMPINSSTQPQQARNLQS

Query:  HAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNS--KKGFQKNQTHHMKNEKKK---FEFPGGQKGKGFHNERRN---KFGGANSTDQQTNYTLVDKEA
                   +    NG      N  HR    N+  K G   + T ++  EKK+     +   +  +     R+N   KF       +  + +++D+EA
Subjt:  HAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNS--KKGFQKNQTHHMKNEKKK---FEFPGGQKGKGFHNERRN---KFGGANSTDQQTNYTLVDKEA

Query:  QLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEK--SQNRLNKRGRFKKKNHPRKKGKFE-KHLSNQPPLKKRE
        ++ RQ+L+E+LAKQAELGVEVAE+P  YLS +++  N    GD        +G + G++     NR NKR   +K     KK + E K  S    +  R+
Subjt:  QLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEK--SQNRLNKRGRFKKKNHPRKKGKFE-KHLSNQPPLKKRE

Query:  PTLLQKLLKADVRKDKRQLLQALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNED
        PTLL+KLL AD+++DK QLLQ  RFM MNS  KE+P +PLK P + VKE  G  + +++ S+      L +             D+D D+ + +ED
Subjt:  PTLLQKLLKADVRKDKRQLLQALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNED

AT5G18440.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).5.7e-2330.24Show/hide
Query:  FCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMG----LPNNPLGASPFALRHMGFANS--SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQ
        F N     +N + NP+      M  P +M    NP+MG    +PN P+        H  F N+   Q     MPN + QL  N+ L N QF +   N+  
Subjt:  FCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMG----LPNNPLGASPFALRHMGFANS--SQFFGQNMPNLVQQLGQNMGLNNGQFCLPFQNMNQ

Query:  HVIPGHMM------NMSQVPFHTSYGGPNQQAVPMPFQN-----PGFSTVQPFGVN----QGMHPVNQNPQNFIPQAMGGNSTMPINSSTQPQQARNLQS
        H +P            S  P   S+       VP P QN     PGFS  +P G +     G  P   + +N  P+    N   P     +PQ       
Subjt:  HVIPGHMM------NMSQVPFHTSYGGPNQQAVPMPFQN-----PGFSTVQPFGVN----QGMHPVNQNPQNFIPQAMGGNSTMPINSSTQPQQARNLQS

Query:  HAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNS--KKGFQKNQTHHMKNEKKK---FEFPGGQKGKGFHNERRN---KFGGANSTDQQTNYTLVDKEA
                   +    NG      N  HR    N+  K G   + T ++  EKK+     +   +  +     R+N   KF       +  + +++D+EA
Subjt:  HAFAGTQGNSSISDGGNGSNSILNNSAHRNFMRNS--KKGFQKNQTHHMKNEKKK---FEFPGGQKGKGFHNERRN---KFGGANSTDQQTNYTLVDKEA

Query:  QLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEK--SQNRLNKRGRFKKKNHPRKKGKFE-KHLSNQPPLKKRE
        ++ RQ+L+E+LAKQAELGVEVAE+P  YLS +++  N    GD        +G + G++     NR NKR   +K     KK + E K  S    +  R+
Subjt:  QLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKHDNRKQRGDPSTLGEEAKGASIGKEK--SQNRLNKRGRFKKKNHPRKKGKFE-KHLSNQPPLKKRE

Query:  PTLLQKLLKADVRKDKRQLLQALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNED
        PTLL+KLL AD+++DK QLLQ  RFM MNS  KE+P +PLK P + VKE  G  + +++ S+      L +             D+D D+ + +ED
Subjt:  PTLLQKLLKADVRKDKRQLLQALRFMAMNSFFKEWPNKPLKFPSVMVKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCAAGGTGAATTGCAATTTTTACCCAATTCATCTCTCGCCAATTCTGGCAATGGGTTTCAAAATCAGGCCTCCTTCTGCAATCCAAACCCCCACTTGAACAATTT
CCATGGAAACCCTGTTCCCACCATGCCACCTCCCATGTTTCAGCCGGGATTGATGATGAATTTGCAAAACCCTCTCATGGGGTTGCCTAATAATCCTCTTGGTGCTTCCC
CTTTTGCTCTTCGGCATATGGGTTTTGCAAATTCTTCCCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGGGTCAGAATATGGGTTTAAATAATGGGCAG
TTTTGTTTGCCGTTTCAAAATATGAATCAACATGTAATTCCTGGACATATGATGAATATGTCTCAAGTTCCTTTTCATACTTCATATGGTGGTCCAAATCAACAAGCTGT
TCCAATGCCTTTTCAGAATCCTGGCTTCTCTACGGTCCAGCCTTTTGGTGTCAACCAGGGAATGCACCCTGTTAACCAGAATCCCCAAAACTTCATTCCACAAGCAATGG
GCGGGAATTCGACCATGCCAATTAACTCTTCCACTCAACCACAACAAGCTAGGAACCTGCAGTCACATGCTTTCGCTGGGACACAGGGTAATTCTTCAATAAGTGATGGT
GGAAATGGATCAAATTCAATTTTGAATAATTCAGCTCACAGGAATTTCATGAGGAATTCAAAAAAAGGATTTCAGAAGAATCAAACTCATCATATGAAAAATGAGAAGAA
AAAGTTTGAGTTTCCTGGCGGACAGAAAGGGAAAGGTTTTCACAACGAGAGGAGGAACAAATTTGGTGGCGCTAACTCCACAGATCAACAAACCAACTACACATTGGTGG
ATAAGGAGGCTCAGCTTTTGCGACAGGAACTCAAAGAGATTTTAGCAAAGCAGGCTGAATTAGGAGTGGAAGTTGCAGAAATACCACCAGAGTATCTCTCATATTCAGAG
AAACATGACAATCGAAAACAACGTGGAGATCCATCGACATTAGGAGAGGAAGCCAAAGGAGCCTCAATAGGGAAAGAAAAATCTCAAAACAGGTTAAACAAGAGGGGAAG
ATTCAAGAAAAAGAATCACCCGAGAAAGAAGGGAAAATTTGAGAAGCATTTGTCAAACCAGCCGCCATTAAAGAAAAGAGAACCAACATTATTGCAGAAGCTCTTGAAAG
CAGATGTGAGGAAAGACAAAAGGCAGTTGTTACAAGCTTTGAGATTCATGGCGATGAATTCTTTCTTCAAAGAATGGCCCAATAAACCTTTGAAGTTTCCTTCAGTCATG
GTCAAAGAAAATGAAGGTGAGACCAATGTGGTTGATGAGACCTCTTTGTCTACTGGAAATTTCAATCTCCAAGAGACCAACAACAATTCATTGGTTGAAAACAATGGTAG
TCATGACATCGACAACGATAACGAAAATGACAATGAAGACAACAACAACGACGAGAAGCTCAAAGGAGATGGAATACAGGTACTTGAAAAGGAAGGAGAAATTATTGACT
AA
mRNA sequenceShow/hide mRNA sequence
ATGCACCAAGGTGAATTGCAATTTTTACCCAATTCATCTCTCGCCAATTCTGGCAATGGGTTTCAAAATCAGGCCTCCTTCTGCAATCCAAACCCCCACTTGAACAATTT
CCATGGAAACCCTGTTCCCACCATGCCACCTCCCATGTTTCAGCCGGGATTGATGATGAATTTGCAAAACCCTCTCATGGGGTTGCCTAATAATCCTCTTGGTGCTTCCC
CTTTTGCTCTTCGGCATATGGGTTTTGCAAATTCTTCCCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGGGTCAGAATATGGGTTTAAATAATGGGCAG
TTTTGTTTGCCGTTTCAAAATATGAATCAACATGTAATTCCTGGACATATGATGAATATGTCTCAAGTTCCTTTTCATACTTCATATGGTGGTCCAAATCAACAAGCTGT
TCCAATGCCTTTTCAGAATCCTGGCTTCTCTACGGTCCAGCCTTTTGGTGTCAACCAGGGAATGCACCCTGTTAACCAGAATCCCCAAAACTTCATTCCACAAGCAATGG
GCGGGAATTCGACCATGCCAATTAACTCTTCCACTCAACCACAACAAGCTAGGAACCTGCAGTCACATGCTTTCGCTGGGACACAGGGTAATTCTTCAATAAGTGATGGT
GGAAATGGATCAAATTCAATTTTGAATAATTCAGCTCACAGGAATTTCATGAGGAATTCAAAAAAAGGATTTCAGAAGAATCAAACTCATCATATGAAAAATGAGAAGAA
AAAGTTTGAGTTTCCTGGCGGACAGAAAGGGAAAGGTTTTCACAACGAGAGGAGGAACAAATTTGGTGGCGCTAACTCCACAGATCAACAAACCAACTACACATTGGTGG
ATAAGGAGGCTCAGCTTTTGCGACAGGAACTCAAAGAGATTTTAGCAAAGCAGGCTGAATTAGGAGTGGAAGTTGCAGAAATACCACCAGAGTATCTCTCATATTCAGAG
AAACATGACAATCGAAAACAACGTGGAGATCCATCGACATTAGGAGAGGAAGCCAAAGGAGCCTCAATAGGGAAAGAAAAATCTCAAAACAGGTTAAACAAGAGGGGAAG
ATTCAAGAAAAAGAATCACCCGAGAAAGAAGGGAAAATTTGAGAAGCATTTGTCAAACCAGCCGCCATTAAAGAAAAGAGAACCAACATTATTGCAGAAGCTCTTGAAAG
CAGATGTGAGGAAAGACAAAAGGCAGTTGTTACAAGCTTTGAGATTCATGGCGATGAATTCTTTCTTCAAAGAATGGCCCAATAAACCTTTGAAGTTTCCTTCAGTCATG
GTCAAAGAAAATGAAGGTGAGACCAATGTGGTTGATGAGACCTCTTTGTCTACTGGAAATTTCAATCTCCAAGAGACCAACAACAATTCATTGGTTGAAAACAATGGTAG
TCATGACATCGACAACGATAACGAAAATGACAATGAAGACAACAACAACGACGAGAAGCTCAAAGGAGATGGAATACAGGTACTTGAAAAGGAAGGAGAAATTATTGACT
AA
Protein sequenceShow/hide protein sequence
MHQGELQFLPNSSLANSGNGFQNQASFCNPNPHLNNFHGNPVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFALRHMGFANSSQFFGQNMPNLVQQLGQNMGLNNGQ
FCLPFQNMNQHVIPGHMMNMSQVPFHTSYGGPNQQAVPMPFQNPGFSTVQPFGVNQGMHPVNQNPQNFIPQAMGGNSTMPINSSTQPQQARNLQSHAFAGTQGNSSISDG
GNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKGFHNERRNKFGGANSTDQQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAEIPPEYLSYSE
KHDNRKQRGDPSTLGEEAKGASIGKEKSQNRLNKRGRFKKKNHPRKKGKFEKHLSNQPPLKKREPTLLQKLLKADVRKDKRQLLQALRFMAMNSFFKEWPNKPLKFPSVM
VKENEGETNVVDETSLSTGNFNLQETNNNSLVENNGSHDIDNDNENDNEDNNNDEKLKGDGIQVLEKEGEIID