CuGenDBv2

Spg012604 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg012604
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: heterogeneous nuclear ribonucleoprotein H2 isoform X1
Genome location: scaffold1:4752210..4755997
RNA-Seq Expression: Spg012604
Synteny: Spg012604
Gene Ontology terms:
    GO:0009987 - cellular process (biological process)
    GO:1990904 - ribonucleoprotein complex (cellular component)
    GO:0003723 - RNA binding (molecular function)
    GO:0016301 - kinase activity (molecular function)
InterPro domains:
    IPR000504 - RNA recognition motif domain
    IPR003035 - RWP-RK domain
    IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
    IPR035979 - RNA-binding domain superfamily
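
The genome location field in this record has the form scaffold:start..end. A minimal Python sketch for splitting such a string and computing the span of the gene is given here. The helper name parse_location is hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into its parts and compute the span.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not state the convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("scaffold1:4752210..4755997")
print(seqid, start, end, span)
```

Under that assumption the gene model spans 3788 bp on scaffold1, which is longer than the CDS listed under Sequences, as expected for a locus that includes non-coding sequence.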


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6603861.1 hypothetical protein SDJN03_04470, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 3.4e-202; identity: 80.47%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSV
        MLGSGGVSDGYEVGSKRQRMMEP PYFAVSSSTAGFQPYGYGSFPPT AFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSV

Query:  QVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIA
        QVEFALQRDRQNMGRRYVEVF CKRQDYYNAVAAEVNYEGIYDNDY+GSPPPRQKR +DKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDL E+RIHIA
Subjt:  QVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIA

Query:  SRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQLVPIKDSFFDSELQKVADQMENAHLSCFSKGELSKLNDGKFHQP
        SRPDGKATGEAYVEFASAE+AKRAMSKDKMTIGSRYVELFPSTPNEARRAESR+  +  +  S F +E   VAD M +++LSCF      KLND KF   
Subjt:  SRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQLVPIKDSFFDSELQKVADQMENAHLSCFSKGELSKLNDGKFHQP

Query:  RNLPLLDQDLNFLPC-CSVAVSEGSE-NQMKESCEPEPEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREFGIHRWPHRKIKSID
             L QDLNFLPC  S+AVS+G E +QMKE CEP                              EASR+LNVGLTVLKRKCREFGIHRWPHRKIKSID
Subjt:  RNLPLLDQDLNFLPC-CSVAVSEGSE-NQMKESCEPEPEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREFGIHRWPHRKIKSID

Query:  GLIRDLQEEAKHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQGP
        GLIRDLQEEAKHREED KALMAVTKRQMMLQNERE IERTPFRELE ETKRFRQDVF+RRHKARAL S  P
Subjt:  GLIRDLQEEAKHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQGP

RYQ81764.1 hypothetical protein Ahy_B10g100373 isoform A [Arachis hypogaea]; e-value: 2.1e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

RYQ81765.1 hypothetical protein Ahy_B10g100373 isoform C [Arachis hypogaea]; e-value: 2.1e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

RYQ81766.1 hypothetical protein Ahy_B10g100373 isoform B [Arachis hypogaea]; e-value: 2.1e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

RYQ81767.1 hypothetical protein Ahy_B10g100373 isoform D [Arachis hypogaea]; e-value: 2.1e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KJN7 RRM domain-containing protein; e-value: 1.8e-140; identity: 98.44%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSV
        MLGSGGVSDGYEVGSKRQRMMEP PYFAVSSSTAGFQPYGYGSFPPT AFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSV
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSV

Query:  QVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIA
        QVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRF+DKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDL EDRIHIA
Subjt:  QVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIA

Query:  SRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ
        SRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ
Subjt:  SRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ

A0A444WWB9 Uncharacterized protein; e-value: 1.0e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

A0A444WWD1 Uncharacterized protein; e-value: 1.0e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

A0A444WWG1 Uncharacterized protein; e-value: 1.0e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

A0A444WWK2 Uncharacterized protein; e-value: 1.0e-159; identity: 56.75%
Query:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS
        MLGSGGVSDGYEVGSKRQRMME  PYFAVSS T  FQPYGY G F P P FPVVRLRGLPFNCTDIDI KFFAGL IVDVLLVNK+GRF GEAFVVFAG+
Subjt:  MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGY-GSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGS

Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH
        +QVEFALQRDRQNMGRRYVEVFRCK+QDYYNAVA+EVNYEGIYDNDYHGSPPP R KRF+DKDQMEYTEILK+RGLPFS TK+ II+FF +F L EDR+H
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP-RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIH

Query:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------
        IA RPDGKATGEAYVEF S +EAKRAM KDKMTIGSRYVELFPST +EARRAESRSR +                                IKD      
Subjt:  IASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQL------------------------------VPIKD------

Query:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL
           F  SE              LQK      V++  +N    C                                                    S GE 
Subjt:  --SFFDSE--------------LQK------VADQMENAHLSCF---------------------------------------------------SKGEL

Query:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF
         +  + K  QP  + +P+L QDLNFLP     +SE  +N++     P+    EKK+RA+S+R+A I LS+L KYF +PI EASR LNVGLTVLKRKCREF
Subjt:  SKLNDGKFHQP--RNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPE---PEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREF

Query:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG
        GI RWPHRKIKS+D LI ++QEEA ++E D K A +A  +++ ML++E+E IER PF +++ ETK+ RQD+F+RRH+ARA +  G
Subjt:  GIHRWPHRKIKSIDGLIRDLQEEAKHREEDHK-ALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKARALESQG

SwissProt top hits (e-value, % identity, alignment)
O81791 Protein RKD5; e-value: 1.5e-35; identity: 51.08%
Query:  QPRNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPEPE---------------KKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCRE
        +PR L +L QDLN LP  S   SE S N+  E  E E +               KK+R  S  +A ++L +LSKYF + I EASRNL VGLTVLK+KCRE
Subjt:  QPRNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPEPE---------------KKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCRE

Query:  FGIHRWPHRKIKSIDGLIRDLQEEA-KHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKA-RALESQ
        FGI RWPHRKIKS+D LI DLQ EA K +E++  A MAV K+Q  L+ E+  I + PF E+ IETK+FRQ+ F++RH+A RA ++Q
Subjt:  FGIHRWPHRKIKSIDGLIRDLQEEA-KHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKA-RALESQ

P52597 Heterogeneous nuclear ribonucleoprotein F; e-value: 2.2e-26; identity: 36.63%
Query:  VVRLRGLPFNCTDIDIFKFFAGLDIVD-----VLLVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY
        VV+LRGLP++C+  D+  F +   I D       +  + GR  GEAFV       V+ AL++DR++MG RY+EVF+  R +               D   
Subjt:  VVRLRGLPFNCTDIDIFKFFAGLDIVD-----VLLVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY

Query:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE
          S P      ND         ++LRGLPF  TK  I++FF   ++  + I +   P+GK TGEA+V+FAS E A++A+ K K  IG RY+E+F S+  E
Subjt:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE

Query:  AR
         R
Subjt:  AR

Q5E9J1 Heterogeneous nuclear ribonucleoprotein F; e-value: 1.7e-26; identity: 37.13%
Query:  VVRLRGLPFNCTDIDIFKFFAGLDIVDVL-----LVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY
        VV+LRGLP++C+  D+  F +   I D +     +  + GR  GEAFV       V+ AL++DR++MG RY+EVF+  R +               D   
Subjt:  VVRLRGLPFNCTDIDIFKFFAGLDIVDVL-----LVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY

Query:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE
          S P      ND         ++LRGLPF  TK  II+FF   ++  + I +   P+GK TGEA+V+FAS E A++A+ K K  IG RY+E+F S+  E
Subjt:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE

Query:  AR
         R
Subjt:  AR

Q60HC3 Heterogeneous nuclear ribonucleoprotein F; e-value: 2.2e-26; identity: 36.63%
Query:  VVRLRGLPFNCTDIDIFKFFAGLDIVD-----VLLVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY
        VV+LRGLP++C+  D+  F +   I D       +  + GR  GEAFV       V+ AL++DR++MG RY+EVF+  R +               D   
Subjt:  VVRLRGLPFNCTDIDIFKFFAGLDIVD-----VLLVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY

Query:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE
          S P      ND         ++LRGLPF  TK  I++FF   ++  + I +   P+GK TGEA+V+FAS E A++A+ K K  IG RY+E+F S+  E
Subjt:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE

Query:  AR
         R
Subjt:  AR

Q9Z2X1 Heterogeneous nuclear ribonucleoprotein F; e-value: 2.2e-26; identity: 36.63%
Query:  VVRLRGLPFNCTDIDIFKFFAGLDIVDVL-----LVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY
        VV+LRGLP++C+  D+  F +   I D +     +  + GR  GEAFV       V+ AL++DR++MG RY+EVF+  R +               D   
Subjt:  VVRLRGLPFNCTDIDIFKFFAGLDIVDVL-----LVNKNGRFMGEAFVVFAGSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDY

Query:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE
          S P      ND         ++LRGLPF  TK  I++FF   ++  + I +   P+GK TGEA+V+FAS E A++A+ K K  IG RY+E+F S+  E
Subjt:  HGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNE

Query:  AR
         R
Subjt:  AR

Arabidopsis top hits (e-value, % identity, alignment)
AT1G18790.1 RWP-RK domain-containing protein; e-value: 1.0e-18; identity: 40.15%
Query:  EPEPEKKRRATSERIAGIALS------DLSKYFGVPITEASRNLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQE-EAKHREEDHKALMAVTKRQ
        E    KKRR   E  +  ++S       +S YF +PIT+A+R LN+GLT+LK++CRE GI RWPHRK+ S+  LI +++E E    EE+   L    ++ 
Subjt:  EPEPEKKRRATSERIAGIALS------DLSKYFGVPITEASRNLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQE-EAKHREEDHKALMAVTKRQ

Query:  MMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKAR
          L+ E++ IE+ P  + E +TKR RQ  F+  HK +
Subjt:  MMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKAR

AT3G20890.1 RNA-binding (RRM/RBD/RNP motifs) family protein; e-value: 1.2e-62; identity: 45.49%
Query:  GSGGVSDGYEVGSKRQRMME---PTPYFAV-SSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAG
        G G   DG E+G KRQRM++   P P++    SS   + PYG+ + PP P FP VRLRGLPF+C ++D+ +FF GLD+VDVL V++N +  GEAF V   
Subjt:  GSGGVSDGYEVGSKRQRMME---PTPYFAV-SSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAG

Query:  SVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIY--------------------------DNDYHGSPPPR---QKRFND--KDQMEYTE
         +QV+FALQ++RQNMGRRYVEVFR  +Q+YY A+A EV    ++                               GS P R   + R +D  K+ +E+T 
Subjt:  SVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIY--------------------------DNDYHGSPPPR---QKRFND--KDQMEYTE

Query:  ILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSR
        IL+LRGLPFS  K +I++FF +F+L+ED +H+    +G+ TGEA+VEF +AE+++ AM KD+ T+GSRY+ELFPS+  E   A SR R
Subjt:  ILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSR

AT4G35590.1 RWP-RK domain-containing protein; e-value: 1.1e-36; identity: 51.08%
Query:  QPRNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPEPE---------------KKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCRE
        +PR L +L QDLN LP  S   SE S N+  E  E E +               KK+R  S  +A ++L +LSKYF + I EASRNL VGLTVLK+KCRE
Subjt:  QPRNLPLLDQDLNFLPCCSVAVSEGSENQMKESCEPEPE---------------KKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCRE

Query:  FGIHRWPHRKIKSIDGLIRDLQEEA-KHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKA-RALESQ
        FGI RWPHRKIKS+D LI DLQ EA K +E++  A MAV K+Q  L+ E+  I + PF E+ IETK+FRQ+ F++RH+A RA ++Q
Subjt:  FGIHRWPHRKIKSIDGLIRDLQEEA-KHREEDHKALMAVTKRQMMLQNEREIIERTPFRELEIETKRFRQDVFRRRHKA-RALESQ

AT5G66010.1 RNA-binding (RRM/RBD/RNP motifs) family protein; e-value: 5.7e-94; identity: 66.16%
Query:  MLGSGGV---SDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFA
        M GS G    S GYEVGSKRQRMM+  PY AV +    F P+GY        FPVVRLRGLPFNC DIDIF+FFAGL+IVDVLLV+KNG+F GEAFVVFA
Subjt:  MLGSGGV---SDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFA

Query:  GSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP----RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLT
        G +QVE ALQRDR NMGRRYVEVFRC +QDYYNAVAAE   EG Y+ +   SPPP    R KRF++K+++EYTE+LK+RGLP+SV K  IIEFF  + + 
Subjt:  GSVQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP----RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLT

Query:  EDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ
        + R+ +  RPDGKATGEA+VEF + EEA+RAM+KDKM+IGSRYVELFP+T  EARRAE+RSRQ
Subjt:  EDRIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ

AT5G66010.2 RNA-binding (RRM/RBD/RNP motifs) family protein; e-value: 2.3e-55; identity: 65.84%
Query:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP----RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTED
        +QVE ALQRDR NMGRRYVEVFRC +QDYYNAVAAE   EG Y+ +   SPPP    R KRF++K+++EYTE+LK+RGLP+SV K  IIEFF  + + + 
Subjt:  VQVEFALQRDRQNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPP----RQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTED

Query:  RIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ
        R+ +  RPDGKATGEA+VEF + EEA+RAM+KDKM+IGSRYVELFP+T  EARRAE+RSRQ
Subjt:  RIHIASRPDGKATGEAYVEFASAEEAKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQ


Sequences
CDS sequence
ATGTTGGGAAGCGGGGGGGTTTCGGATGGGTACGAAGTCGGCTCAAAAAGACAAAGAATGATGGAACCGACTCCCTACTTCGCAGTTAGCAGCAGCACTGCTGGATTTCA
ACCTTATGGCTACGGGAGTTTTCCACCTACTCCTGCCTTTCCTGTGGTTCGCCTCAGAGGACTTCCCTTCAACTGCACCGACATTGACATTTTCAAGTTCTTTGCTGGAC
TGGACATTGTGGATGTGCTGCTCGTCAACAAGAATGGGCGGTTCATGGGAGAGGCCTTTGTTGTCTTTGCTGGCTCCGTGCAGGTTGAGTTTGCATTGCAACGTGATCGA
CAGAATATGGGGCGTAGGTACGTAGAAGTCTTCAGGTGCAAAAGACAGGATTATTATAATGCCGTTGCCGCTGAAGTAAATTACGAGGGCATTTATGATAACGACTACCA
TGGAAGTCCTCCTCCTCGACAAAAGAGGTTCAACGACAAGGACCAGATGGAATACACTGAGATTCTGAAGCTGCGTGGTCTTCCCTTCTCTGTGACAAAATCCAATATCA
TTGAATTTTTTGGAGAGTTCGACCTCACAGAAGATAGGATACATATTGCAAGTCGTCCAGATGGGAAGGCTACTGGTGAGGCTTATGTGGAGTTTGCTTCAGCAGAGGAG
GCAAAGAGAGCAATGAGCAAGGACAAGATGACAATTGGATCGAGATATGTGGAGTTGTTCCCTTCAACCCCAAATGAAGCTAGAAGAGCTGAGTCAAGGTCAAGACAGCT
CGTTCCAATTAAAGATAGTTTTTTTGATTCAGAACTTCAAAAAGTTGCAGATCAGATGGAGAATGCCCATCTATCTTGTTTCTCCAAGGGAGAATTAAGTAAACTGAACG
ATGGCAAATTTCATCAACCAAGAAATTTGCCTCTACTTGATCAGGACCTTAACTTCCTTCCTTGTTGTTCTGTTGCTGTATCTGAAGGGTCTGAGAATCAAATGAAAGAA
TCCTGTGAACCAGAACCAGAAAAGAAGAGGAGGGCAACAAGTGAGCGCATTGCTGGGATTGCTTTATCAGATCTGTCTAAATACTTCGGTGTTCCAATTACAGAAGCTTC
AAGAAATTTAAATGTTGGGTTAACAGTACTGAAAAGAAAATGCAGAGAGTTTGGGATTCATCGCTGGCCACACAGGAAGATCAAGTCCATTGATGGTCTAATCCGAGATC
TTCAGGAAGAAGCAAAGCATAGAGAGGAAGATCACAAAGCTTTGATGGCAGTGACAAAGAGGCAAATGATGTTGCAGAATGAAAGAGAGATCATTGAGAGAACACCATTT
AGAGAGCTGGAGATTGAGACCAAGAGATTTAGGCAAGATGTTTTCAGGAGAAGGCATAAAGCTAGAGCTCTAGAAAGTCAGGGTCCATCCGTTTAG
mRNA sequence
ATGTTGGGAAGCGGGGGGGTTTCGGATGGGTACGAAGTCGGCTCAAAAAGACAAAGAATGATGGAACCGACTCCCTACTTCGCAGTTAGCAGCAGCACTGCTGGATTTCA
ACCTTATGGCTACGGGAGTTTTCCACCTACTCCTGCCTTTCCTGTGGTTCGCCTCAGAGGACTTCCCTTCAACTGCACCGACATTGACATTTTCAAGTTCTTTGCTGGAC
TGGACATTGTGGATGTGCTGCTCGTCAACAAGAATGGGCGGTTCATGGGAGAGGCCTTTGTTGTCTTTGCTGGCTCCGTGCAGGTTGAGTTTGCATTGCAACGTGATCGA
CAGAATATGGGGCGTAGGTACGTAGAAGTCTTCAGGTGCAAAAGACAGGATTATTATAATGCCGTTGCCGCTGAAGTAAATTACGAGGGCATTTATGATAACGACTACCA
TGGAAGTCCTCCTCCTCGACAAAAGAGGTTCAACGACAAGGACCAGATGGAATACACTGAGATTCTGAAGCTGCGTGGTCTTCCCTTCTCTGTGACAAAATCCAATATCA
TTGAATTTTTTGGAGAGTTCGACCTCACAGAAGATAGGATACATATTGCAAGTCGTCCAGATGGGAAGGCTACTGGTGAGGCTTATGTGGAGTTTGCTTCAGCAGAGGAG
GCAAAGAGAGCAATGAGCAAGGACAAGATGACAATTGGATCGAGATATGTGGAGTTGTTCCCTTCAACCCCAAATGAAGCTAGAAGAGCTGAGTCAAGGTCAAGACAGCT
CGTTCCAATTAAAGATAGTTTTTTTGATTCAGAACTTCAAAAAGTTGCAGATCAGATGGAGAATGCCCATCTATCTTGTTTCTCCAAGGGAGAATTAAGTAAACTGAACG
ATGGCAAATTTCATCAACCAAGAAATTTGCCTCTACTTGATCAGGACCTTAACTTCCTTCCTTGTTGTTCTGTTGCTGTATCTGAAGGGTCTGAGAATCAAATGAAAGAA
TCCTGTGAACCAGAACCAGAAAAGAAGAGGAGGGCAACAAGTGAGCGCATTGCTGGGATTGCTTTATCAGATCTGTCTAAATACTTCGGTGTTCCAATTACAGAAGCTTC
AAGAAATTTAAATGTTGGGTTAACAGTACTGAAAAGAAAATGCAGAGAGTTTGGGATTCATCGCTGGCCACACAGGAAGATCAAGTCCATTGATGGTCTAATCCGAGATC
TTCAGGAAGAAGCAAAGCATAGAGAGGAAGATCACAAAGCTTTGATGGCAGTGACAAAGAGGCAAATGATGTTGCAGAATGAAAGAGAGATCATTGAGAGAACACCATTT
AGAGAGCTGGAGATTGAGACCAAGAGATTTAGGCAAGATGTTTTCAGGAGAAGGCATAAAGCTAGAGCTCTAGAAAGTCAGGGTCCATCCGTTTAG
Protein sequence
MLGSGGVSDGYEVGSKRQRMMEPTPYFAVSSSTAGFQPYGYGSFPPTPAFPVVRLRGLPFNCTDIDIFKFFAGLDIVDVLLVNKNGRFMGEAFVVFAGSVQVEFALQRDR
QNMGRRYVEVFRCKRQDYYNAVAAEVNYEGIYDNDYHGSPPPRQKRFNDKDQMEYTEILKLRGLPFSVTKSNIIEFFGEFDLTEDRIHIASRPDGKATGEAYVEFASAEE
AKRAMSKDKMTIGSRYVELFPSTPNEARRAESRSRQLVPIKDSFFDSELQKVADQMENAHLSCFSKGELSKLNDGKFHQPRNLPLLDQDLNFLPCCSVAVSEGSENQMKE
SCEPEPEKKRRATSERIAGIALSDLSKYFGVPITEASRNLNVGLTVLKRKCREFGIHRWPHRKIKSIDGLIRDLQEEAKHREEDHKALMAVTKRQMMLQNEREIIERTPF
RELEIETKRFRQDVFRRRHKARALESQGPSV
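
The CDS and protein sequences in this record can be cross-checked by translating the CDS. A minimal Python sketch follows, assuming the standard genetic code applies to this nuclear gene. The function name translate is a hypothetical helper, not part of any CuGenDB tool. Translation stops at the first stop codon, and codons that are not in the table are written as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases and last 24 bases of the CDS listed in this record.
print(translate("ATGTTGGGAAGCGGGGGGGTTTCGGATGGGTACGAA"))  # MLGSGGVSDGYE
print(translate("GAAAGTCAGGGTCCATCCGTTTAG"))              # ESQGPSV
```

Both fragments agree with the start (MLGSGGVSDGYE) and end (ESQGPSV) of the protein sequence listed here. Passing the full CDS through the same function allows the comparison to be made over the whole length.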