CuGenDBv2

Lag0038782 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038782
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Small nuclear ribonucleoprotein E
Genome location: chr2:26532011..26538911
RNA-Seq Expression: Lag0038782
Synteny: Lag0038782
Gene Ontology terms:
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005682 - U5 snRNP (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0005686 - U2 snRNP (cellular component)
GO:0005687 - U4 snRNP (cellular component)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
GO:0071011 - precatalytic spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domains:
IPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR027078 - Small nuclear ribonucleoprotein E


Homology
GenBank top hits | e value | %identity | Alignment
KAB5514425.1 hypothetical protein DKX38_028331 [Salix brachista] | 1.2e-18 | 36.6
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVN-----LSLAGKI--WKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRI
        NLIFRFLQSKARIQ WLFEQKDLRIEGRII +     L+   K+   +H+    +  FL          +++    +  W  +     L     E I   
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVN-----LSLAGKI--WKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRI

Query:  LLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNL
        LL+C FAS+++    G                                                               W            GFDEYMNL
Subjt:  LLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNL

Query:  VLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTG
        VLDDAEEVN+KKKS+K+LGRILLKGDNITLMMNTG
Subjt:  VLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTG

KAF9612939.1 hypothetical protein IFM89_004355 [Coptis chinensis] | 3.6e-18 | 36.24
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLF+QKD+RIEGRII                                +    RK++++AL         ++S+            
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
                             ++W+E+                  VA   L   ++ L N       C S   SL          GFDEYMNLVLDDAEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMNTGK
        VN+KKK+R +LGRILLKGDNITLM NTGK
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMNTGK

KAG6739774.1 hypothetical protein POTOM_057389 [Populus tomentosa] | 6.6e-20 | 37.12
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLFEQKDLRIEGRII N+ +                                              +CL                
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
                       C+P                             LG   +  L   +E +L + C   S        S Y  GFDEYMNLVL+DAEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMNTGK
        VN+KKKSRK+LGRILLKGDNITLMMNTGK
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMNTGK

TYK31299.1 protein FAM91A1 [Cucumis melo var. makuwa] | 5.8e-24 | 51.79
Query:  GRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLE
        G    N S    IWK K  +KVK FLWS+AYRSLN  +K+QRK  N +LSP  C LCLK++E  D + LHCDFA K WN I  L  L  CLPKK+DD ++
Subjt:  GRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLE

Query:  EGLQAWNLRKKA
        +GL   +   KA
Subjt:  EGLQAWNLRKKA

XP_035817346.1 small nuclear ribonucleoprotein E isoform X1 [Zea mays] | 6.2e-18 | 36.68
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLFEQKDLRIEGRIIV  S  G  W   S                                  GC   +   ED+          
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
                                                              SL+L  E              I+ G S    GFDEYMNLVL+DAEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMNTGK
        +NVKK +RK+LGRILLKGDNITLMMN+GK
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMNTGK

TrEMBL top hits | e value | %identity | Alignment
A0A0E0CEU5 Small nuclear ribonucleoprotein E | 8.7e-18 | 35.37
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLFEQKDLRIEGRII+ ++    +          +F+ ++++     DDK                         D ILL  +  
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
          T                                     ++V PV L                                  +  GFDEYMNLVLD+AEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMNTGK
        +N+KK +RK+LGRILLKGDNITLMMNTGK
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMNTGK

A0A0E0N972 Small nuclear ribonucleoprotein E | 1.9e-17 | 35.37
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLFEQKDLRIEGRII+ ++    +          +F+ ++++    TDDK                         D ILL  +  
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
          T                                     ++V  V L                                  +  GFDEYMNLVLD+AEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMNTGK
        +N+KK +RK+LGRILLKGDNITLMMNTGK
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMNTGK

A0A5B6VGS0 Small nuclear ribonucleoprotein E | 9.0e-15 | 35.4
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA
        NLIFRFLQSKARIQIWLFEQKDLRIEGRII                                                                      
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFA

Query:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
                                   G     L +K    V++PV LG                      +   IL        GFDEYMNLVLDDAEE
Subjt:  SKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE

Query:  VNVKKKSRKTLGRILLKGDNITLMMN
        VNVKKKSRK+LGRILLKGDNITLMMN
Subjt:  VNVKKKSRKTLGRILLKGDNITLMMN

A0A5D3E632 Protein FAM91A1 | 2.8e-24 | 51.79
Query:  GRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLE
        G    N S    IWK K  +KVK FLWS+AYRSLN  +K+QRK  N +LSP  C LCLK++E  D + LHCDFA K WN I  L  L  CLPKK+DD ++
Subjt:  GRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLE

Query:  EGLQAWNLRKKA
        +GL   +   KA
Subjt:  EGLQAWNLRKKA

A0A5N5JA39 Small nuclear ribonucleoprotein E | 6.0e-19 | 36.6
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVN-----LSLAGKI--WKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRI
        NLIFRFLQSKARIQ WLFEQKDLRIEGRII +     L+   K+   +H+    +  FL          +++    +  W  +     L     E I   
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRIIVN-----LSLAGKI--WKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDIDRI

Query:  LLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNL
        LL+C FAS+++    G                                                               W            GFDEYMNL
Subjt:  LLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNL

Query:  VLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTG
        VLDDAEEVN+KKKS+K+LGRILLKGDNITLMMNTG
Subjt:  VLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTG

SwissProt top hits | e value | %identity | Alignment
A4FUI2 Small nuclear ribonucleoprotein E | 1.7e-10 | 78.05
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN
        GFDEYMNLVLDDAEE++ K KSRK LGRI+LKGDNITL+ +
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN

A4FUI2 Small nuclear ribonucleoprotein E | 9.0e-04 | 57.89
Query:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII
        Q + V+  NLIFR+LQ+++RIQ+WL+EQ ++RIEG II
Subjt:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII

P62303 Small nuclear ribonucleoprotein E | 1.7e-10 | 78.05
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN
        GFDEYMNLVLDDAEE++ K KSRK LGRI+LKGDNITL+ +
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN

P62303 Small nuclear ribonucleoprotein E | 9.0e-04 | 57.89
Query:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII
        Q + V+  NLIFR+LQ+++RIQ+WL+EQ ++RIEG II
Subjt:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII

P62304 Small nuclear ribonucleoprotein E | 1.7e-10 | 78.05
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN
        GFDEYMNLVLDDAEE++ K KSRK LGRI+LKGDNITL+ +
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN

P62304 Small nuclear ribonucleoprotein E | 9.0e-04 | 57.89
Query:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII
        Q + V+  NLIFR+LQ+++RIQ+WL+EQ ++RIEG II
Subjt:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII

P62305 Small nuclear ribonucleoprotein E | 1.7e-10 | 78.05
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN
        GFDEYMNLVLDDAEE++ K KSRK LGRI+LKGDNITL+ +
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN

P62305 Small nuclear ribonucleoprotein E | 9.0e-04 | 57.89
Query:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII
        Q + V+  NLIFR+LQ+++RIQ+WL+EQ ++RIEG II
Subjt:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII

Q7ZUG0 Small nuclear ribonucleoprotein E | 1.3e-10 | 78.05
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN
        GFDEYMNLVLDDAEEV++K K+RK LGRI+LKGDNITL+ +
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN

Q7ZUG0 Small nuclear ribonucleoprotein E | 3.4e-03 | 55.26
Query:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII
        Q + V+  NLIFR+LQ+++RI +WL+EQ ++RIEG II
Subjt:  QWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRII

Arabidopsis top hits | e value | %identity | Alignment
AT2G18740.1 Small nuclear ribonucleoprotein family protein | 3.1e-15 | 86.36
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTGK
        GFDEYMNLVLD+AEEV++KK +RK LGRILLKGDNITLMMNTGK
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTGK

AT2G18740.1 Small nuclear ribonucleoprotein family protein | 4.3e-09 | 100
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI
        NLIFRFLQSKARIQIWLFEQKDLRIEGRI
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI

AT2G18740.2 Small nuclear ribonucleoprotein family protein | 4.3e-09 | 100
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI
        NLIFRFLQSKARIQIWLFEQKDLRIEGRI
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI

AT2G18740.2 Small nuclear ribonucleoprotein family protein | 1.7e-05 | 77.78
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLG
        GFDEYMNLVLD+AEEV++KK +RK LG
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLG

AT3G25270.1 Ribonuclease H-like superfamily protein | 1.9e-04 | 29.58
Query:  LAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRK-MKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTW
        +  KIWK K+  K+K FLW +   +L T D ++R+ ++N       C  C ++ E    +   C +A + W
Subjt:  LAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRK-MKNWALSPFGCRLCLKDSEDIDRILLHCDFASKTW

AT4G30330.1 Small nuclear ribonucleoprotein family protein | 3.1e-15 | 86.36
Query:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTGK
        GFDEYMNLVLD+AEEV++KKK+RK LGRILLKGDNITLMMN GK
Subjt:  GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMNTGK

AT4G30330.1 Small nuclear ribonucleoprotein family protein | 4.3e-09 | 100
Query:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI
        NLIFRFLQSKARIQIWLFEQKDLRIEGRI
Subjt:  NLIFRFLQSKARIQIWLFEQKDLRIEGRI
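
The %identity values listed for the hits can be checked against the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch of that check. It assumes BLAST-style midline conventions and that identity is taken over the full alignment length; the function name `percent_identity` is hypothetical, and the two strings are copied from the first SwissProt hit (A4FUI2).

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, read from its midline.

    Assumes a BLAST-style midline: letters are identities, '+' marks a
    conservative substitution, and spaces are mismatches or gaps.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline of the first SwissProt hit (A4FUI2), copied from the page.
query = "GFDEYMNLVLDDAEEVNVKKKSRKTLGRILLKGDNITLMMN"
midline = "GFDEYMNLVLDDAEE++ K KSRK LGRI+LKGDNITL+ +"

print(f"{percent_identity(query, midline):.2f}")  # prints 78.05
```

Here 32 of the 41 aligned positions are identical, which gives the 78.05 reported for this hit.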


Sequences
CDS sequence
ATGCCCGTAGGGAGATGGCTTATGCAGCCAAGGATGGTTCGCCCCTTTTTACTGATCATACATTTAAGGGGTCATTTTTGTCAATTTGCCCCGTACCCAGTTTCCCTTGC
CCTAATTGAAGTTTCATTTCGTTGCTGTGAGCTGCGATTTGTGAGTGAAAGCAGAGAGTTAGGCCTTCAATCTCCTTTCACCACAAACCCCTTCCTTGAGGAATCTATTC
GATCTAGTCGCCATGGCGAGCACCAAAGTCCAGAGGATTATGACCCAACCCATTGTACCTTCTCATCTCAATTCCTTCTCCTTCATTTCATTCTTGGTTTTCTTTCATTT
TTTTACGTGTTTCACTTATATATTTTGCTTCTCTTTCAATGGCTGTTCGTGCGCCTTCAGAACTTGATTTTCAGGTTCCTTCAAAGTAAAGCTCGGATTCAAATATGGCT
TTTTGAGCAGAAAGACCTGAGGATCGAAGGCCGAATCATCGTCAACCTATCTCTTGCGGGCAAGATTTGGAAACATAAGTCTCCTAGGAAAGTAAAAATCTTCCTTTGGA
GCATGGCTTATAGAAGCTTGAACACGGATGATAAAGTGCAAAGAAAGATGAAAAACTGGGCTCTTTCTCCCTTTGGGTGCAGACTTTGTTTAAAAGATAGTGAGGACATC
GACCGCATTCTGTTACATTGTGACTTTGCCTCCAAGACTTGGAACTTCATTGCTGGTTTGTTGGGGCTTTCTTTCTGCTTGCCGAAAAAGGTGGATGACTGGCTCGAAGA
AGGGTTGCAAGCTTGGAATTTGAGAAAGAAGGCCAAGGATGTGGTGGTTAAGCCTGTGGCGTTAGGTGGTCTTGGGATTGAGAGCTTGAGACTTCGTAACGAGACTCTCC
TGGAAATTGGTTGTAGTAGTTCTTCATGGAGTCTAATACTTTGTGGCACAAGTTGCTATGAACCAGGCTTTGACGAATATATGAATTTGGTTTTGGATGATGCCGAGGAA
GTGAATGTAAAGAAGAAGAGCAGGAAGACTTTAGGTAGGATATTGCTTAAAGGAGATAACATAACTCTGATGATGAACACGGGGAAGTGA
mRNA sequence
ATGCCCGTAGGGAGATGGCTTATGCAGCCAAGGATGGTTCGCCCCTTTTTACTGATCATACATTTAAGGGGTCATTTTTGTCAATTTGCCCCGTACCCAGTTTCCCTTGC
CCTAATTGAAGTTTCATTTCGTTGCTGTGAGCTGCGATTTGTGAGTGAAAGCAGAGAGTTAGGCCTTCAATCTCCTTTCACCACAAACCCCTTCCTTGAGGAATCTATTC
GATCTAGTCGCCATGGCGAGCACCAAAGTCCAGAGGATTATGACCCAACCCATTGTACCTTCTCATCTCAATTCCTTCTCCTTCATTTCATTCTTGGTTTTCTTTCATTT
TTTTACGTGTTTCACTTATATATTTTGCTTCTCTTTCAATGGCTGTTCGTGCGCCTTCAGAACTTGATTTTCAGGTTCCTTCAAAGTAAAGCTCGGATTCAAATATGGCT
TTTTGAGCAGAAAGACCTGAGGATCGAAGGCCGAATCATCGTCAACCTATCTCTTGCGGGCAAGATTTGGAAACATAAGTCTCCTAGGAAAGTAAAAATCTTCCTTTGGA
GCATGGCTTATAGAAGCTTGAACACGGATGATAAAGTGCAAAGAAAGATGAAAAACTGGGCTCTTTCTCCCTTTGGGTGCAGACTTTGTTTAAAAGATAGTGAGGACATC
GACCGCATTCTGTTACATTGTGACTTTGCCTCCAAGACTTGGAACTTCATTGCTGGTTTGTTGGGGCTTTCTTTCTGCTTGCCGAAAAAGGTGGATGACTGGCTCGAAGA
AGGGTTGCAAGCTTGGAATTTGAGAAAGAAGGCCAAGGATGTGGTGGTTAAGCCTGTGGCGTTAGGTGGTCTTGGGATTGAGAGCTTGAGACTTCGTAACGAGACTCTCC
TGGAAATTGGTTGTAGTAGTTCTTCATGGAGTCTAATACTTTGTGGCACAAGTTGCTATGAACCAGGCTTTGACGAATATATGAATTTGGTTTTGGATGATGCCGAGGAA
GTGAATGTAAAGAAGAAGAGCAGGAAGACTTTAGGTAGGATATTGCTTAAAGGAGATAACATAACTCTGATGATGAACACGGGGAAGTGA
Protein sequence
MPVGRWLMQPRMVRPFLLIIHLRGHFCQFAPYPVSLALIEVSFRCCELRFVSESRELGLQSPFTTNPFLEESIRSSRHGEHQSPEDYDPTHCTFSSQFLLLHFILGFLSF
FYVFHLYILLLFQWLFVRLQNLIFRFLQSKARIQIWLFEQKDLRIEGRIIVNLSLAGKIWKHKSPRKVKIFLWSMAYRSLNTDDKVQRKMKNWALSPFGCRLCLKDSEDI
DRILLHCDFASKTWNFIAGLLGLSFCLPKKVDDWLEEGLQAWNLRKKAKDVVVKPVALGGLGIESLRLRNETLLEIGCSSSSWSLILCGTSCYEPGFDEYMNLVLDDAEE
VNVKKKSRKTLGRILLKGDNITLMMNTGK
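
The protein sequence appears to be a direct translation of the CDS. The following minimal Python sketch illustrates the relationship, assuming the standard genetic code (NCBI translation table 1) and an in-frame CDS. The `translate` helper is hypothetical, and for brevity only the first ten and last five codons of the CDS are translated.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCCCGTAGGGAGATGGCTTATGCAGCCA"))  # first 10 codons: MPVGRWLMQP
print(translate("AACACGGGGAAGTGA"))                 # last 5 codons:  NTGK
```

The two outputs match the start (MPVGRWLMQP) and end (NTGK) of the protein sequence listed above; the final TGA codon is the stop.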