; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007765 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007765
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:4387401..4395120
RNA-Seq ExpressionLag0007765
SyntenyLag0007765
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC84982.1 hypothetical protein OsI_32248 [Oryza sativa Indica Group]2.2e-1325.06Show/hide
Query:  RGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERPKELGKLIGSVGRRFVKARTENGKG---AHL-------KDGDWNEDLIKGNFLEAD
        + GK+ILIKA+ QAIPT++MSCF L  +     +     +   + +   K+         + + + G G    HL       + GDW+  L++  F E D
Subjt:  RGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERPKELGKLIGSVGRRFVKARTENGKG---AHL-------KDGDWNEDLIKGNFLEAD

Query:  AKTILNTPLGTISSPDEIIWNYGIEA-YSLSKVHTTLGNKRETSS-------------------HLFWNCK--------ITKEMQSYTRNQAQIPTSFFR
           IL+ P+ T    D   W+Y     +S+   +  L ++RE +S                   H F  CK        + KE                R
Subjt:  AKTILNTPLGTISSPDEIIWNYGIEA-YSLSKVHTTLGNKRETSS-------------------HLFWNCK--------ITKEMQSYTRNQAQIPTSFFR

Query:  HIERFSEK--------------ERNSEEA--YQLAPDGSTPSRIESLLSHDS----------------WSPPPSPAWKLNVDASKSDRLHSGGVGWILRD
        HI    E+               RN   A  +  +P+     R++++L   S                W PP     KLNVD +      +GG G++LRD
Subjt:  HIERFSEK--------------ERNSEEA--YQLAPDGSTPSRIESLLSHDS----------------WSPPPSPAWKLNVDASKSDRLHSGGVGWILRD

Query:  SSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDL-TENSFFVEVILKLCEALGEVSFSFCPRVKN
          G  +C G  ++       + E KA       L  L AI        + +ESD+  +++ +     DL T  + F E+   L          F PR  N
Subjt:  SSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDL-TENSFFVEVILKLCEALGEVSFSFCPRVKN

Query:  IAAHRLARMAVSPPPDFGY
          AH LAR+ VS  PD  Y
Subjt:  IAAHRLARMAVSPPPDFGY

XP_015382715.1 uncharacterized protein LOC107175626 [Citrus sinensis]6.7e-1525.65Show/hide
Query:  FLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNF-GGERPKELGKLIGSVGRRFVKARTENGKG---------AHLKDGDWN----EDLIKG
        F  GGKEILIKA+ QA+P Y+MS FRL         K    F  G R    G +  S   +  + ++  G G         A +    W      +L+  
Subjt:  FLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNF-GGERPKELGKLIGSVGRRFVKARTENGKG---------AHLKDGDWN----EDLIKG

Query:  NFLEA---DAKTILNTPLGTISSPDEIIWN---YGIEAYSLSKVHTTLGNK---------RETSSHLFWNCKITKEMQS-----------YTRNQAQIPT
          L+A        LNT   T+ S    IW    +G +    ++    L            ++  S++   C  +K  ++           Y RN+     
Subjt:  NFLEA---DAKTILNTPLGTISSPDEIIWN---YGIEAYSLSKVHTTLGNK---------RETSSHLFWNCKITKEMQS-----------YTRNQAQIPT

Query:  SFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQ
             +   ++ E   E  +++   G+        +S   WSPPP    KLNVDA+ +++    G+G +LRDS+G ++  G K++S    V   E +AIQ
Subjt:  SFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQ

Query:  EGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV
         G++       I  + A   ++VE+D   V  L+N  +   +E  + +  I         + F+F PR  N  AH LA+ A+
Subjt:  EGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.6e-1638.3Show/hide
Query:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD
        W PPP   W LN DAS SD  H GG+GWI+R   G ++  G + +    +VK LE  AI EG+++L  L  +       P+ +E+D+  V +LLN++  D
Subjt:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD

Query:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMA
        LT+  + VE IL L ++   ++F+   R  N  AH LA+ A
Subjt:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMA

XP_023895795.1 uncharacterized protein LOC112007664 [Quercus suber]8.2e-1323.47Show/hide
Query:  EREAFLRGGKEILIKAIVQAIPTYSMSCFRLPTSFV-------------MRSTKPSLNF------------GGERPKELGKL---------------IGS
        E +   + G+E+LIK+++QAIPTY M CF++P                  R  +  +++            GG   ++L K                  S
Subjt:  EREAFLRGGKEILIKAIVQAIPTYSMSCFRLPTSFV-------------MRSTKPSLNF------------GGERPKELGKL---------------IGS

Query:  VGRRFVKAR----------TENGKGAHL---------KDGDWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIEA-YSLSKVHTTLGNKRETSS
        +  +  KAR           E+  G++          ++  WN D++ G F   +A+ + + PL      D + W Y  +  YS    +  L  + E + 
Subjt:  VGRRFVKAR----------TENGKGAHL---------KDGDWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIEA-YSLSKVHTTLGNKRETSS

Query:  HLFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGS
            + ++T E +   R Q   P      +   S+   +       +PD +   ++ SL   + W PPPS  +K+N D +        GVG +++D  G 
Subjt:  HLFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGS

Query:  LICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHR
        +I    K + +   + + E++A+  G  +L F+     D+     ++E D+  VI  L +EE  L      +E    L     E+ +S   R  N  AH 
Subjt:  LICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHR

Query:  LARMAVSPP
        LAR AVS P
Subjt:  LARMAVSPP

XP_028945257.1 uncharacterized protein LOC114819788 [Malus domestica]1.3e-1827.16Show/hide
Query:  RQNLESAPELEREAFL-RGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERPKELGKLIGSVGRRFV---KARTENGKGAHLKDGDWNED
        R  LES      E FL + GKE+L+KA+  A+P Y+MSCF+LP      +    +  G    +  G L+  V +      K+ TE GKG+      W  +
Subjt:  RQNLESAPELEREAFL-RGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERPKELGKLIGSVGRRFV---KARTENGKGAHLKDGDWNED

Query:  LIKGNFLEADAKTILNTPLGTISSPDEIIWNYGI-EAYSLSK---------VHTTLGNKRETSS----------HLFWNCK----ITKEMQSY-------
        LI  +F   D   IL+ PL      D ++W+Y     YS+            +  LG K   S+          +L W  K      + MQ +       
Subjt:  LIKGNFLEADAKTILNTPLGTISSPDEIIWNYGI-EAYSLSK---------VHTTLGNKRETSS----------HLFWNCK----ITKEMQSY-------

Query:  --TRNQAQIPTSFFRHIERFSEKERNSEE---AYQLAPDGSTPSRIESLLSHD----SWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGF
           RN+      + + +E      RN  E   A      G  P     + S D     W  P     K+N DAS        GVGW+ RD +G L   G 
Subjt:  --TRNQAQIPTSFFRHIERFSEKERNSEE---AYQLAPDGSTPSRIESLLSHD----SWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGF

Query:  KKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALG----EVSFSFCPRVKNIAAHRLA
          +    S    E  AI+  + +         D  +  +++ESDA  +I ++  E     ++ F +E IL   E L      VSFS+ PR  N+AAH +A
Subjt:  KKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALG----EVSFSFCPRVKNIAAHRLA

Query:  RMAVSPPPDFGYSPFG
        +   +   +FG+   G
Subjt:  RMAVSPPPDFGYSPFG

TrEMBL top hitse value%identityAlignment
A0A5B6WWE2 Reverse transcriptase3.4e-1225.23Show/hide
Query:  TNCEKQVQSVPKIERQNLESAPELEREAFLR-----------GGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERP--KE----------
        TN EK +     + R+  ES   L+ +  LR           GGKE+ IK+++QAIPTY+M CF LP SF     K    F  ++   K+          
Subjt:  TNCEKQVQSVPKIERQNLESAPELEREAFLR-----------GGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERP--KE----------

Query:  -LGKLIGSVG-RRFVKA----------RTENG---------KGAHLKDGDWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIEAYSLSKVHTTL
           K  G +G R F K           R  N          K  +  + +W  ++++  F +  A+ IL  PL     PDE +  +  E    S +H   
Subjt:  -LGKLIGSVG-RRFVKA----------RTENG---------KGAHLKDGDWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIEAYSLSKVHTTL

Query:  GNKRETSSHLFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIE--------SLLSHDSWSPPPSPAWKLNVDASKSDR
          + + +SHLF  C ++  +        Q+    F  +     KE  +    Q     STPS+          S L    W P P    K+N DA+    
Subjt:  GNKRETSSHLFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIE--------SLLSHDSWSPPPSPAWKLNVDASKSDR

Query:  LHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIY--------PDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVIL
        L    +G + RD  GS++               L    IQE + S    +AI          D+ +  I++E DA  VI    K+  D +    F+  I 
Subjt:  LHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIY--------PDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVIL

Query:  KLCEALGEVSFSFCPRVKNIAAHRLARMAVSPPPDF
        +    + +  F + P+  N  AH LAR  +    +F
Subjt:  KLCEALGEVSFSFCPRVKNIAAHRLARMAVSPPPDF

A0A6J1DNV9 uncharacterized protein LOC1110224031.7e-1638.3Show/hide
Query:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD
        W PPP   W LN DAS SD  H GG+GWI+R   G ++  G + +    +VK LE  AI EG+++L  L  +       P+ +E+D+  V +LLN++  D
Subjt:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD

Query:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMA
        LT+  + VE IL L ++   ++F+   R  N  AH LA+ A
Subjt:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMA

A0A6J1DSV1 uncharacterized protein LOC1110236083.4e-1236.11Show/hide
Query:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSL--PFLKAIYPDLAFPPIMVESDATGVINLLNKEE
        W PP S +WKLN DA+     ++GG+GWILRD  G +I    + I    ++  LE+ AI EG++++     + I  +    PI +ESD+   I+LL+++ 
Subjt:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSL--PFLKAIYPDLAFPPIMVESDATGVINLLNKEE

Query:  TDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV
         D TE  + +E I ++ E +  VS     R  N  AH LAR A+
Subjt:  TDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV

A0A803P9C5 Uncharacterized protein2.6e-1222.75Show/hide
Query:  FLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMR-----------STKPSLNFGGERPKELGKLIGS----------VGRRFVKARTENGKGAHLKDG--
        F  GGKE+L+KA+VQ+IPTY+MSCFRL   F              ST  +     ++ + L K  GS            +  V  R    KG  LK G  
Subjt:  FLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMR-----------STKPSLNFGGERPKELGKLIGS----------VGRRFVKARTENGKGAHLKDG--

Query:  --------------------------------------DWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGI-------------------EAYS
                                              +WN +L+  +F   D + IL  PL  + + D  IW+Y +                   +  S
Subjt:  --------------------------------------DWNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGI-------------------EAYS

Query:  LSKVHTTLGNKRETSSHLFWNCKITKEMQS-------YTRNQAQIPTSFFRHIERFSEKER------------NSEEAYQLAPDGSTPSRIESLLSHDSW
         S    T     E+  H  ++C   K++ +       ++         +  H+     K+             ++  +Y  A          S +    W
Subjt:  LSKVHTTLGNKRETSSHLFWNCKITKEMQS-------YTRNQAQIPTSFFRHIERFSEKER------------NSEEAYQLAPDGSTPSRIESLLSHDSW

Query:  SPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDL
         PPP   +KLNVDA+        G+G I+R+S+G ++    K    N+  + +E KA+  G+       +       P   VE+D   ++N LN     +
Subjt:  SPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDL

Query:  TENSFFVEVILKL---CEALGEVSFSFCPRVKNIAAHRLARMAV
        + NS F  ++  +     +   V  S   R  N AAH LA+ A+
Subjt:  TENSFFVEVILKL---CEALGEVSFSFCPRVKNIAAHRLARMAV

A0A803PYY8 Uncharacterized protein3.4e-1226.51Show/hide
Query:  EREAFLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNF-GGERPKELGKLIGSVGRRFVKARTENGKGAH-----------------LKDGD
        + + F +GGKE L+K+++QAIPTYSM+CFRLP +          NF  G       K      ++  K++ + G G                        
Subjt:  EREAFLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNF-GGERPKELGKLIGSVGRRFVKARTENGKGAH-----------------LKDGD

Query:  WNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIE-AYSL-SKVHTTLGNKRETSSHLF------------WNCKITKEMQSYTRNQAQIPTSFFR
        W+   +   +     + IL  PL    SPD+ IW +    AYS+ S  H +L +   TS  +F            +   I        + QAQ       
Subjt:  WNEDLIKGNFLEADAKTILNTPLGTISSPDEIIWNYGIE-AYSL-SKVHTTLGNKRETSSHLF------------WNCKITKEMQSYTRNQAQIPTSFFR

Query:  HIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIK
        ++      E NS     L+    T SR +  ++  +WSPP S   KLNVDA+ S      G G ++R+S G ++    +      +V TLE K++     
Subjt:  HIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIK

Query:  SLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALG---EVSFSFCPRVKNIAAHRLARMAV
         L  L+    D  F    VE+D   + + L+  + D T    F ++I ++ EAL        S   R  N  A +LA  A+
Subjt:  SLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALG---EVSFSFCPRVKNIAAHRLARMAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.3e-1128.64Show/hide
Query:  LFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSL
        L W    ++    +   +   P    R +E F E     E    L    S P ++E  LS   W  PP    K N DA+        G+GWILR+ SG +
Subjt:  LFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSL

Query:  ICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRL
        + +G + + +  +V   EL+A++  + ++           +  I+ ESDA  ++NLLN ++   T     +E I +L     EV F F PR  N  A R+
Subjt:  ICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRL

Query:  ARMAVS
        AR ++S
Subjt:  ARMAVS

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.4e-0726.9Show/hide
Query:  DSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEE
        D W  P +   K N D S        G+ WI+R+S G+ +  G  K     ++K  E  A+   I+          DL +  +  E D   V  L+  +E
Subjt:  DSWSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEE

Query:  TDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAVS
        T+     +++E I +  +A   V F+F  R +N+    LA+ AV+
Subjt:  TDLTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAVS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.5e-0726.76Show/hide
Query:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD
        WSPP     K N DAS  +R    G+GWILR+S G++I  G  K     + +  E   +      +  ++A Y       ++ E D   +  ++N + ++
Subjt:  WSPPPSPAWKLNVDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETD

Query:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV
              F++ I     +   + FSF  R +N  A  LA+ A+
Subjt:  LTENSFFVEVILKLCEALGEVSFSFCPRVKNIAAHRLARMAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGCCGGAACCTGCTGAGGACTCGCTGGAAGGGAAGAAGATCACAAGTCTGGTGCTGCTGGACTCACCGGACGATGGTCAGAATGAGGACTACGGAGACGTGCTGAG
TTGGACGGCGTTAGAGATCGTAGAAGAGAAGGAAGTCGGCGGCTCCGACCGGCTTACCAAACTGCAGCAGCACCGCCACCGCTACCTTCTCTTTTCACGGCATGAAGGAT
ATGAACGCCAGATGAGACAAGGATCTTGGCTACTACCTGGGCATGCCATCACAAACTGCGAGAAACAAGTCCAAAGTGTTCCAAAGATTGAGAGACAGAATCTGGAAAGC
GCTCCAGAGTTGGAAAGGGAAGCTTTTCTCCGCGGGGGAAAAGAAATTCTTATAAAAGCAATAGTCCAAGCTATTCCAACCTATTCTATGAGTTGCTTTAGACTCCCAAC
CTCTTTTGTAATGAGATCAACAAAGCCATCGCTAAATTTTGGTGGGGAGAGACCCAAGGAACTCGGAAAACTCATTGGATCAGTTGGAAGACGCTTTGTAAAAGCAAGGA
CAGAGAATGGTAAGGGAGCTCATCTCAAAGATGGTGACTGGAATGAAGATCTTATTAAAGGAAATTTCTTAGAAGCTGACGCCAAGACCATTCTTAACACCCCTTTGGGA
ACAATCTCGAGTCCTGACGAGATTATATGGAACTATGGGATAGAGGCTTATTCTCTGTCAAAAGTGCATACCACCTTGGGGAACAAAAGGGAAACATCAAGCCACTTGTT
CTGGAATTGCAAGATTACTAAAGAAATGCAGTCCTACACAAGGAATCAAGCCCAGATACCAACCTCATTTTTCAGACATATAGAAAGATTTTCAGAGAAGGAGAGGAATT
CAGAGGAAGCGTACCAGTTGGCGCCCGACGGATCGACCCCTTCTAGGATAGAGAGCCTCTTGAGTCATGATAGTTGGTCTCCCCCTCCATCGCCCGCTTGGAAGCTTAAT
GTCGACGCTTCAAAATCGGATCGTCTTCATTCCGGAGGGGTCGGATGGATCCTCCGTGACTCATCAGGATCTCTGATCTGTTTGGGGTTCAAGAAAATTTCGAAGAACTG
GTCAGTTAAAACGCTGGAACTAAAAGCTATTCAAGAAGGGATCAAAAGCCTACCTTTCCTGAAAGCGATCTACCCCGATTTGGCATTCCCGCCTATAATGGTTGAGTCGG
ATGCGACGGGAGTTATCAATCTTCTCAACAAGGAAGAGACCGACCTCACGGAGAACTCTTTCTTTGTAGAAGTAATCCTTAAGCTTTGTGAAGCTTTGGGAGAAGTCTCG
TTTTCTTTTTGTCCGAGAGTGAAGAACATCGCAGCCCATCGTTTGGCGCGCATGGCTGTTTCTCCTCCACCAGATTTTGGCTATTCTCCTTTTGGCTCTTCTTCTACTTC
GGAAGAAGATGAATTTTTTTGTGTTGGGCTCCCCCTGTTTGGGTTGTTGAGCCCATTTTTGATGGTCTTTAAGAATTCGGAGGTGTTTCGGGATGAACCAGGCGGAACCG
AGGCGATCCGGGACATCACGGGTCGAAAGGAGGTGACCGAGCTCGGCCTGCGCAAGCGGGCCGAATGGTCAGCCTCGGCCTTTTGCCGAGGCCGGCCATATGGGTCAGGC
CATGTTGGCCCGACCCTTTGGTCTGGTCTTCCTATGGGTCGGATTTCTGGTCCTACCTTTGCCCGATTGTCCTTGTCAGCTTCTTGTTCGTCAGTAATTTTGGACCACTC
TGATGCACAAGGAGCTGACGAAGACAACCGAGCAGAGAAAGGACCAGGAAACCAACCCAGAGGAAGACCAGACCGAAGGGTCGGGCCAACTTGGCCTGACCCATATGGTC
GGCGAGCCAAGCTCGGTCACCTCTGTTTTGTCCTTGCTGCCTTTGGTCGCCCCGGTTCCACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGCCGGAACCTGCTGAGGACTCGCTGGAAGGGAAGAAGATCACAAGTCTGGTGCTGCTGGACTCACCGGACGATGGTCAGAATGAGGACTACGGAGACGTGCTGAG
TTGGACGGCGTTAGAGATCGTAGAAGAGAAGGAAGTCGGCGGCTCCGACCGGCTTACCAAACTGCAGCAGCACCGCCACCGCTACCTTCTCTTTTCACGGCATGAAGGAT
ATGAACGCCAGATGAGACAAGGATCTTGGCTACTACCTGGGCATGCCATCACAAACTGCGAGAAACAAGTCCAAAGTGTTCCAAAGATTGAGAGACAGAATCTGGAAAGC
GCTCCAGAGTTGGAAAGGGAAGCTTTTCTCCGCGGGGGAAAAGAAATTCTTATAAAAGCAATAGTCCAAGCTATTCCAACCTATTCTATGAGTTGCTTTAGACTCCCAAC
CTCTTTTGTAATGAGATCAACAAAGCCATCGCTAAATTTTGGTGGGGAGAGACCCAAGGAACTCGGAAAACTCATTGGATCAGTTGGAAGACGCTTTGTAAAAGCAAGGA
CAGAGAATGGTAAGGGAGCTCATCTCAAAGATGGTGACTGGAATGAAGATCTTATTAAAGGAAATTTCTTAGAAGCTGACGCCAAGACCATTCTTAACACCCCTTTGGGA
ACAATCTCGAGTCCTGACGAGATTATATGGAACTATGGGATAGAGGCTTATTCTCTGTCAAAAGTGCATACCACCTTGGGGAACAAAAGGGAAACATCAAGCCACTTGTT
CTGGAATTGCAAGATTACTAAAGAAATGCAGTCCTACACAAGGAATCAAGCCCAGATACCAACCTCATTTTTCAGACATATAGAAAGATTTTCAGAGAAGGAGAGGAATT
CAGAGGAAGCGTACCAGTTGGCGCCCGACGGATCGACCCCTTCTAGGATAGAGAGCCTCTTGAGTCATGATAGTTGGTCTCCCCCTCCATCGCCCGCTTGGAAGCTTAAT
GTCGACGCTTCAAAATCGGATCGTCTTCATTCCGGAGGGGTCGGATGGATCCTCCGTGACTCATCAGGATCTCTGATCTGTTTGGGGTTCAAGAAAATTTCGAAGAACTG
GTCAGTTAAAACGCTGGAACTAAAAGCTATTCAAGAAGGGATCAAAAGCCTACCTTTCCTGAAAGCGATCTACCCCGATTTGGCATTCCCGCCTATAATGGTTGAGTCGG
ATGCGACGGGAGTTATCAATCTTCTCAACAAGGAAGAGACCGACCTCACGGAGAACTCTTTCTTTGTAGAAGTAATCCTTAAGCTTTGTGAAGCTTTGGGAGAAGTCTCG
TTTTCTTTTTGTCCGAGAGTGAAGAACATCGCAGCCCATCGTTTGGCGCGCATGGCTGTTTCTCCTCCACCAGATTTTGGCTATTCTCCTTTTGGCTCTTCTTCTACTTC
GGAAGAAGATGAATTTTTTTGTGTTGGGCTCCCCCTGTTTGGGTTGTTGAGCCCATTTTTGATGGTCTTTAAGAATTCGGAGGTGTTTCGGGATGAACCAGGCGGAACCG
AGGCGATCCGGGACATCACGGGTCGAAAGGAGGTGACCGAGCTCGGCCTGCGCAAGCGGGCCGAATGGTCAGCCTCGGCCTTTTGCCGAGGCCGGCCATATGGGTCAGGC
CATGTTGGCCCGACCCTTTGGTCTGGTCTTCCTATGGGTCGGATTTCTGGTCCTACCTTTGCCCGATTGTCCTTGTCAGCTTCTTGTTCGTCAGTAATTTTGGACCACTC
TGATGCACAAGGAGCTGACGAAGACAACCGAGCAGAGAAAGGACCAGGAAACCAACCCAGAGGAAGACCAGACCGAAGGGTCGGGCCAACTTGGCCTGACCCATATGGTC
GGCGAGCCAAGCTCGGTCACCTCTGTTTTGTCCTTGCTGCCTTTGGTCGCCCCGGTTCCACCTAG
Protein sequenceShow/hide protein sequence
MTPEPAEDSLEGKKITSLVLLDSPDDGQNEDYGDVLSWTALEIVEEKEVGGSDRLTKLQQHRHRYLLFSRHEGYERQMRQGSWLLPGHAITNCEKQVQSVPKIERQNLES
APELEREAFLRGGKEILIKAIVQAIPTYSMSCFRLPTSFVMRSTKPSLNFGGERPKELGKLIGSVGRRFVKARTENGKGAHLKDGDWNEDLIKGNFLEADAKTILNTPLG
TISSPDEIIWNYGIEAYSLSKVHTTLGNKRETSSHLFWNCKITKEMQSYTRNQAQIPTSFFRHIERFSEKERNSEEAYQLAPDGSTPSRIESLLSHDSWSPPPSPAWKLN
VDASKSDRLHSGGVGWILRDSSGSLICLGFKKISKNWSVKTLELKAIQEGIKSLPFLKAIYPDLAFPPIMVESDATGVINLLNKEETDLTENSFFVEVILKLCEALGEVS
FSFCPRVKNIAAHRLARMAVSPPPDFGYSPFGSSSTSEEDEFFCVGLPLFGLLSPFLMVFKNSEVFRDEPGGTEAIRDITGRKEVTELGLRKRAEWSASAFCRGRPYGSG
HVGPTLWSGLPMGRISGPTFARLSLSASCSSVILDHSDAQGADEDNRAEKGPGNQPRGRPDRRVGPTWPDPYGRRAKLGHLCFVLAAFGRPGST