CuGenDBv2

Spg029922 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029922
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold6:9432337..9437558
RNA-Seq Expression: Spg029922
Synteny: Spg029922
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
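The genome location field uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span of the gene model could be computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "scaffold6:9432337..9437558".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("scaffold6:9432337..9437558")
print(loc.seqid, loc.length)
```

Under that assumption the gene model spans 5222 bp on scaffold6, which is longer than the 1455-nt coding sequence listed in the Sequences section.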


Homology
GenBank top hits (e-value, % identity, alignment)
EEC68887.1 hypothetical protein OsI_37529 [Oryza sativa Indica Group] | e-value: 1.9e-17 | identity: 25.48%
Query:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK---RTGQRMWNI
        G+G  + I    W+  D + +PI    N + K V +LI EDG+W    I   F   DAE+ILNI   + S+ED I W  D  G+FSV+   R   ++ NI
Subjt:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK---RTGQRMWNI

Query:  CFGIANGLGNFGDPLFLILK-------QFFLYAGAGGDLK-------------------IIGKLCPGSWIFRK---------------------RGQHI-
            ++G  N      +I K       + F +  A   L                    I GK  P +   ++                     +G+H+ 
Subjt:  CFGIANGLGNFGDPLFLILK-------QFFLYAGAGGDLK-------------------IIGKLCPGSWIFRK---------------------RGQHI-

Query:  -TCAVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDF
         T  ++   KY++ +         W+ P     K+NVD ++  +S  GG+G ++R+SAG ++   CK + R +     E  A +EG+ +   +  L P  
Subjt:  -TCAVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDF

Query:  LSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA
            + +E+D A+VV+L+     D S ++ +I E +       V+      R+ + ++H LA  A
Subjt:  LSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA

KAA3472112.1 reverse transcriptase [Gossypium australe] | e-value: 2.7e-16 | identity: 27.78%
Query:  CRARALIILGKFAASRRCG--GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWG
        C AR LI  G       C   G+G+ + I   PW+   G       + N+  + V +LI E    WKKD+I        A+ ILNIP     +ED ++W 
Subjt:  CRARALIILGKFAASRRCG--GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWG

Query:  KDSKGLFSVKR------TGQRMW--NICFGIANGLGNF----GDPL--FLILKQFFLYAGAGGDLKIIGKLCPGSWIFRKRGQHITCAVQEDK-------
         D+ G++SVK       +   +W   +   I N L  +    GDP     +++ F +       L  I       W  R +  H  C    D+       
Subjt:  KDSKGLFSVKR------TGQRMW--NICFGIANGLGNF----GDPL--FLILKQFFLYAGAGGDLKIIGKLCPGSWIFRKRGQHITCAVQEDK-------

Query:  ---KYQLSSPETQLS----HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLS
           +  LS     LS       WKPP   + K+N DA++ +A++S  VG + R+  G ++GA   +I+  ++  + ES A    I       L   D   
Subjt:  ---KYQLSSPETQLS----HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLS

Query:  HNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA
          +++E D+ TV+K I  E +D S I  +I  I++ A     + F F PR  + +AH+LA
Subjt:  HNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | e-value: 2.9e-18 | identity: 34.52%
Query:  GSW---------IFRKRGQHITCAVQEDKKYQLSS---PETQLS--HGV------WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKI
        GSW         IFR      +  +Q+  K+   S    ET LS  H        W+PP   +W +N DA+W D++  GG+GW+IR   G +V AG + +
Subjt:  GSW---------IFRKRGQHITCAVQEDKKYQLSS---PETQLS--HGV------WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKI

Query:  SRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAA
           + +K+LE+ AILEG+   T  G+L P      L IE+D+A V  L+NR+ EDL++   +++EI        ++ F    R T+  AHSLA+ A+
Subjt:  SRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAA

XP_024033483.1 uncharacterized protein LOC112095606 [Citrus clementina] | e-value: 3.1e-20 | identity: 25.72%
Query:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK------------
        GDG+++ I +  W+     +KPI      +   V ELI E+  W +  I   F R DA+ I+ IP     KED IIW  D KGL+SVK            
Subjt:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK------------

Query:  ------RTGQRMWNICFGIA--------------------------------------NGLGNFGDPLF----------LILKQFFLYAGAGGDLKIIGK
               +G+  WNI + +A                                       G+ N    L           L L +  +   AG ++   GK
Subjt:  ------RTGQRMWNICFGIA--------------------------------------NGLGNFGDPLF----------LILKQFFLYAGAGGDLKIIGK

Query:  LCPGSWIFRKRGQHITC--AVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAIL
              +  K    +     +Q+ ++ ++ S + Q +  VW PP     K+NVDAA        G+G +IRD  G+++ A  K      ++   E+ A+ 
Subjt:  LCPGSWIFRKRGQHITC--AVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAIL

Query:  EGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA
         G+        +  +  +  L++ESDA +VV L+N +    SEI  +I EIQ      ++V   ++ R+ + +AHSLA+ A
Subjt:  EGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA

XP_038722039.1 uncharacterized protein LOC120014190 [Tripterygium wilfordii] | e-value: 5.8e-19 | identity: 29.06%
Query:  GDGRQVYIEEGPWMMDDGNWKPIR--VDENLKGKRVVELI-QEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVKRTGQRMWNI
        GDG  V +    W+     +KPI    D+ +KG  V  LI Q   +W  D ++NSFI SD E+IL+IP       D I+W  DSKG FS+K T       
Subjt:  GDGRQVYIEEGPWMMDDGNWKPIR--VDENLKGKRVVELI-QEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVKRTGQRMWNI

Query:  CFGIANGLG---NFGD--PLFLILKQFFLYAGAGGDLKIIGKLC---PGSWIF-----------RKRGQHITCAVQEDKKYQLSSPETQLS---------
         + +A GL    + GD  PL        +YA   G  +   K C     +W+            R R    T +       +LS+  T+ S         
Subjt:  CFGIANGLG---NFGD--PLFLILKQFFLYAGAGGDLKIIGKLC---PGSWIF-----------RKRGQHITCAVQEDKKYQLSSPETQLS---------

Query:  -HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINR
          G W+PP    +K+NVD A F A  S G+G ++RD  G +  +  + I  K    + E+  +L G+        L+ +    + V+ESD++  +  I+ 
Subjt:  -HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINR

Query:  EAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAAHFSDF
        ++  LS    L+D ++        V F F+ R  + +AH+LAR A     F
Subjt:  EAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAAHFSDF

TrEMBL top hits (e-value, % identity, alignment)
A0A5B6VSY9 Reverse transcriptase | e-value: 1.3e-16 | identity: 27.78%
Query:  CRARALIILGKFAASRRCG--GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWG
        C AR LI  G       C   G+G+ + I   PW+   G       + N+  + V +LI E    WKKD+I        A+ ILNIP     +ED ++W 
Subjt:  CRARALIILGKFAASRRCG--GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWG

Query:  KDSKGLFSVKR------TGQRMW--NICFGIANGLGNF----GDPL--FLILKQFFLYAGAGGDLKIIGKLCPGSWIFRKRGQHITCAVQEDK-------
         D+ G++SVK       +   +W   +   I N L  +    GDP     +++ F +       L  I       W  R +  H  C    D+       
Subjt:  KDSKGLFSVKR------TGQRMW--NICFGIANGLGNF----GDPL--FLILKQFFLYAGAGGDLKIIGKLCPGSWIFRKRGQHITCAVQEDK-------

Query:  ---KYQLSSPETQLS----HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLS
           +  LS     LS       WKPP   + K+N DA++ +A++S  VG + R+  G ++GA   +I+  ++  + ES A    I       L   D   
Subjt:  ---KYQLSSPETQLS----HGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLS

Query:  HNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA
          +++E D+ TV+K I  E +D S I  +I  I++ A     + F F PR  + +AH+LA
Subjt:  HNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA

A0A5B6WEY4 Reverse transcriptase | e-value: 1.7e-16 | identity: 26.9%
Query:  CRARALIILGKFAASRRCGGDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKD
        C AR LI  G F       G+GR V I   PW+   G       + N     V +LI E    WKKD+I        A+ ILNIP  +  +ED ++W  D
Subjt:  CRARALIILGKFAASRRCGGDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDG-NWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKD

Query:  SKGLFSVKR------TGQRMWNICFGIANGLGNFGDPLFLILKQFFLYAGAGGDLKIIGKLCP------------------GSWIFRKRGQHITCAVQED
        + G+++VK       +   +W+             D L  I     LY    GD     KL                      W  R +  H  C    D
Subjt:  SKGLFSVKR------TGQRMWNICFGIANGLGNFGDPLFLILKQFFLYAGAGGDLKIIGKLCP------------------GSWIFRKRGQHITCAVQED

Query:  K------------KYQLSSPETQLSHGV--WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNG
        +            K   +S  +  S     WKPP   + K+N DA++  AS+S  VG +  +  G ++GA   +++  ++  + ES A    I       
Subjt:  K------------KYQLSSPETQLSHGV--WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNG

Query:  LLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA
        L   D     +++E D+ TV+K I   + D S I  +I  I + A     + F FSPR  + +AH+LA
Subjt:  LLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA

A0A6J1DNV9 uncharacterized protein LOC111022403 | e-value: 1.4e-18 | identity: 34.52%
Query:  GSW---------IFRKRGQHITCAVQEDKKYQLSS---PETQLS--HGV------WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKI
        GSW         IFR      +  +Q+  K+   S    ET LS  H        W+PP   +W +N DA+W D++  GG+GW+IR   G +V AG + +
Subjt:  GSW---------IFRKRGQHITCAVQEDKKYQLSS---PETQLS--HGV------WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKI

Query:  SRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAA
           + +K+LE+ AILEG+   T  G+L P      L IE+D+A V  L+NR+ EDL++   +++EI        ++ F    R T+  AHSLA+ A+
Subjt:  SRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAAA

A0A803NTU1 Uncharacterized protein | e-value: 2.6e-17 | identity: 24.57%
Query:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVKRTGQRMWNICFG
        G G  + + E PW+    N K I      +G +V++L   DG+W +  I+++F   D  +IL++   N   +D+I+W     G +S+K +G ++ +    
Subjt:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVKRTGQRMWNICFG

Query:  IANGLGNFGDPLFLILKQFFLYAGAGGDLKIIGKL--CPGSWIFRKRGQH-------------------------------ITCAVQEDKKYQLSSPETQ
         A      G          +   G   + +II  L  C   +   KR                                  I  AV    +Y +     +
Subjt:  IANGLGNFGDPLFLILKQFFLYAGAGGDLKIIGKL--CPGSWIFRKRGQH-------------------------------ITCAVQEDKKYQLSSPETQ

Query:  LSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLIN
         S   W PP   +WK+NVDA  ++ + S   G +IRD +G ++ +     SR       ES+AI+ G+      G+           + SD    + LIN
Subjt:  LSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLIN

Query:  REAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA
          +  +S++  L++EI   +   ++VEF F  R+ + LAHSLA+ A
Subjt:  REAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA

B8BN96 Reverse transcriptase domain-containing protein | e-value: 9.0e-18 | identity: 25.48%
Query:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK---RTGQRMWNI
        G+G  + I    W+  D + +PI    N + K V +LI EDG+W    I   F   DAE+ILNI   + S+ED I W  D  G+FSV+   R   ++ NI
Subjt:  GDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPKRNISKEDEIIWGKDSKGLFSVK---RTGQRMWNI

Query:  CFGIANGLGNFGDPLFLILK-------QFFLYAGAGGDLK-------------------IIGKLCPGSWIFRK---------------------RGQHI-
            ++G  N      +I K       + F +  A   L                    I GK  P +   ++                     +G+H+ 
Subjt:  CFGIANGLGNFGDPLFLILK-------QFFLYAGAGGDLK-------------------IIGKLCPGSWIFRK---------------------RGQHI-

Query:  -TCAVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDF
         T  ++   KY++ +         W+ P     K+NVD ++  +S  GG+G ++R+SAG ++   CK + R +     E  A +EG+ +   +  L P  
Subjt:  -TCAVQEDKKYQLSSPETQLSHGVWKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDF

Query:  LSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA
            + +E+D A+VV+L+     D S ++ +I E +       V+      R+ + ++H LA  A
Subjt:  LSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G52990.1 thioredoxin family protein | e-value: 2.5e-04 | identity: 26.52%
Query:  KMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLID
        K N DA+  +     G+GWLIR+S G+++  G  K   +   +  E  A++  I         T  F    ++ E D + V +LIN ++ D   +   +D
Subjt:  KMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLID

Query:  EIQVQATRANVVEFVFSPRNTHFLAHSLARAA
         I+         EF+F+ R  +  A +L + A
Subjt:  EIQVQATRANVVEFVFSPRNTHFLAHSLARAA

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e-value: 3.0e-05 | identity: 25.44%
Query:  WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAED
        W+ P     K N D +        G+ W+IR+S G+ +  GC K   +  IK  E  A++  I      G    +F       E D  TV +LI R  E 
Subjt:  WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAED

Query:  LSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA-RAAAHFSDFVFFPFDPSLCLRMDESISEGV
           +   ++ IQ  +     V+F F  R  +     LA +A A+  +   + F P   +      +E V
Subjt:  LSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLA-RAAAHFSDFVFFPFDPSLCLRMDESISEGV

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e-value: 7.8e-06 | identity: 24.11%
Query:  WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAED
        W PP     K N DA+  + ++  G+GW++R+S G+++  G  K   +   +  E   ++  I         +  F    ++ E D  T+ ++IN ++ +
Subjt:  WKPPDGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAED

Query:  LSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA
           +   +D IQ        +EF F  R  +  A  LA+ A
Subjt:  LSEISLLIDEIQVQATRANVVEFVFSPRNTHFLAHSLARAA


Sequences
CDS sequence
ATGGCATCGCAAACAATATATACTGTCCATTCTGCGCGACAGCGTCGAGACGCTAAGGACACAGCGTCGCGACGCTGTCGCGCGCGTGCCCTAATTATTTTAGGAAAGTT
CGCAGCGTCGAGACGCTGTGGAGGTGATGGGAGGCAAGTGTACATCGAAGAAGGCCCTTGGATGATGGATGATGGGAATTGGAAACCTATTAGAGTCGATGAGAACCTTA
AAGGTAAAAGAGTGGTGGAGCTCATTCAAGAGGACGGGAATTGGAAAAAAGATCTGATAAGGAACTCTTTCATTCGGAGTGATGCAGAGATTATTCTTAATATTCCCAAG
AGAAACATTTCAAAAGAAGACGAGATTATTTGGGGGAAGGATTCTAAAGGCTTATTTTCGGTTAAAAGAACAGGCCAGAGGATGTGGAACATTTGTTTTGGAATTGCAAA
TGGGTTAGGAAACTTTGGGGATCCCTTATTCCTAATTCTGAAGCAATTTTTTCTATATGCAGGGGCTGGTGGAGATTTAAAGATTATTGGGAAGCTTTGTCCAGGATCCT
GGATCTTCAGAAAGCGAGGACAGCATATTACATGTGCGGTTCAAGAGGATAAAAAGTACCAGTTGAGTTCGCCGGAGACCCAGCTGAGTCATGGTGTCTGGAAGCCTCCC
GATGGTCTTCTGTGGAAGATGAATGTGGACGCCGCCTGGTTTGACGCTTCAAGCTCCGGTGGAGTGGGCTGGCTTATTCGCGACTCAGCCGGTTCTTTGGTCGGAGCTGG
CTGCAAAAAGATCTCGAGGAAATCAGAGATCAAAATGTTAGAATCATTGGCGATTTTAGAAGGCATAAACCAGTTTACAACAAACGGTCTTCTTACCCCGGATTTTCTTA
GTCACAATCTGGTTATTGAATCGGACGCCGCCACAGTAGTGAAGCTAATTAATCGAGAAGCGGAGGACTTATCTGAAATCTCCCTTCTGATCGACGAGATTCAGGTCCAA
GCGACTCGCGCCAATGTGGTTGAGTTCGTCTTTAGCCCGAGAAACACACATTTTTTGGCTCACTCCCTTGCGCGCGCTGCTGCTCATTTCAGCGATTTCGTTTTCTTTCC
TTTTGATCCTTCTCTGTGCTTGAGAATGGATGAGTCGATTTCCGAAGGCGTGTGGCCTTCGCGATTTTTTTTCGATTTTGGAGGGCTTGTTTTGTGTAACGAACGTTTTA
TCTCCTTCCCTGACTTCCACGTGACACCAGTAGTGAATTTGTTTGAGGGGGGAAGATGTGACTTGCAAGACAAAGTAAATCTGCACACCGGTGTGGTGCTTGTTACACTG
CCTCCGATGTCTAAGTCAGCAAGGAGTAAGGTGAGAGAGCAAGAGAAGGAGTCGAGTCCAGAGAATAGAGTCCGGGTTCTCTTCTTCAATGACGAAGAAGGGTTTATATA
CATATTCCTGCCTTTGGGTTTCTAG
mRNA sequence
ATGGCATCGCAAACAATATATACTGTCCATTCTGCGCGACAGCGTCGAGACGCTAAGGACACAGCGTCGCGACGCTGTCGCGCGCGTGCCCTAATTATTTTAGGAAAGTT
CGCAGCGTCGAGACGCTGTGGAGGTGATGGGAGGCAAGTGTACATCGAAGAAGGCCCTTGGATGATGGATGATGGGAATTGGAAACCTATTAGAGTCGATGAGAACCTTA
AAGGTAAAAGAGTGGTGGAGCTCATTCAAGAGGACGGGAATTGGAAAAAAGATCTGATAAGGAACTCTTTCATTCGGAGTGATGCAGAGATTATTCTTAATATTCCCAAG
AGAAACATTTCAAAAGAAGACGAGATTATTTGGGGGAAGGATTCTAAAGGCTTATTTTCGGTTAAAAGAACAGGCCAGAGGATGTGGAACATTTGTTTTGGAATTGCAAA
TGGGTTAGGAAACTTTGGGGATCCCTTATTCCTAATTCTGAAGCAATTTTTTCTATATGCAGGGGCTGGTGGAGATTTAAAGATTATTGGGAAGCTTTGTCCAGGATCCT
GGATCTTCAGAAAGCGAGGACAGCATATTACATGTGCGGTTCAAGAGGATAAAAAGTACCAGTTGAGTTCGCCGGAGACCCAGCTGAGTCATGGTGTCTGGAAGCCTCCC
GATGGTCTTCTGTGGAAGATGAATGTGGACGCCGCCTGGTTTGACGCTTCAAGCTCCGGTGGAGTGGGCTGGCTTATTCGCGACTCAGCCGGTTCTTTGGTCGGAGCTGG
CTGCAAAAAGATCTCGAGGAAATCAGAGATCAAAATGTTAGAATCATTGGCGATTTTAGAAGGCATAAACCAGTTTACAACAAACGGTCTTCTTACCCCGGATTTTCTTA
GTCACAATCTGGTTATTGAATCGGACGCCGCCACAGTAGTGAAGCTAATTAATCGAGAAGCGGAGGACTTATCTGAAATCTCCCTTCTGATCGACGAGATTCAGGTCCAA
GCGACTCGCGCCAATGTGGTTGAGTTCGTCTTTAGCCCGAGAAACACACATTTTTTGGCTCACTCCCTTGCGCGCGCTGCTGCTCATTTCAGCGATTTCGTTTTCTTTCC
TTTTGATCCTTCTCTGTGCTTGAGAATGGATGAGTCGATTTCCGAAGGCGTGTGGCCTTCGCGATTTTTTTTCGATTTTGGAGGGCTTGTTTTGTGTAACGAACGTTTTA
TCTCCTTCCCTGACTTCCACGTGACACCAGTAGTGAATTTGTTTGAGGGGGGAAGATGTGACTTGCAAGACAAAGTAAATCTGCACACCGGTGTGGTGCTTGTTACACTG
CCTCCGATGTCTAAGTCAGCAAGGAGTAAGGTGAGAGAGCAAGAGAAGGAGTCGAGTCCAGAGAATAGAGTCCGGGTTCTCTTCTTCAATGACGAAGAAGGGTTTATATA
CATATTCCTGCCTTTGGGTTTCTAG
Protein sequence
MASQTIYTVHSARQRRDAKDTASRRCRARALIILGKFAASRRCGGDGRQVYIEEGPWMMDDGNWKPIRVDENLKGKRVVELIQEDGNWKKDLIRNSFIRSDAEIILNIPK
RNISKEDEIIWGKDSKGLFSVKRTGQRMWNICFGIANGLGNFGDPLFLILKQFFLYAGAGGDLKIIGKLCPGSWIFRKRGQHITCAVQEDKKYQLSSPETQLSHGVWKPP
DGLLWKMNVDAAWFDASSSGGVGWLIRDSAGSLVGAGCKKISRKSEIKMLESLAILEGINQFTTNGLLTPDFLSHNLVIESDAATVVKLINREAEDLSEISLLIDEIQVQ
ATRANVVEFVFSPRNTHFLAHSLARAAAHFSDFVFFPFDPSLCLRMDESISEGVWPSRFFFDFGGLVLCNERFISFPDFHVTPVVNLFEGGRCDLQDKVNLHTGVVLVTL
PPMSKSARSKVREQEKESSPENRVRVLFFNDEEGFIYIFLPLGF
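The protein sequence above should be recoverable from the CDS by codon-wise translation. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate` and `CODON_TABLE` are illustrative names, not CuGenDB functions.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped CDS block can be
    pasted in directly. Codons containing non-ACGT symbols become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 9 nt of the CDS listed above.
print(translate("ATGGCATCGCAAACAATA"))  # MASQTI
print(translate("GGTTTCTAG"))           # GF
```

The two fragments agree with the start (`MASQTI...`) and end (`...GF`) of the listed protein. The CDS is 1455 nt long, which corresponds to 484 codons plus the terminal TAG stop and matches the 484-residue protein.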