CuGenDBv2

Lcy07g008100 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy07g008100
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: RNase H domain-containing protein
Genome location: Chr07:36295862..36300657
RNA-Seq Expression: Lcy07g008100
Synteny: Lcy07g008100
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
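
The genome location uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python snippet below parses it and derives the span of the locus. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr07:36295862..36300657")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr07 36295862 36300657 4796
```

Under that assumption the locus covers 4796 bp, which is considerably longer than the CDS listed further down, so the gene model presumably includes non-coding sequence such as introns.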


Homology
GenBank top hits | e value | % identity | Alignment
XP_015383853.1 uncharacterized protein LOC107176237 [Citrus sinensis] | 4.4e-25 | 25.3
Query:  MARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG
        MA+FWW   +D R IHW                                     YP   ++RVLK RY+   DFL A  GS PSYIWRS++WGR++L KG
Subjt:  MARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG

Query:  IRWRIGNG----------------EKVRVYGSN----------------------------------------WIPDETCLK---ADFMLNLLRDVKNKV
        IRWRI N                  KVR +  +                                        + P ETC        ML ++ ++  K+
Subjt:  IRWRIGNG----------------EKVRVYGSN----------------------------------------WIPDETCLK---ADFMLNLLRDVKNKV

Query:  DWVKFEELVVVLWAVWCCRNQQKFRGRVPSAGLVDWAVNNLV-------------------------------VFRVNVDAAFCEDLFRAGAGVVVRDEA
             E +V + W +W  +N+  F G+     L    V  +V                                F+VNVDAA   +  +AG GVV+RD  
Subjt:  DWVKFEELVVVLWAVWCCRNQQKFRGRVPSAGLVDWAVNNLV-------------------------------VFRVNVDAAFCEDLFRAGAGVVVRDEA

Query:  RRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARF
         +V ++A  +    G +  AE  A+  G+ +        +++E+D   V  L+ +      +E+  ++SE +        F    T R  N  AH LA+ 
Subjt:  RRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARF

Query:  ALRERGCEYWMELVP
        ALR      W+E +P
Subjt:  ALRERGCEYWMELVP

XP_016649695.1 PREDICTED: uncharacterized protein LOC107881115 [Prunus mume] | 2.7e-30 | 29.26
Query:  MARFWWNGDKDNRRIHWYP-------------------------------------TSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG
        MA+FWW   K+ R IHW                                        S  +R+ K RYFP  +FL+AG+G RPS IW SLIWG++LL  G
Subjt:  MARFWWNGDKDNRRIHWYP-------------------------------------TSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG

Query:  IRWRIGNGEKVRVYGSNWIPDETCLKA------DFMLNLLRDVK----------NKVDW--VKFEEL---VVVL-------------WAVWCCRNQQKFR
        +RWR+GNGE + VY   W+P     K       D  +    +VK          N  +W    F+EL   V+V+             W +W  RN   F+
Subjt:  IRWRIGNGEKVRVYGSNWIPDETCLKA------DFMLNLLRDVK----------NKVDW--VKFEEL---VVVL-------------WAVWCCRNQQKFR

Query:  GR-VPSAGLVDWAV---NNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYML
        GR + S  + + A+        +++NVD A        G GVV+R+  +  M  AA S   +G        A  +G++   ++G   ++LE D+  V   
Subjt:  GR-VPSAGLVDWAV---NNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYML

Query:  LQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLVEEDVLA
        +        +E G +V + +        F  N+ RR GN+VAHELA+F  R  G   W+E  P  +   +  D+ A
Subjt:  LQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLVEEDVLA

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | 2.7e-27 | 23.31
Query:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLK------------------------
        RI  +P S LSRVLKGRYF    F++A +   PSYIWRS++WGR+LL KG+RWRIGNG+ V +YG NW+P++  LK                        
Subjt:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLK------------------------

Query:  --------------------------------------------ADFMLNLLRD-------------------------VKNKVD---------------
                                                    + + + LL +                         + NK+                
Subjt:  --------------------------------------------ADFMLNLLRD-------------------------VKNKVD---------------

Query:  ------------------------------------WVK-----------------------FEELVVVLWAVWCCRNQQKFR-----------------
                                            W+                        FEEL VV+W +W  RN + F                  
Subjt:  ------------------------------------WVK-----------------------FEELVVVLWAVWCCRNQQKFR-----------------

Query:  ----------------GRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV
                        GRV +   + W   +  ++++N DA+F      AG G+++ ++  +VM +A    +++ S+D+AE +A ++G++L  E+G+ P 
Subjt:  ----------------GRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV

Query:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWME
                        A+ DLSE G +V +A+      L   FNF +R+GN+ AH LAR AL       WME
Subjt:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWME

XP_024190496.1 uncharacterized protein LOC112194497 [Rosa chinensis] | 3.4e-25 | 27.35
Query:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIP------------------------------
        RI  +P S ++R+   RYFPS  F  AG+ + PSY WR+++  REL+  G  W+IGNGEKV+V+   WIP                              
Subjt:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIP------------------------------

Query:  DETCLKADFMLNLL-----------RDVKNKVDWV----------KFEELVVVLWAVWCCRNQQKFRGR---------VPSAGLVDWAVNNLV-------
        DE  ++  F  N             R     V+W+           FEEL+VVLW +W  RN +    +           S  L ++ ++N+        
Subjt:  DETCLKADFMLNLL-----------RDVKNKVDWV----------KFEELVVVLWAVWCCRNQQKFRGR---------VPSAGLVDWAVNNLV-------

Query:  ------------VFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDL
                    V +VN+D +F       G G ++RD    V+        H+ S + AE LA  + ++ V+E  L PVI+ETDS  V   +  ++  + 
Subjt:  ------------VFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDL

Query:  SELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFA
        S +G +  +    + +    +   T+R+ N+VAH LA  A
Subjt:  SELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFA

XP_024196130.1 uncharacterized protein LOC112199331 [Rosa chinensis] | 8.8e-26 | 25.56
Query:  ARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGI
        A FWW   ++  +IHW                                      P S ++R+ K  YFP+G++L AG+G  PS+ WRS++  R++L +G+
Subjt:  ARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGI

Query:  RWRIGNGEKVRVYGSNWIPDE---------------------TCLKADFMLNLLRDVKNKVD-----WVKFEELVVVLWAVWCCRNQQKFRGRVPSAGLV
        RW++GNGE++ ++  NWIPD                        +   + +NLL      VD     W    +  +    VW     Q   G  P    +
Subjt:  RWRIGNGEKVRVYGSNWIPDE---------------------TCLKADFMLNLLRDVKNKVD-----WVKFEELVVVLWAVWCCRNQQKFRGRVPSAGLV

Query:  DW---------------------------AVN--------NLVVF--------RVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEG
        DW                            VN         LV +        +VN D AF     + GAGVV+RD      + AA     V S   AE 
Subjt:  DW---------------------------AVN--------NLVVF--------RVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEG

Query:  LALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLV
        +AL +G+ L V +    V+ E+D   +   +  T   DLS + +L+ E R  +  H  +R N+  R+ N VAH  A  ALR    + W  + P  +  ++
Subjt:  LALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLV

Query:  EED
          D
Subjt:  EED

TrEMBL top hits | e value | % identity | Alignment
A0A151SBS5 RNase H domain-containing protein | 4.0e-24 | 27.65
Query:  PTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVY--------GSNWIPDETCLKADFMLNLLRDVKNKVDWVKFE
        P +  +RV K RYFP GDFL A +G  PSYIWRS+   + ++ KGIRW +GNG ++ V+        G+ +I   T  +++ ++     VK+ +D+    
Subjt:  PTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVY--------GSNWIPDETCLKADFMLNLLRDVKNKVDWVKFE

Query:  ELVVVLWAVWCCRNQQKFRG----RVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVV
          +  L A++  ++ Q  +      +    +V W +N   ++  N DAA  +D         +RD   R   +    +  +     AE +A ++ M  + 
Subjt:  ELVVVLWAVWCCRNQQKFRG----RVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVV

Query:  EMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLVEEDV
              V++E D   V   L  +  + LSE G+L+ + R  + +H      F RR  N VAH LAR A R     +  + +P C+  L+ +++
Subjt:  EMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLVEEDV

A0A2N9FG05 RNase H domain-containing protein | 3.1e-24 | 27.17
Query:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLKADFMLNLLRDVKNKVDWVKFEELV
        R++  PT+ L RV K ++FP G  LDA   ++ SY W+S+I  R+ + +G+ WR+GNG  ++++   W+P+E+      +  +   ++N V ++  EE +
Subjt:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLKADFMLNLLRDVKNKVDWVKFEELV

Query:  VVLWAVWCCRNQQKFRGRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV
                C + Q   G       V W  ++   F++N D A  ++  +A  GVV+RDE    + S      +  S++  E LA    ++  +++G+   
Subjt:  VVLWAVWCCRNQQKFRGRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV

Query:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPA
         +E DS  +    +       +  G+ +   +       + +FN  +R GN +AH LAR A      E WME VP+
Subjt:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPA

A0A6J1CDQ4 uncharacterized protein LOC111010533 | 4.4e-23 | 33.62
Query:  MLNLLRDVKNKVDWVKFEELVVVLWAVWCCRNQQKF-RGRVPSAGLVDW----------------------------------AVNNLV-------VFRV
        M+N+LRD ++ ++W  FEELVV LW++W  RN   F + RV    L  W                                  A N+ +       VF++
Subjt:  MLNLLRDVKNKVDWVKFEELVVVLWAVWCCRNQQKF-RGRVPSAGLVDW----------------------------------AVNNLV-------VFRV

Query:  NVDAAFCEDLFRAGAGV-VVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVP
          DA+F    F AG GV ++RD   +V+ SA    +HV S+D AE LA ++G+R+ +E G++P++LETDS+R+Y L        LS+ G ++   +  + 
Subjt:  NVDAAFCEDLFRAGAGV-VVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVP

Query:  AHLQFRFNFTRRDGNRVAHELARFALRER
          LQ  ++FT+R GN +AH LAR AL+ +
Subjt:  AHLQFRFNFTRRDGNRVAHELARFALRER

A0A6J1DAR4 uncharacterized protein LOC111018954 | 1.3e-27 | 23.31
Query:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLK------------------------
        RI  +P S LSRVLKGRYF    F++A +   PSYIWRS++WGR+LL KG+RWRIGNG+ V +YG NW+P++  LK                        
Subjt:  RIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLK------------------------

Query:  --------------------------------------------ADFMLNLLRD-------------------------VKNKVD---------------
                                                    + + + LL +                         + NK+                
Subjt:  --------------------------------------------ADFMLNLLRD-------------------------VKNKVD---------------

Query:  ------------------------------------WVK-----------------------FEELVVVLWAVWCCRNQQKFR-----------------
                                            W+                        FEEL VV+W +W  RN + F                  
Subjt:  ------------------------------------WVK-----------------------FEELVVVLWAVWCCRNQQKFR-----------------

Query:  ----------------GRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV
                        GRV +   + W   +  ++++N DA+F      AG G+++ ++  +VM +A    +++ S+D+AE +A ++G++L  E+G+ P 
Subjt:  ----------------GRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPV

Query:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWME
                        A+ DLSE G +V +A+      L   FNF +R+GN+ AH LAR AL       WME
Subjt:  ILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWME

A0A6P6W2N9 uncharacterized protein LOC113729068 | 1.8e-24 | 25.13
Query:  MARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG
        M +FWW      R IHW                                      P+S L +V K +YFP+ +F  AG+GSRPS+ WR +   R+ L  G
Subjt:  MARFWWNGDKDNRRIHW-------------------------------------YPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKG

Query:  IRWRIGNGE-KVRVYGSNWIPDETCL-KADFMLNLLRDVKNKVDWVKFEELVVVLWAVWCCRNQQKFRGRVPS-AGLVDWAVNNLVVFR-----------
         RWR+GNG   ++ +  +++  +  L  A  +++ L ++  K+   + E + V+LW +W  RN   F G       LV ++++ L  FR           
Subjt:  IRWRIGNGE-KVRVYGSNWIPDETCL-KADFMLNLLRDVKNKVDWVKFEELVVVLWAVWCCRNQQKFRGRVPS-AGLVDWAVNNLVVFR-----------

Query:  ---------------------VNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQD
                              N D A   +   +G GVV+RD     M   A     + + ++ E  A    + L++++ L  +ILE D ++V  +LQ 
Subjt:  ---------------------VNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGLAPVILETDSMRVYMLLQD

Query:  TAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALR-ERGCEYWMELVPACVCLLVEEDVLA
        T   D S  G+LV +  R++ +   +  ++  R  N VAH +A++A     GC  W +  P  +   +  D+++
Subjt:  TAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALR-ERGCEYWMELVPACVCLLVEEDVLA

SwissProt top hits | e value | % identity | Alignment
P93295 Uncharacterized mitochondrial protein AtMg00310 | 5.6e-15 | 48.1
Query:  KDNRRIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCL
        K + RI   P + LSR+L+ RYFP    ++  VG+RPSY WRS+I GRELL +G+   IG+G   +V+   WI DET L
Subjt:  KDNRRIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCL

Arabidopsis top hits | e value | % identity | Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein | 1.4e-05 | 46
Query:  LKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRV
        +K RYF     LDA V  + SY W SL+ G  LL KG R  IG+G+ +R+
Subjt:  LKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRV

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.0e-11 | 40.62
Query:  PTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWI
        P S +++V K RYF   D L+A +GSRPS++W+S+   +E+L +G R  +GNGE + ++   W+
Subjt:  PTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 4.0e-16 | 48.1
Query:  KDNRRIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCL
        K + RI   P + LSR+L+ RYFP    ++  VG+RPSY WRS+I GRELL +G+   IG+G   +V+   WI DET L
Subjt:  KDNRRIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCL


Sequences
CDS sequence
ATGGCGCGATTCTGGTGGAATGGGGATAAGGACAATAGAAGAATCCATTGGTATCCTACTTCGTTTCTTTCCCGTGTGTTGAAGGGGCGGTATTTTCCTAGTGGT
GATTTCCTGGATGCAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGTCTGATTTGGGGGAGGGAGCTATTGGGAAAGGGTATTCGCTGGAGGATTGGGAAT
GGGGAGAAAGTTAGGGTGTATGGGTCTAATTGGATTCCAGATGAAACTTGCCTTAAGGCGGATTTTATGCTAAATCTGCTTAGGGATGTGAAGAATAAGGTCGAT
TGGGTTAAGTTTGAGGAGCTTGTTGTAGTGTTGTGGGCTGTGTGGTGTTGCCGAAACCAACAGAAGTTTAGAGGGCGGGTTCCTTCAGCAGGGCTTGTGGATTGG
GCTGTGAATAATCTTGTTGTTTTTCGAGTGAATGTGGATGCTGCTTTTTGTGAGGACCTGTTTCGGGCGGGTGCTGGAGTTGTTGTTCGGGATGAAGCGAGGCGA
GTCATGTTGTCGGCTGCTGTTAGTCATGATCATGTGGGGAGTTTGGATTTGGCAGAGGGGCTGGCGTTGATGGACGGAATGAGACTTGTGGTGGAGATGGGTTTA
GCTCCGGTAATCCTTGAGACTGATTCTATGCGGGTATATATGCTGCTGCAAGATACTGCGATGGTGGATTTATCTGAGCTCGGTGTGCTGGTTTCTGAGGCTCGA
AGGGAGGTGCCTGCACATCTTCAGTTCAGATTCAATTTTACAAGAAGGGATGGAAATCGTGTCGCCCATGAGTTAGCACGTTTTGCTTTGAGAGAAAGAGGCTGT
GAGTATTGGATGGAATTGGTACCTGCGTGTGTCTGTCTTTTGGTTGAGGAAGATGTTTTAGCTCTGTAA
mRNA sequence
ATGGCGCGATTCTGGTGGAATGGGGATAAGGACAATAGAAGAATCCATTGGTATCCTACTTCGTTTCTTTCCCGTGTGTTGAAGGGGCGGTATTTTCCTAGTGGT
GATTTCCTGGATGCAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGTCTGATTTGGGGGAGGGAGCTATTGGGAAAGGGTATTCGCTGGAGGATTGGGAAT
GGGGAGAAAGTTAGGGTGTATGGGTCTAATTGGATTCCAGATGAAACTTGCCTTAAGGCGGATTTTATGCTAAATCTGCTTAGGGATGTGAAGAATAAGGTCGAT
TGGGTTAAGTTTGAGGAGCTTGTTGTAGTGTTGTGGGCTGTGTGGTGTTGCCGAAACCAACAGAAGTTTAGAGGGCGGGTTCCTTCAGCAGGGCTTGTGGATTGG
GCTGTGAATAATCTTGTTGTTTTTCGAGTGAATGTGGATGCTGCTTTTTGTGAGGACCTGTTTCGGGCGGGTGCTGGAGTTGTTGTTCGGGATGAAGCGAGGCGA
GTCATGTTGTCGGCTGCTGTTAGTCATGATCATGTGGGGAGTTTGGATTTGGCAGAGGGGCTGGCGTTGATGGACGGAATGAGACTTGTGGTGGAGATGGGTTTA
GCTCCGGTAATCCTTGAGACTGATTCTATGCGGGTATATATGCTGCTGCAAGATACTGCGATGGTGGATTTATCTGAGCTCGGTGTGCTGGTTTCTGAGGCTCGA
AGGGAGGTGCCTGCACATCTTCAGTTCAGATTCAATTTTACAAGAAGGGATGGAAATCGTGTCGCCCATGAGTTAGCACGTTTTGCTTTGAGAGAAAGAGGCTGT
GAGTATTGGATGGAATTGGTACCTGCGTGTGTCTGTCTTTTGGTTGAGGAAGATGTTTTAGCTCTGTAA
Protein sequence
MARFWWNGDKDNRRIHWYPTSFLSRVLKGRYFPSGDFLDAGVGSRPSYIWRSLIWGRELLGKGIRWRIGNGEKVRVYGSNWIPDETCLKADFMLNLLRDVKNKVD
WVKFEELVVVLWAVWCCRNQQKFRGRVPSAGLVDWAVNNLVVFRVNVDAAFCEDLFRAGAGVVVRDEARRVMLSAAVSHDHVGSLDLAEGLALMDGMRLVVEMGL
APVILETDSMRVYMLLQDTAMVDLSELGVLVSEARREVPAHLQFRFNFTRRDGNRVAHELARFALRERGCEYWMELVPACVCLLVEEDVLAL
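
The protein sequence can be related back to the CDS by translating it codon by codon. The Python sketch below illustrates this with the standard genetic code. It assumes the gene is nuclear-encoded and uses the standard code, which this record does not state, and the `translate` helper is a hypothetical name rather than part of any database tooling. Only two short in-frame fragments copied from the CDS above are used, to keep the example small.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 nt and last 9 nt of the CDS listed in this record.
print(translate("ATGGCGCGATTCTGGTGG"))  # MARFWW
print(translate("GCTCTGTAA"))           # AL
```

Both fragments agree with the start (`MARFWW...`) and end (`...AL`) of the protein sequence shown above. For routine work a maintained library would normally be preferable to a hand-built codon table.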