; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr7:5292866..5295223
RNA-Seq ExpressionLag0021174
SyntenyLag0021174
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC73134.1 hypothetical protein OsI_07152 [Oryza sativa Indica Group]5.4e-1526.09Show/hide
Query:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLV-WHYDKLGR-----------------------
        ++ DPWI + ++ +P+ + RN     V D + P G WD + +    L  DVE I KI IS+  E+  V W  D+LGR                       
Subjt:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLV-WHYDKLGR-----------------------

Query:  -YTVKSGW-----FDISSKV------------------SKEDLGLVAVTCWAIWMDKNKFVHE---DPIP---PANIRSQW--------ILEYLRNSEGA
         Y +K GW      ++  KV                   K + G+V      +W+      ++    P P     NI   +        I   LR S G 
Subjt:  -YTVKSGW-----FDISSKV------------------SKEDLGLVAVTCWAIWMDKNKFVHE---DPIP---PANIRSQW--------ILEYLRNSEGA

Query:  IIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL
        +I ++  F      A  +EL A   G+  A       I +ESDC   I  L    ++ S    +V EI  +    KEV F  V R  NRV+ +LA + + 
Subjt:  IIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL

Query:  SKCSDFWDNNFPDWLSLLVEND
            +FW ++  + +S LV  D
Subjt:  SKCSDFWDNNFPDWLSLLVEND

PAN10731.2 hypothetical protein PAHAL_2G114100 [Panicum hallii]8.4e-1625.96Show/hide
Query:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLVA
        ++ DPWI +E + +P     N L  +V   I P +G+WD   + Q     D   I  IP+    ED   WHYD  G ++V+S  +    ++ +     V 
Subjt:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLVA

Query:  VTCWAIWMDKNKF---------VH-----------------EDPIPPANIRSQWILE------------------------YLRNSEGAIIEASSSFSPS
           W  W+++N+          VH                 + P+  A    +W+                           +R+ +G +I+A +   P 
Subjt:  VTCWAIWMDKNKF---------VH-----------------EDPIPPANIRSQWILE------------------------YLRNSEGAIIEASSSFSPS

Query:  VPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMAD-ELKEVKFQFVPRKGNRVADYLA
        + EA  AEL A L G+R A  LG +K+IIE+D  +A   L  +S   ++  G+V EI ++ +     V   F PR+ N+VA  +A
Subjt:  VPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMAD-ELKEVKFQFVPRKGNRVADYLA

XP_010678308.1 PREDICTED: uncharacterized protein LOC104893877 [Beta vulgaris subsp. vulgaris]4.6e-1426.64Show/hide
Query:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLV
        +++DPW+L +          + +N+ V D I   +  W++  +D      D   I  +P+S    +D+L W + K G YTVK+                 
Subjt:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLV

Query:  AVTCWAIWMDKNKFV-HEDPIPPANIRSQWILE-------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLT
           CW       K+V   + +   N+ +   +E         RN  G I+ ++S  S +      AE KA+ MG+R  +  G   +IIESDCQ+ +N L+
Subjt:  AVTCWAIWMDKNKFV-HEDPIPPANIRSQWILE-------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLT

Query:  KSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVENDRSSL
        KS    S  + ++ +I + +     V++  V R GN VA +LAK          W+N+ P  ++  V  D  SL
Subjt:  KSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVENDRSSL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.4e-1820.57Show/hide
Query:  FEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSG------------------
        F DPW+ +   FKP+  +   L+  V  FIT  G+WD++ +  +  + D + I  +PIS+++ +D  +WHYDK G Y+V+SG                  
Subjt:  FEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSG------------------

Query:  -------------------------------------------------------------------------------------------WFDISSKVS
                                                                                                   W  ++ ++ 
Subjt:  -------------------------------------------------------------------------------------------WFDISSKVS

Query:  KEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYL-----------------------------------RNSEGAIIEASSSFSPSVPEAPC-
         +DL L A+T W IW D+N  +H   + P   + +W+  +L                                    N++ A   AS+SF   + ++ C 
Subjt:  KEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYL-----------------------------------RNSEGAIIEASSSFSPSVPEAPC-

Query:  -----------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL
                         AE++ IL G++ A A     + +ESD  +AI  +          +  V EI A+      + F    R+ NR A  LAK    
Subjt:  -----------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL

Query:  SKCSDF-WDNNFPDWLSLLVEND
        S  + + W  NFP WL  LV+ D
Subjt:  SKCSDF-WDNNFPDWLSLLVEND

XP_042969199.1 uncharacterized protein LOC122301911 [Carya illinoinensis]3.5e-1423.21Show/hide
Query:  MFEDPWILKEINFKPVC-IDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDIS--------SKVS
        +F+DPW+   ++   +  +D N     ++D  T  G W++  +      + ++ I K+ IS +SED L W ++K G ++VKS +  +            S
Subjt:  MFEDPWILKEINFKPVC-IDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDIS--------SKVS

Query:  KEDLGLVAVTCWAIWMDKNKFVH-----EDPIPP-ANIRSQWILE--------------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQAL
             +   + W + + K   +      ++ +P   N++ +++L+               LRN  G ++ A S     V  A   E  A+L G++     
Subjt:  KEDLGLVAVTCWAIWMDKNKFVH-----EDPIPP-ANIRSQWILE--------------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQAL

Query:  GCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVENDRSSL
        G  K+++++DC I +N L ++SE  +    ++++I  +    +EVK   V R GN VA  LA+   L      W +  P ++S  +  D+ ++
Subjt:  GCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVENDRSSL

TrEMBL top hitse value%identityAlignment
A0A2S3GXJ0 RNase H domain-containing protein4.1e-1625.96Show/hide
Query:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLVA
        ++ DPWI +E + +P     N L  +V   I P +G+WD   + Q     D   I  IP+    ED   WHYD  G ++V+S  +    ++ +     V 
Subjt:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITP-SGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLVA

Query:  VTCWAIWMDKNKF---------VH-----------------EDPIPPANIRSQWILE------------------------YLRNSEGAIIEASSSFSPS
           W  W+++N+          VH                 + P+  A    +W+                           +R+ +G +I+A +   P 
Subjt:  VTCWAIWMDKNKF---------VH-----------------EDPIPPANIRSQWILE------------------------YLRNSEGAIIEASSSFSPS

Query:  VPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMAD-ELKEVKFQFVPRKGNRVADYLA
        + EA  AEL A L G+R A  LG +K+IIE+D  +A   L  +S   ++  G+V EI ++ +     V   F PR+ N+VA  +A
Subjt:  VPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMAD-ELKEVKFQFVPRKGNRVADYLA

A0A6J1DX30 uncharacterized protein LOC1110248746.7e-1920.57Show/hide
Query:  FEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSG------------------
        F DPW+ +   FKP+  +   L+  V  FIT  G+WD++ +  +  + D + I  +PIS+++ +D  +WHYDK G Y+V+SG                  
Subjt:  FEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSG------------------

Query:  -------------------------------------------------------------------------------------------WFDISSKVS
                                                                                                   W  ++ ++ 
Subjt:  -------------------------------------------------------------------------------------------WFDISSKVS

Query:  KEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYL-----------------------------------RNSEGAIIEASSSFSPSVPEAPC-
         +DL L A+T W IW D+N  +H   + P   + +W+  +L                                    N++ A   AS+SF   + ++ C 
Subjt:  KEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYL-----------------------------------RNSEGAIIEASSSFSPSVPEAPC-

Query:  -----------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL
                         AE++ IL G++ A A     + +ESD  +AI  +          +  V EI A+      + F    R+ NR A  LAK    
Subjt:  -----------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL

Query:  SKCSDF-WDNNFPDWLSLLVEND
        S  + + W  NFP WL  LV+ D
Subjt:  SKCSDF-WDNNFPDWLSLLVEND

A0A6J1DZK3 uncharacterized protein LOC1110249683.8e-1426.67Show/hide
Query:  VSKEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYLR------------------------------NSEGAIIEASSSFSPSVPEAPC----
        +S  +L L  +TCWA+W D++  +++  IP A I+ +WIL+Y                                N++ A+ E  S     + E       
Subjt:  VSKEDLGLVAVTCWAIWMDKNKFVHEDPIPPANIRSQWILEYLR------------------------------NSEGAIIEASSSFSPSVPEAPC----

Query:  -------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCS
                     A++ AI  G+  A  LG +++++E+D   A+N +   S     A   VE+I A A + +E+ FQ V R+ N VA +L +     +C 
Subjt:  -------------AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLSKCS

Query:  DFWDNNFPDWLSLLVE-NDRSSLAL
          W  +FP WL  L E   ++S+AL
Subjt:  DFWDNNFPDWLSLLVE-NDRSSLAL

A0A803NTU1 Uncharacterized protein1.7e-1425.22Show/hide
Query:  EDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSGWFDISSKVSKEDL----GL
        EDPWI + +N K +        M+V+D     GDWD   +     S DV  I  +  SN   +DK++WHY K G Y++KSG+   S+ + K +     G+
Subjt:  EDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHS-EDKLVWHYDKLGRYTVKSGWFDISSKVSKEDL----GL

Query:  VAVTC-----------------------------------WAIW-----MDKNKFVHEDPIPPA------------NIRSQWILE---------------
         + T                                    W I+      D  +FV      PA            +++ Q+ ++               
Subjt:  VAVTC-----------------------------------WAIW-----MDKNKFVHEDPIPPA------------NIRSQWILE---------------

Query:  ---------------------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAM
                              +R+  G ++ +SS FS     A  AE  AI+ G++ A A G  K  + SDC  AIN +   S   S  + L+EEI  +
Subjt:  ---------------------YLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAM

Query:  ADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFP
        +     V+F F  R  N +A  LAK A  SK S  W+   P
Subjt:  ADELKEVKFQFVPRKGNRVADYLAKRAKLSKCSDFWDNNFP

B8AHI8 Uncharacterized protein2.6e-1526.09Show/hide
Query:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLV-WHYDKLGR-----------------------
        ++ DPWI + ++ +P+ + RN     V D + P G WD + +    L  DVE I KI IS+  E+  V W  D+LGR                       
Subjt:  MFEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLV-WHYDKLGR-----------------------

Query:  -YTVKSGW-----FDISSKV------------------SKEDLGLVAVTCWAIWMDKNKFVHE---DPIP---PANIRSQW--------ILEYLRNSEGA
         Y +K GW      ++  KV                   K + G+V      +W+      ++    P P     NI   +        I   LR S G 
Subjt:  -YTVKSGW-----FDISSKV------------------SKEDLGLVAVTCWAIWMDKNKFVHE---DPIP---PANIRSQW--------ILEYLRNSEGA

Query:  IIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL
        +I ++  F      A  +EL A   G+  A       I +ESDC   I  L    ++ S    +V EI  +    KEV F  V R  NRV+ +LA + + 
Subjt:  IIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKL

Query:  SKCSDFWDNNFPDWLSLLVEND
            +FW ++  + +S LV  D
Subjt:  SKCSDFWDNNFPDWLSLLVEND

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0730.88Show/hide
Query:  SQWILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRK
        ++WI   +RN +G      S    +      AE KA+L  +++    G  ++I+E DC+   N ++ SS    +A  L+++I   A +   V+F FV R 
Subjt:  SQWILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRK

Query:  GNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVEND
        GN+VA  LAK    S C        P WL     ND
Subjt:  GNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVEND

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-1032.73Show/hide
Query:  WILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGN
        WI   LRN  G ++   +   P       AEL+A+   +         +II ESD Q  +N L  S + W   +  +E+I  +    +EVKF+F PR GN
Subjt:  WILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGN

Query:  RVADYLAKRA
        +VAD +A+ +
Subjt:  RVADYLAKRA

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-0730.6Show/hide
Query:  WILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGN
        WI   +RN  G  I   S           AE KA+L  +++    G  ++ +E DCQ  IN +   S   S+A  L E+I   A++   ++F F+ RKGN
Subjt:  WILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGN

Query:  RVADYLAKRAKLSKCSDFWDNNFPDWLSLLVEND
        ++A  LAK             + P WL     ND
Subjt:  RVADYLAKRAKLSKCSDFWDNNFPDWLSLLVEND

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.8e-0635.71Show/hide
Query:  AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLS
        AE  A+ + ++ AQ++G  K+ + SD Q  I  +T  S       G++ +I  ++    +V F FVPR  NRVAD LAK + +S
Subjt:  AELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADYLAKRAKLS

AT4G29090.1 Ribonuclease H-like superfamily protein3.1e-0828.57Show/hide
Query:  LRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADY
        LRN +G +    +   P +     AEL+A+   +        N +I ESD Q+ I  L  + E+W   +  ++++  +  +  EVKF F+PR+GN +A+ 
Subjt:  LRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQFVPRKGNRVADY

Query:  LAKRA
        +A+ +
Subjt:  LAKRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGAGGACCCCTGGATTCTTAAGGAAATCAATTTTAAACCTGTCTGTATTGATAGAAATTTTTTAAACATGAGGGTTGTGGACTTCATTACGCCTAGTGGGGATTG
GGATATTTCAAAACTAGATCAAGCTGTTCTTAGCAGTGATGTGGAGGCTATTAGAAAGATTCCCATTAGTAATCATTCAGAAGATAAATTGGTGTGGCATTATGATAAAC
TAGGGAGATACACAGTTAAGAGCGGGTGGTTTGATATCAGCTCCAAAGTTTCGAAAGAGGATCTAGGGCTTGTTGCGGTCACCTGTTGGGCTATTTGGATGGACAAAAAC
AAATTTGTTCATGAAGATCCTATTCCTCCAGCTAACATAAGAAGTCAATGGATTCTAGAGTACCTGAGAAACTCTGAAGGTGCCATTATTGAAGCCTCAAGCTCGTTTTC
TCCATCGGTTCCAGAAGCTCCCTGTGCTGAATTAAAAGCTATTTTGATGGGTATTAGAAGAGCTCAAGCTTTGGGATGTAATAAAATCATTATTGAATCCGACTGTCAAA
TCGCAATTAATTTCTTGACTAAATCATCTGAAGTCTGGAGCATTGCAGAAGGCCTTGTAGAAGAAATTTGGGCCATGGCTGATGAGCTCAAGGAAGTTAAATTTCAATTT
GTTCCAAGGAAAGGGAATAGAGTAGCTGATTATCTAGCTAAAAGAGCTAAATTGTCCAAATGTAGTGATTTCTGGGATAATAATTTCCCAGATTGGTTGTCTTTGTTGGT
CGAGAATGACCGTTCATCTTTAGCCCTCTCGCCGGCCAGCCGTCCTTCTCGTGTCCGGTCGCTCTCTGACAAGCTGCATCAGTTCAGACGAGCGGTGGCAGTACGCGTTT
CTTCGATCTTTCTGATCTCTCTCACACGCGCGACATTTTTCATTTCACGAACAGATTTGAAGTTTCATCGTTGCTCTCTTCGAAACTCGCTGGAATCACCGTTGCTCGTG
GATTCTCGTCCGGAACTCGCTGGAATCGCAGATTTGGTCCGCCGCTCGAGGGTCGATTTCGCCGAAACTAAAGGTCTGAAACTCGATAGAATCGCCGCCGCTCGTGGATG
CTCGTCCAGAACTCGCTGGAATCGCAGATCTGGTTCGCCGCTCGCGGTTCGATTTCAAACCCACGTTGCCGCAGGTCGTTCGATTTCAAACCCACGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGAGGACCCCTGGATTCTTAAGGAAATCAATTTTAAACCTGTCTGTATTGATAGAAATTTTTTAAACATGAGGGTTGTGGACTTCATTACGCCTAGTGGGGATTG
GGATATTTCAAAACTAGATCAAGCTGTTCTTAGCAGTGATGTGGAGGCTATTAGAAAGATTCCCATTAGTAATCATTCAGAAGATAAATTGGTGTGGCATTATGATAAAC
TAGGGAGATACACAGTTAAGAGCGGGTGGTTTGATATCAGCTCCAAAGTTTCGAAAGAGGATCTAGGGCTTGTTGCGGTCACCTGTTGGGCTATTTGGATGGACAAAAAC
AAATTTGTTCATGAAGATCCTATTCCTCCAGCTAACATAAGAAGTCAATGGATTCTAGAGTACCTGAGAAACTCTGAAGGTGCCATTATTGAAGCCTCAAGCTCGTTTTC
TCCATCGGTTCCAGAAGCTCCCTGTGCTGAATTAAAAGCTATTTTGATGGGTATTAGAAGAGCTCAAGCTTTGGGATGTAATAAAATCATTATTGAATCCGACTGTCAAA
TCGCAATTAATTTCTTGACTAAATCATCTGAAGTCTGGAGCATTGCAGAAGGCCTTGTAGAAGAAATTTGGGCCATGGCTGATGAGCTCAAGGAAGTTAAATTTCAATTT
GTTCCAAGGAAAGGGAATAGAGTAGCTGATTATCTAGCTAAAAGAGCTAAATTGTCCAAATGTAGTGATTTCTGGGATAATAATTTCCCAGATTGGTTGTCTTTGTTGGT
CGAGAATGACCGTTCATCTTTAGCCCTCTCGCCGGCCAGCCGTCCTTCTCGTGTCCGGTCGCTCTCTGACAAGCTGCATCAGTTCAGACGAGCGGTGGCAGTACGCGTTT
CTTCGATCTTTCTGATCTCTCTCACACGCGCGACATTTTTCATTTCACGAACAGATTTGAAGTTTCATCGTTGCTCTCTTCGAAACTCGCTGGAATCACCGTTGCTCGTG
GATTCTCGTCCGGAACTCGCTGGAATCGCAGATTTGGTCCGCCGCTCGAGGGTCGATTTCGCCGAAACTAAAGGTCTGAAACTCGATAGAATCGCCGCCGCTCGTGGATG
CTCGTCCAGAACTCGCTGGAATCGCAGATCTGGTTCGCCGCTCGCGGTTCGATTTCAAACCCACGTTGCCGCAGGTCGTTCGATTTCAAACCCACGATGA
Protein sequenceShow/hide protein sequence
MFEDPWILKEINFKPVCIDRNFLNMRVVDFITPSGDWDISKLDQAVLSSDVEAIRKIPISNHSEDKLVWHYDKLGRYTVKSGWFDISSKVSKEDLGLVAVTCWAIWMDKN
KFVHEDPIPPANIRSQWILEYLRNSEGAIIEASSSFSPSVPEAPCAELKAILMGIRRAQALGCNKIIIESDCQIAINFLTKSSEVWSIAEGLVEEIWAMADELKEVKFQF
VPRKGNRVADYLAKRAKLSKCSDFWDNNFPDWLSLLVENDRSSLALSPASRPSRVRSLSDKLHQFRRAVAVRVSSIFLISLTRATFFISRTDLKFHRCSLRNSLESPLLV
DSRPELAGIADLVRRSRVDFAETKGLKLDRIAAARGCSSRTRWNRRSGSPLAVRFQTHVAAGRSISNPR