CuGenDBv2

Lag0019890 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019890
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: chr5:46448461..46454759
RNA-Seq Expression: Lag0019890
Synteny: Lag0019890
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
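
The genome location in this record has the form chromosome:start..end. As a minimal sketch, the Python below splits such a string and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names parse_location and span_length are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:46448461..46454759")
gene_span = span_length(start, end)
print(chrom, start, end, gene_span)
```

If the coordinates were 0-based and half-open instead, the length would be end - start, without the added 1.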


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica] (e value: 3.3e-68; % identity: 33.14)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDLE+ NQALLAKQCWRIL+ P+SL+AR+   RY PS  F+EA VG  PSFIWRSL WG+ELL KG+RW+VG+G  I++Y   WLP  +  +I S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVG--QMCLLAQVPSSSSGDSMCGWWTGCWRMNL
        P L +  RV DL T++ QWNV LL+  F   EV  IL IP+      D  +W YE++G YSVKSGYR+   +   ++  PS+   D    +W   W + +
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVG--QMCLLAQVPSSSSGDSMCGWWTGCWRMNL

Query:  P--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI
        P                    +F +++ PT           SVLH  WLC+  + V  ++ +G + +    +    L   ++ +   E       L WG+
Subjt:  P--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI

Query:  WNCRNKVRFHGAGPA--------TELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAAT
        WN RN   F G            T+L    +   +    + G++        +  L  W  PPA           K      G G+V+RN  GE MAA  
Subjt:  WNCRNKVRFHGAGPA--------TELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAAT

Query:  -RVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETD---------SKRVYEILQGE-RDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA
         R+H   G +   E  A ++GL+ A+D+G    ++E D         S   Y  + G   +E++ L N     +  W TP         R GN +AH LA
Subjt:  -RVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETD---------SKRVYEILQGE-RDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA

Query:  SFALVEGQNLVWVEDVPECVRDMMLADI
         FA    + + W+E+ P  +  ++ AD+
Subjt:  SFALVEGQNLVWVEDVPECVRDMMLADI

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 1.1e-74; % identity: 34.23)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDLE+ NQALLAKQCWRIL+ P+SL+AR+   RY PS  F+EA VG  PSFIWRSL WG+ELL KG+RW+VGNG  I++Y   WLP  +F +I S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSG---DSMCGWWTGCWRMN
        P L +   V DL T++ QWNV LL+  F   EV   L IP+      D  +W YE++G YSVKSGYR+   CL     S       D    +W   W + 
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSG---DSMCGWWTGCWRMN

Query:  LP--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWG
        +P                    +F +++ PT           SVLH  WLC+  + V  ++ +G + +    +    L   ++ +   E       L WG
Subjt:  LP--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWG

Query:  IWNCRNKVRFHG-AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRA--RRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHE
        +WN RN   F G +  AT+L          F           G ++  +  L  W  PPA  +K+NVD   K      G G+V+RN  GE MAA  R  +
Subjt:  IWNCRNKVRFHG-AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRA--RRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHE

Query:  YVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKR-VYEILQGER---------DELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALV
            +   E  A ++GL+ A+D+G    ++E D++  +  IL  E          +E++ L +     +  W TP         R GN +AH LA FA  
Subjt:  YVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKR-VYEILQGER---------DELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALV

Query:  EGQNLVWVEDVPECVRDMMLADI
          + + W+E+ P  +  ++ AD+
Subjt:  EGQNLVWVEDVPECVRDMMLADI

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 7.4e-92; % identity: 36.69)
Query:  CQNVLGPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFM
        C+  +G   RDLE  N+ALLAKQCWRIL +P+S+L+RVL+GRYF   SFMEA +   PS+IWRS++WGR+LL KG+RW++GNG+ + IY  NW+P    +
Subjt:  CQNVLGPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFM

Query:  RICSLPSLGVGARVADLITATR-QWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQM---CLLAQVPSSSSGDSMCGWWT
        +I S P L + +RV+ L+      W  D+++  F+P E   ILSIPI R   +D  +W YEK+G YSV+SGY+V  +   C+  Q PSSSS + +  WW 
Subjt:  RICSLPSLGVGARVADLITATR-QWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQM---CLLAQVPSSSSGDSMCGWWT

Query:  GCWRMNLP---------MFMQRVPTSV-----------------------LHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDL
        G W+M++P         + + R+PT                         +H+FW+CK+   +  ++ FG L         FL+LR+  ++L    FE+L
Subjt:  GCWRMNLP---------MFMQRVPTSV-----------------------LHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDL

Query:  VVLLWGIWNCRNKVRFHGAGPAT-----ELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVM
         V++WG+WN RN   F+ +         EL  WA  Y   FR  K   +   G       + W  P    +K+N DA+F    + AG G++I N  G+VM
Subjt:  VVLLWGIWNCRNKVRFHGAGPAT-----ELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVM

Query:  AAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEG
        AAAT+  E +   D+AE  AAV+GL+LA ++G+ P L                ++LSE   ++  A   W       F+F  REGN  AH LA  AL+  
Subjt:  AAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEG

Query:  QNLVWVEDVP---------ECVRDMM
        +  +W+ED P         EC+ +++
Subjt:  QNLVWVEDVP---------ECVRDMM

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] (e value: 1.6e-75; % identity: 34.06)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDL S NQAL+AKQ WRI+Q P SL+ARVL+ RYF  + FM A +G +PSF+WRS++WGR++L KG RW++GNG+ + +Y +NW+P  T  +  S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLP-
        PS+G    VA+LI   +QW  DL+ QHF P +   I+ IP+ +   +D  +W Y+K G YSVKSGY+V       + PS S+ D     W   W++ +P 
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLP-

Query:  ---MFMQR-----VPT-----------------------SVLHVFWLCKYTRNVLSDAGFG-FLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGIW
           +F+ R     +PT                       +V H    C   R +   +     L      DI+++L    R    +E  E +  LLW IW
Subjt:  ---MFMQR-----VPT-----------------------SVLHVFWLCKYTRNVLSDAGFG-FLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGIW

Query:  NCRNKVRFHGAGP-ATELPTWAAGYVSSFRCVKGQE-VMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVG
          RNK  F G       +   A   V SF+ ++  E V +  G A R+  +WS PP  W K+NVDA    E + AG G+V+R+  G   AAA +     G
Subjt:  NCRNKVRFHGAGP-ATELPTWAAGYVSSFRCVKGQE-VMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVG

Query:  DSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVPE
           +AE  A   GLK+A    +   + E+DS  V +++  +   L+E+  L++D   +       +   S R+ N  AH LA  AL + + ++W++++P 
Subjt:  DSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVPE

Query:  CVRDMMLA
         +  + L+
Subjt:  CVRDMMLA

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis] (e value: 8.2e-67; % identity: 32.35)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLP-CDTFMRICS
        G   R+    NQAL+AKQ WR+LQ P+SL++RVL+ RYF +SSF+ A  G   S+IWRS++WGR+++ KGMRW++GNG++I I+  NWLP  +TF  I  
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLP-CDTFMRICS

Query:  LPSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSS-GDSMCGWWTGCWRMNL
        L SL V + VADLI A  QW+   L+QHF   + + IL IP+     +D  +W Y+K G YSVKSGY   Q+ L ++ P S+S  ++   +W+  W + L
Subjt:  LPSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSS-GDSMCGWWTGCWRMNL

Query:  P----MFMQRVPTSVL----------------------------HVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI
        P    +FM R   ++L                            H    CK  R +   + F       ++  +F  L+++   L     E +V L W  
Subjt:  P----MFMQRVPTSVL----------------------------HVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI

Query:  WNCRNKVRFHG--AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYV
        W  RNK  F G    P       A   +++F+ V+  +        + +  +W  PP + FK+NVDA F  +   AG G VIR+  G+++AA    +   
Subjt:  WNCRNKVRFHG--AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYV

Query:  GDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVP
        G + LAE  A + GL+LA +  +  ++IE+D   V +++   +   SE+   +            +  +   R  N  AH LA  AL +    +W+ ++P
Subjt:  GDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVP

Query:  ECVRDMM
          + D +
Subjt:  ECVRDMM

TrEMBL top hits (e value, % identity, alignment)
A0A1S8ACU2 Ribonuclease H-like superfamily protein (e value: 7.7e-71; % identity: 31.53)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RD+ S NQAL+AKQ WRI++ P+SL+A++L+ +YF  + F++A +G +PSF+WRS+IWGR+++  GMRW++G G+R+KIY+S+W+P     +  S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLP-
        P+LG+   VA+LI   ++W   L+QQHF+  +  LI  I +      D  +W Y+K G YSVKSGY++         PSSSS  S  G W   W + LP 
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLP-

Query:  ---MFMQR-----VPTS-----------------------VLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGIWN
           +FM +     +PTS                       + H   +CK  + +     F      L    L  +L+++ +    +    ++ L W  W+
Subjt:  ---MFMQR-----VPTS-----------------------VLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGIWN

Query:  CRNKVRFHGAGPATELP-TWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDS
         RN   F       ++    A   V S+  V+  +V         +  KW  PP   FK NVDA   KE  + G G+VIR+ +G ++ AA    +Y GD 
Subjt:  CRNKVRFHGAGPATELP-TWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDS

Query:  DLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPT-PWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVP
          AE  A   GL++ ++  L P+++ETD + V + +   +   +E+   +++      +    +      R  N +AH LA  ALV  ++ VW  ++P
Subjt:  DLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPT-PWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVP

A0A5E4FZN9 PREDICTED: retrotransposon (e value: 5.2e-75; % identity: 34.23)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDLE+ NQALLAKQCWRIL+ P+SL+AR+   RY PS  F+EA VG  PSFIWRSL WG+ELL KG+RW+VGNG  I++Y   WLP  +F +I S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSG---DSMCGWWTGCWRMN
        P L +   V DL T++ QWNV LL+  F   EV   L IP+      D  +W YE++G YSVKSGYR+   CL     S       D    +W   W + 
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSG---DSMCGWWTGCWRMN

Query:  LP--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWG
        +P                    +F +++ PT           SVLH  WLC+  + V  ++ +G + +    +    L   ++ +   E       L WG
Subjt:  LP--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWG

Query:  IWNCRNKVRFHG-AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRA--RRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHE
        +WN RN   F G +  AT+L          F           G ++  +  L  W  PPA  +K+NVD   K      G G+V+RN  GE MAA  R  +
Subjt:  IWNCRNKVRFHG-AGPATELPTWAAGYVSSFRCVKGQEVMRDGGRA--RRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHE

Query:  YVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKR-VYEILQGER---------DELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALV
            +   E  A ++GL+ A+D+G    ++E D++  +  IL  E          +E++ L +     +  W TP         R GN +AH LA FA  
Subjt:  YVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKR-VYEILQGER---------DELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALV

Query:  EGQNLVWVEDVPECVRDMMLADI
          + + W+E+ P  +  ++ AD+
Subjt:  EGQNLVWVEDVPECVRDMMLADI

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 3.6e-92; % identity: 36.69)
Query:  CQNVLGPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFM
        C+  +G   RDLE  N+ALLAKQCWRIL +P+S+L+RVL+GRYF   SFMEA +   PS+IWRS++WGR+LL KG+RW++GNG+ + IY  NW+P    +
Subjt:  CQNVLGPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFM

Query:  RICSLPSLGVGARVADLITATR-QWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQM---CLLAQVPSSSSGDSMCGWWT
        +I S P L + +RV+ L+      W  D+++  F+P E   ILSIPI R   +D  +W YEK+G YSV+SGY+V  +   C+  Q PSSSS + +  WW 
Subjt:  RICSLPSLGVGARVADLITATR-QWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQM---CLLAQVPSSSSGDSMCGWWT

Query:  GCWRMNLP---------MFMQRVPTSV-----------------------LHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDL
        G W+M++P         + + R+PT                         +H+FW+CK+   +  ++ FG L         FL+LR+  ++L    FE+L
Subjt:  GCWRMNLP---------MFMQRVPTSV-----------------------LHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDL

Query:  VVLLWGIWNCRNKVRFHGAGPAT-----ELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVM
         V++WG+WN RN   F+ +         EL  WA  Y   FR  K   +   G       + W  P    +K+N DA+F    + AG G++I N  G+VM
Subjt:  VVLLWGIWNCRNKVRFHGAGPAT-----ELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVM

Query:  AAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEG
        AAAT+  E +   D+AE  AAV+GL+LA ++G+ P L                ++LSE   ++  A   W       F+F  REGN  AH LA  AL+  
Subjt:  AAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEG

Query:  QNLVWVEDVP---------ECVRDMM
        +  +W+ED P         EC+ +++
Subjt:  QNLVWVEDVP---------ECVRDMM

A0A803QQT2 Uncharacterized protein (e value: 2.7e-71; % identity: 32.61)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDL   NQALLAKQ WR L++P  L +RVL+  YFP    +EA  G   SF+WRSL+WG++L+ KG RW+VGNGE +++ E  WLP     ++   
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLPM
        PSL     V DL  A  QW+   ++  F+PT+V LIL IP   +  +D  +W Y K G YSVKSGYR+       Q    S+  S+  WW   WR+ +P 
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLPM

Query:  FMQ---------------------------------RVPTSVLHVFWLCKYTRNVLSDAG-FGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI
         ++                                  V  SV H  W CK ++     +G +  L   L  D L +L+R +      E+ E  +++ W I
Subjt:  FMQ---------------------------------RVPTSVLHVFWLCKYTRNVLSDAG-FGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI

Query:  WNCRNKVRFHGAGP-ATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVG
        WN RN V   G  P   E+  W   +++ FR   G    R+  +   E  +W  P  D   +NVDA  K+    +G G V+R+ AG V++AA  V +   
Subjt:  WNCRNKVRFHGAGP-ATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVG

Query:  DSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVPE
             E  A   G+++ +   L    +ETD  +   ++Q + +   ++  LL    A       +  SF +RE N +AH LA++ALV   + +W+  +P 
Subjt:  DSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVPE

Query:  CVRDMMLAD
        C R  +L D
Subjt:  CVRDMMLAD

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) (e value: 1.6e-68; % identity: 33.14)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL
        G   RDLE+ NQALLAKQCWRIL+ P+SL+AR+   RY PS  F+EA VG  PSFIWRSL WG+ELL KG+RW+VG+G  I++Y   WLP  +  +I S 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSL

Query:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVG--QMCLLAQVPSSSSGDSMCGWWTGCWRMNL
        P L +  RV DL T++ QWNV LL+  F   EV  IL IP+      D  +W YE++G YSVKSGYR+   +   ++  PS+   D    +W   W + +
Subjt:  PSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVG--QMCLLAQVPSSSSGDSMCGWWTGCWRMNL

Query:  P--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI
        P                    +F +++ PT           SVLH  WLC+  + V  ++ +G + +    +    L   ++ +   E       L WG+
Subjt:  P--------------------MFMQRV-PT-----------SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGI

Query:  WNCRNKVRFHGAGPA--------TELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAAT
        WN RN   F G            T+L    +   +    + G++        +  L  W  PPA           K      G G+V+RN  GE MAA  
Subjt:  WNCRNKVRFHGAGPA--------TELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAAT

Query:  -RVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETD---------SKRVYEILQGE-RDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA
         R+H   G +   E  A ++GL+ A+D+G    ++E D         S   Y  + G   +E++ L N     +  W TP         R GN +AH LA
Subjt:  -RVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETD---------SKRVYEILQGE-RDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA

Query:  SFALVEGQNLVWVEDVPECVRDMMLADI
         FA    + + W+E+ P  +  ++ AD+
Subjt:  SFALVEGQNLVWVEDVPECVRDMMLADI

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 3.9e-19; % identity: 21.95)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPS---SSFMEAPVGYRPSFIWRSLIWG-RELLGKGMRWQVGNGERIKIYESNWLPCDTFMR
        G  +R  +S N+AL++K  WR+LQ  +SL   VL+ +Y       S    P G   S  WRS+  G R+++  G+ W  G+G++I+ +   W+     + 
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPS---SSFMEAPVGYRPSFIWRSLIWG-RELLGKGMRWQVGNGERIKIYESNWLPCDTFMR

Query:  ICS--LPSLGVGARVADLITATRQWNVDLLQQH-FSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGC
        + +   P+        DL    R W+   +  +  + T + L   +     G  D   W++ + G++SV+S Y   +M  + +VP      +M  ++   
Subjt:  ICS--LPSLGVGARVADLITATRQWNVDLLQQH-FSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGC

Query:  WRMNLPMFMQRVPT-----------------------------------SVLHVFWLCK-----YTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGL
        W++ +P   +RV T                                   S+LHV   C      + R V      GF F +   + L+  L D      +
Subjt:  WRMNLPMFMQRVPT-----------------------------------SVLHVFWLCK-----YTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGL

Query:  ERFEDLVVLLWGIW------------NCRNKVRFHGAGPATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAG
               V++W  W             CR++V+F        +  WA   V  +R   G  ++         ++ W +P   W K+N D   +     A 
Subjt:  ERFEDLVVLLWGIW------------NCRNKVRFHGAGPATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAG

Query:  SGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNG
        +G V+R+  G        ++     +  AE +    GL  A +  +  + +E DS+ +   L+    +   LS L+          W +R    YRE N 
Subjt:  SGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNG

Query:  LAHRLASFALVEGQNLVWVEDVPECVRDMMLAD
        LA  LA++A          + VP+ +  ++  D
Subjt:  LAHRLASFALVEGQNLVWVEDVPECVRDMMLAD

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 1.7e-19; % identity: 50.54)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDT
        G   RDL   NQALLAKQ +RI+  P +LL+R+L  RYFP SS ME  VG RPS+ WRS+I GRELL +G+   +G+G   K++   W+  +T
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDT

Arabidopsis top hits (e value, % identity, alignment)
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1) (e value: 7.8e-07; % identity: 44.44)
Query:  VADLITA-TRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQ
        V DLI   T  W +D LQ    P ++ LIL I   R  + D F W + KSG Y+VKSGY V +
Subjt:  VADLITA-TRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQ

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 9.5e-13; % identity: 27.78)
Query:  SVLHVFWLCKYTRNVLSDAGF-----GFLFDRLHADILFLLLRDVRDALGLERFEDLVV-LLWGIWNCRNKVRFHGAG-PATELPTWAAGYVSSFRCVKG
        +V H+ + C + R V + +       G   D L+A++ ++L  +V +   L +  +LV  LLW +W  RN++ F G    A E+   A      +   + 
Subjt:  SVLHVFWLCKYTRNVLSDAGF-----GFLFDRLHADILFLLLRDVRDALGLERFEDLVV-LLWGIWNCRNKVRFHGAG-PATELPTWAAGYVSSFRCVKG

Query:  QEVMRDGGRARREL-VKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAV----DLGLFPILIETD
         E    G +  R L V+W APP  W K N DAT++ E  R G G ++RN +G V+    R      +   AE    ++ L+ AV          I+ E+D
Subjt:  QEVMRDGGRARREL-VKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAV----DLGLFPILIETD

Query:  SKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA
        ++ +  +L  + D    L   L D          ++F F+ R GN +A R+A
Subjt:  SKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLA

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 1.7e-22; % identity: 23.61)
Query:  LEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWL------PCDTFMRICSLPSLGVGARVADLITATRQWNVDLLQQH
        ++ RYF   S ++A V  + S+ W SL+ G  LL KG R  +G+G+ I+I   N +      P +T      +    +  R          W+   + Q 
Subjt:  LEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWL------PCDTFMRICSLPSLGVGARVADLITATRQWNVDLLQQH

Query:  FSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLPMF-----------------MQRVPT---
           ++   I  I + +    D  +W Y  +G Y+V+SGY +        +P+ +         T  W  NLP+                   +R+ T   
Subjt:  FSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLLAQVPSSSSGDSMCGWWTGCWRMNLPMF-----------------MQRVPT---

Query:  --------------SVLHVFWLCKYTRNV--LSDAGF--GFLFDRLHADILFLLLRDVRDALGLERFEDL--VVLLWGIWNCRNKV---RFHGAGPATEL
                      S+ H  + C +      LSD+      L      + +  +L  V+D   +  F  L  V L+W IW  RN V   +F  +   T L
Subjt:  --------------SVLHVFWLCKYTRNV--LSDAGF--GFLFDRLHADILFLLLRDVRDALGLERFEDL--VVLLWGIWNCRNKV---RFHGAGPATEL

Query:  PTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAVD
           A  +         ++      +     ++W  PPA + K N DA F  +   A  G +IRNH G  ++  +    +  +   AE  A +  L+    
Subjt:  PTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAVD

Query:  LGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPW-PLRFSFSYREGNGLAHRLASF
         G   + +E D + +  ++ G     S L+N L D I+ W   +  ++F F  R+GN LAH LA +
Subjt:  LGLFPILIETDSKRVYEILQGERDELSELSNLLTDAIADWPTPW-PLRFSFSYREGNGLAHRLASF

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.9e-45; % identity: 29.62)
Query:  NLSCQNVLGP-WLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWL--
        +LSC    G    +D+E+ N ALL KQ WR+L  P+SL+A+V + RYF  S  + AP+G RPSF+W+S+   +E+L +G R  VGNGE I I+   WL  
Subjt:  NLSCQNVLGP-WLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWL--

Query:  -PCDTFMRICSLP-----SLGVGARVADLITAT-RQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRV-GQMCLLAQVPSSS
         P    +R+  +P     S+    +V+DLI  + R+W  D+++  F   E  LI  +      + DS+ W Y  SG Y+VKSGY V  Q+      P   
Subjt:  -PCDTFMRICSLP-----SLGVGARVADLITAT-RQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRV-GQMCLLAQVPSSS

Query:  SGDSMCGWWTGCWRMNLPMFMQ-----------------------------RVPT---SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRD
        S  S+   +   W+      +Q                             R P+   +V H+ + C + R   + +          AD +++ L  V +
Subjt:  SGDSMCGWWTGCWRMNLPMFMQ-----------------------------RVPT---SVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRD

Query:  ALG-----LERFEDLVV-LLWGIWNCRNKVRFHGAG-PATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELV-KWSAPPADWFKLNVDATFKKECKRAG
         LG      E+   LV  LLW +W  RN++ F G    A E+   A   +  +R     E      +  R    +W  PP  W K N DAT+ ++ +R G
Subjt:  ALG-----LERFEDLVV-LLWGIWNCRNKVRFHGAG-PATELPTWAAGYVSSFRCVKGQEVMRDGGRARRELV-KWSAPPADWFKLNVDATFKKECKRAG

Query:  SGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAV-DLGLFP---ILIETDSKRVYEILQGE------RDELSELSNLLTDAIADWPTPWPLR
         G V+RN  GEV     R    +     AE    ++ ++ AV  L  F    ++ E+DS+ + EIL  +      +  + +L  LL+           ++
Subjt:  SGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAV-DLGLFP---ILIETDSKRVYEILQGE------RDELSELSNLLTDAIADWPTPWPLR

Query:  FSFSYREGNGLAHRLASFAL
        F F  REGN LA R+A  +L
Subjt:  FSFSYREGNGLAHRLASFAL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.2e-20; % identity: 50.54)
Query:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDT
        G   RDL   NQALLAKQ +RI+  P +LL+R+L  RYFP SS ME  VG RPS+ WRS+I GRELL +G+   +G+G   K++   W+  +T
Subjt:  GPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGRELLGKGMRWQVGNGERIKIYESNWLPCDT


Sequences
CDS sequence
ATGAGCGATCTCTCGTATCCATATGGGATGACATTCTGGCTATGGTGGCCTTATGGACGACCACTTTTAAAGCATTCTCTAACTACTGTGCTTCCCATATTGCCTCAAAT
TGGAAATCTTTCATGCCAAAATGTTTTGGGGCCTTGGCTTAGGGATTTGGAGTCCTCCAACCAGGCGCTGTTGGCAAAGCAGTGTTGGAGGATTTTACAAAATCCAGATT
CGCTGCTAGCTCGTGTGTTGGAAGGGAGGTATTTCCCTTCATCGAGTTTCATGGAAGCCCCAGTGGGGTATCGCCCTTCCTTCATCTGGAGGAGTTTAATCTGGGGGAGA
GAGTTGCTGGGAAAAGGTATGAGATGGCAAGTTGGTAATGGTGAGCGGATTAAAATTTATGAATCTAACTGGTTGCCATGTGATACATTTATGAGGATTTGCTCTTTGCC
TTCTTTAGGGGTTGGAGCCCGTGTGGCTGATTTGATTACAGCGACGAGGCAATGGAATGTGGACTTGCTGCAACAACATTTCAGCCCGACTGAGGTAAGTCTTATCTTAT
CTATCCCTATTAGACGTTTTGGTGTGGATGATTCCTTTGTGTGGCAATATGAGAAATCTGGCAGATACTCCGTTAAGAGTGGGTATCGTGTTGGGCAGATGTGTTTGCTG
GCTCAAGTTCCTTCATCTTCTTCGGGGGACTCGATGTGTGGTTGGTGGACCGGCTGTTGGAGAATGAATCTTCCAATGTTTATGCAAAGGGTGCCAACATCAGTTTTGCA
TGTTTTTTGGCTCTGTAAATACACAAGAAATGTCCTGAGTGATGCTGGGTTTGGGTTCCTGTTTGATAGGCTTCATGCTGACATTTTATTCTTGCTACTAAGGGATGTGA
GGGATGCTTTGGGGCTGGAACGGTTTGAGGATCTGGTGGTCCTTTTATGGGGCATTTGGAATTGCAGGAATAAGGTGAGGTTTCATGGGGCTGGGCCAGCGACGGAGTTG
CCAACGTGGGCTGCTGGTTATGTTTCTTCTTTCCGATGTGTGAAGGGCCAAGAAGTGATGAGAGATGGTGGCAGAGCTAGAAGGGAGCTCGTGAAGTGGTCAGCACCGCC
AGCGGATTGGTTTAAACTAAACGTGGACGCGACTTTTAAGAAGGAGTGCAAACGGGCTGGTTCGGGGTTGGTTATTCGGAACCACGCCGGAGAGGTCATGGCAGCTGCGA
CGCGAGTTCACGAGTATGTAGGCGACTCTGATTTGGCTGAAGGTTTTGCGGCAGTTGATGGCTTGAAGCTGGCTGTTGATTTGGGCCTTTTCCCGATCTTGATTGAAACT
GATTCGAAGCGGGTTTATGAAATTTTGCAGGGGGAAAGAGACGAACTATCTGAGTTGAGCAACCTGCTTACAGACGCTATTGCTGATTGGCCGACACCTTGGCCATTGCG
ATTTAGTTTCTCCTACCGTGAAGGAAATGGTCTTGCCCATCGTCTTGCGAGTTTTGCATTAGTTGAGGGTCAGAACCTTGTGTGGGTGGAGGATGTGCCTGAGTGCGTGA
GGGATATGATGTTAGCTGATATTGATTTCCTACGTGGCCTGCAAGATAGAAATCTGCACACTGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAGCGGA
AGGAGGAGAGCAAGAGAGCAAGAGGAAAGTGTAGAGAATAGAGTTCGAGATCTCTTCTTCAGTGGCGAAGAGAGGTTTAAATACCTGCTCGTGCTCCTAGGTTTTTTAGG
AATTCGGAGGCGTTTCGGGATGAACCAGGTGGAACCGAGGTGGCCAGAGGCAGTAGGGACCGAACGGAGGCAGAGGCGACCCTTTGGTCGGTTCTTCCTCCGGGTTCCGT
TTCCTAGCTGTCTCCTTGGGTTGGTTTTATTTGTTGTGATGCTCATTCTTACTTCTTCCCAATTGGCGTTTGGAAGACATTTTGCTGATGCTACTCCTCATAGATCTTGT
CATAACAAGAACAACCATAGGGATAAGGTGCATCTTTCAAGAGTTCTATTGGCTTCCCAAAAAGATGTCCCAAGAAACGGTAACTATGGAGGTGGAGGAGGGAACCCGTC
AAGTTAA
mRNA sequence
ATGAGCGATCTCTCGTATCCATATGGGATGACATTCTGGCTATGGTGGCCTTATGGACGACCACTTTTAAAGCATTCTCTAACTACTGTGCTTCCCATATTGCCTCAAAT
TGGAAATCTTTCATGCCAAAATGTTTTGGGGCCTTGGCTTAGGGATTTGGAGTCCTCCAACCAGGCGCTGTTGGCAAAGCAGTGTTGGAGGATTTTACAAAATCCAGATT
CGCTGCTAGCTCGTGTGTTGGAAGGGAGGTATTTCCCTTCATCGAGTTTCATGGAAGCCCCAGTGGGGTATCGCCCTTCCTTCATCTGGAGGAGTTTAATCTGGGGGAGA
GAGTTGCTGGGAAAAGGTATGAGATGGCAAGTTGGTAATGGTGAGCGGATTAAAATTTATGAATCTAACTGGTTGCCATGTGATACATTTATGAGGATTTGCTCTTTGCC
TTCTTTAGGGGTTGGAGCCCGTGTGGCTGATTTGATTACAGCGACGAGGCAATGGAATGTGGACTTGCTGCAACAACATTTCAGCCCGACTGAGGTAAGTCTTATCTTAT
CTATCCCTATTAGACGTTTTGGTGTGGATGATTCCTTTGTGTGGCAATATGAGAAATCTGGCAGATACTCCGTTAAGAGTGGGTATCGTGTTGGGCAGATGTGTTTGCTG
GCTCAAGTTCCTTCATCTTCTTCGGGGGACTCGATGTGTGGTTGGTGGACCGGCTGTTGGAGAATGAATCTTCCAATGTTTATGCAAAGGGTGCCAACATCAGTTTTGCA
TGTTTTTTGGCTCTGTAAATACACAAGAAATGTCCTGAGTGATGCTGGGTTTGGGTTCCTGTTTGATAGGCTTCATGCTGACATTTTATTCTTGCTACTAAGGGATGTGA
GGGATGCTTTGGGGCTGGAACGGTTTGAGGATCTGGTGGTCCTTTTATGGGGCATTTGGAATTGCAGGAATAAGGTGAGGTTTCATGGGGCTGGGCCAGCGACGGAGTTG
CCAACGTGGGCTGCTGGTTATGTTTCTTCTTTCCGATGTGTGAAGGGCCAAGAAGTGATGAGAGATGGTGGCAGAGCTAGAAGGGAGCTCGTGAAGTGGTCAGCACCGCC
AGCGGATTGGTTTAAACTAAACGTGGACGCGACTTTTAAGAAGGAGTGCAAACGGGCTGGTTCGGGGTTGGTTATTCGGAACCACGCCGGAGAGGTCATGGCAGCTGCGA
CGCGAGTTCACGAGTATGTAGGCGACTCTGATTTGGCTGAAGGTTTTGCGGCAGTTGATGGCTTGAAGCTGGCTGTTGATTTGGGCCTTTTCCCGATCTTGATTGAAACT
GATTCGAAGCGGGTTTATGAAATTTTGCAGGGGGAAAGAGACGAACTATCTGAGTTGAGCAACCTGCTTACAGACGCTATTGCTGATTGGCCGACACCTTGGCCATTGCG
ATTTAGTTTCTCCTACCGTGAAGGAAATGGTCTTGCCCATCGTCTTGCGAGTTTTGCATTAGTTGAGGGTCAGAACCTTGTGTGGGTGGAGGATGTGCCTGAGTGCGTGA
GGGATATGATGTTAGCTGATATTGATTTCCTACGTGGCCTGCAAGATAGAAATCTGCACACTGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAGCGGA
AGGAGGAGAGCAAGAGAGCAAGAGGAAAGTGTAGAGAATAGAGTTCGAGATCTCTTCTTCAGTGGCGAAGAGAGGTTTAAATACCTGCTCGTGCTCCTAGGTTTTTTAGG
AATTCGGAGGCGTTTCGGGATGAACCAGGTGGAACCGAGGTGGCCAGAGGCAGTAGGGACCGAACGGAGGCAGAGGCGACCCTTTGGTCGGTTCTTCCTCCGGGTTCCGT
TTCCTAGCTGTCTCCTTGGGTTGGTTTTATTTGTTGTGATGCTCATTCTTACTTCTTCCCAATTGGCGTTTGGAAGACATTTTGCTGATGCTACTCCTCATAGATCTTGT
CATAACAAGAACAACCATAGGGATAAGGTGCATCTTTCAAGAGTTCTATTGGCTTCCCAAAAAGATGTCCCAAGAAACGGTAACTATGGAGGTGGAGGAGGGAACCCGTC
AAGTTAA
Protein sequence
MSDLSYPYGMTFWLWWPYGRPLLKHSLTTVLPILPQIGNLSCQNVLGPWLRDLESSNQALLAKQCWRILQNPDSLLARVLEGRYFPSSSFMEAPVGYRPSFIWRSLIWGR
ELLGKGMRWQVGNGERIKIYESNWLPCDTFMRICSLPSLGVGARVADLITATRQWNVDLLQQHFSPTEVSLILSIPIRRFGVDDSFVWQYEKSGRYSVKSGYRVGQMCLL
AQVPSSSSGDSMCGWWTGCWRMNLPMFMQRVPTSVLHVFWLCKYTRNVLSDAGFGFLFDRLHADILFLLLRDVRDALGLERFEDLVVLLWGIWNCRNKVRFHGAGPATEL
PTWAAGYVSSFRCVKGQEVMRDGGRARRELVKWSAPPADWFKLNVDATFKKECKRAGSGLVIRNHAGEVMAAATRVHEYVGDSDLAEGFAAVDGLKLAVDLGLFPILIET
DSKRVYEILQGERDELSELSNLLTDAIADWPTPWPLRFSFSYREGNGLAHRLASFALVEGQNLVWVEDVPECVRDMMLADIDFLRGLQDRNLHTGVVLATPPPMLKSESG
RRRAREQEESVENRVRDLFFSGEERFKYLLVLLGFLGIRRRFGMNQVEPRWPEAVGTERRQRRPFGRFFLRVPFPSCLLGLVLFVVMLILTSSQLAFGRHFADATPHRSC
HNKNNHRDKVHLSRVLLASQKDVPRNGNYGGGGGNPSS