; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010034 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010034
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:17298481..17304197
RNA-Seq ExpressionHG10010034
SyntenyHG10010034
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEE53976.1 hypothetical protein OsJ_00596 [Oryza sativa Japonica Group]2.1e-6725.29Show/hide
Query:  EEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTGVQ
        E+DE ++ VD   A    +T  +  IGK           +F   +KAW +  G++ K L  N FL  F            GPW +     ++ E     +
Subjt:  EEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTGVQ

Query:  PPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKI-NLEGTSGGRWVAMRYERLPELCSFCGI
        P ++    +P WV  + L L      M + +G  +G       ++    W + +RV+    + KP    +K+ + E +S  +   ++YER+P  C +CG 
Subjt:  PPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKI-NLEGTSGGRWVAMRYERLPELCSFCGI

Query:  IGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNCRKNLSSTFEDISVGKGDLFLF
        IGH+ ++C      DL  D            TV  G  +  SP    K  E  S  L G V              D+N ++NL    E+ S         
Subjt:  IGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNCRKNLSSTFEDISVGKGDLFLF

Query:  SHGGGHNIMKFGKDIPLFKD---LNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQHQI
        +  G  N     KD P  K+    N P  + A + E         NF  ++ + ++ +    ++   L  +NP  +    +L       R    + +H  
Subjt:  SHGGGHNIMKFGKDIPLFKD---LNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQHQI

Query:  RGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAK------
                 L  +H  +     LR   L   +                + D +K +      L +   + S+L    +  S+D     D +GA       
Subjt:  RGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAK------

Query:  --------GGLGLLWKDDIQIDLRSFSIHHIDLKVTWND-KVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREEL
                 GL L W D +Q+DL S++  +ID+ +      V W    +YG P+   RH+ W LLRR+ ++ ++ W + GD NE +   E      R E 
Subjt:  --------GGLGLLWKDDIQIDLRSFSIHHIDLKVTWND-KVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREEL

Query:  LMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHA
         + DFR  +  C    LG+ G  +T+ NK+   N V +RLDR +A+  + S+FP + V +L  + SDH PI L  S V+    ++G R  ++   W    
Subjt:  LMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHA

Query:  NCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWG--------------AVVNKGL-----SGS---IHKDPWLPRPSSFRVFTPGTQGKRCVVAELI
        +    +K +   +G +++  +++ +LK     LQ W                ++ +GL     SGS   I +D W+PR  + RV     + +   V++L+
Subjt:  NCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWG--------------AVVNKGL-----SGS---IHKDPWLPRPSSFRVFTPGTQGKRCVVAELI

Query:  SP-AGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGK---GSCSNSQSD--WWRHLWKLRIPRKIAIFMWKS
         P    W+ + +  +  P D E I RI +  S   D   WHYDK GMFTV S YK A+  ++        CSN   D   W+++W +++P+KI IF W+ 
Subjt:  SP-AGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGK---GSCSNSQSD--WWRHLWKLRIPRKIAIFMWKS

Query:  CHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSI
          DCL  + +   R +  +P C +C  + E   HA   C++A ++
Subjt:  CHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSI

KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]1.5e-8926Show/hide
Query:  LEEWKRFSLNNEEDERSLDVDRAAAILAE-ETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKK
        +++WK+ S N  EDE  +  D AA    E +    CL+G L  E     +   +  ++ W +  G+ +++L KN+++F F     R+R++K  PW+F K+
Subjt:  LEEWKRFSLNNEEDERSLDVDRAAAILAE-ETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKK

Query:  ILILQEPLTGVQPP-EMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIF-----DDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRW
        +L LQE +TG +PP  +    VPFWV    +         A+ +G  +GIF      D+   E    +G  LR       + P + G   +     GG  
Subjt:  ILILQEPLTGVQPP-EMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIF-----DDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRW

Query:  VAMRYERLPELCSFCGI-----IGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSN
          M ++   E     G+     + H+F                       + A V  G+  +   +   +AG +V + L+               +  S 
Subjt:  VAMRYERLPELCSFCGI-----IGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSN

Query:  CRKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSN--FRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWK
           ++S   + I+V           G  ++    K IP    +    +++A     + A V+L N    +K+GK + +   T  K    K    +    K
Subjt:  CRKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSN--FRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWK

Query:  KKLPFWIKEKR--------VHLL----LTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLN----PSLCFL
          +P  I++          VH++    +++   + + W+ +                +P     ++ WN RGL +  A R    +LKRLN    PSL FL
Subjt:  KKLPFWIKEKR--------VHLL----LTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLN----PSLCFL

Query:  SETKCNVSILNKLKNRLSYDGCFCVDSVG----AKGGLGLLWKDDIQIDLRSFSIHHIDLKVT-WNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDS
         ETK   S + +LK +L +D  F VD  G      GGL L W + + + L SFS +HID+ V+     + W   G++G PE   +H TW+LLR L ++  
Subjt:  SETKCNVSILNKLKNRLSYDGCFCVDSVG----AKGGLGLLWKDDIQIDLRSFSIHHIDLKVT-WNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDS

Query:  SAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIEL
          W+  GD NEI+   EK+GG ++ +  M  FR A + C F  +G+ G  FTW N R   + +  RLDR  A  E+L  FP + V ++    SDH  +E+
Subjt:  SAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIEL

Query:  LLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQ--------------------KW------GAVVNKGLSGS
            +      +  R F+F + W     C+ ++  +  W+G+       S KL   A NL                     +W      G  V  G   +
Subjt:  LLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQ--------------------KW------GAVVNKGLSGS

Query:  IHKDPWLPRPSSFRVFTPGTQGKRCV-VAELISPAG-TWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGKGSC
        I +D W+   +  R+ +P       V VA+LI+    TW    +N    P +  +I  +P+S     D ++W ++KH  ++V+S Y + I   +      
Subjt:  IHKDPWLPRPSSFRVFTPGTQGKRCV-VAELISPAG-TWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGKGSC

Query:  SNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCFDIWLNFASH
        ++S    W  +WKL +  KI IF W+ CH+ LP  ++L  R +     CP CR   E   H    C+ A+ +W F S  G+         F  WL+  S 
Subjt:  SNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCFDIWLNFASH

Query:  LNDNELET---TCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGV---EPIEIDHFKSIVLPPLQFDFGHVHTLHVDAA-FSPT---FGGVIGS
        L D  +E       V W+IW D NN++ + K  S+E     +L       +   + +  D   S ++   +    + H L+VDAA  S T    G V+  
Subjt:  LNDNELET---TCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGV---EPIEIDHFKSIVLPPLQFDFGHVHTLHVDAA-FSPT---FGGVIGS

Query:  KSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQ---------MLKGTTGIYLEVSNYIEDIKCCLSTFRSVVSRSSNQAA
          G++    A   + L     AEA A+  GL LV +L + NL V +D L   +         ++         +SNY +           +V R  NQ A
Subjt:  KSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQ---------MLKGTTGIYLEVSNYIEDIKCCLSTFRSVVSRSSNQAA

Query:  HLLARDGSSGVSLLRLNSFPDMLSHLYLGDV
        H +A+   +    + L++FP+ +  L L DV
Subjt:  HLLARDGSSGVSLLRLNSFPDMLSHLYLGDV

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]1.7e-7230.13Show/hide
Query:  LYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKV-TWNDKVSW-F
        L WN +GL + W  R + + +K+ +P   FL ET+     + K+K  L +     V+SVG  GGL LLW+ D+ I L+S+S +HID+ + T  ++  W  
Subjt:  LYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKV-TWNDKVSW-F

Query:  IGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLAN
         G+YGHPE   +  TW L+R L  S S  WV  GD NEI   EEK G + +    M  F+ A+ DC  +GLG+ GNTFTWCNKR+    V  RLDR +A 
Subjt:  IGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLAN

Query:  VEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWGAVVNKGLSG
         ++  +FP + V++L    SDH P+ LL    E+    +  R F+F   W     C +I+  +  W G     A +                        
Subjt:  VEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWGAVVNKGLSG

Query:  SIHKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKY--AILDRLIGKGS
               LPR +   +     Q              TW+   L  + +P + E+I  IP+S+    D  VWH+   G F+VRS Y     + DR     S
Subjt:  SIHKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKY--AILDRLIGKGS

Query:  CSNSQSDW--------WRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCF
         ++S S W        W  +W+L IP K+ IF+WK   + LP + +L  R +  E  C +C  + E++ H L  C  AR +W   S  G +  +      
Subjt:  CSNSQSDW--------WRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCF

Query:  DIWLN--FASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRC-----DWILDYANGVG-VEPIEIDHFKSIVLPPLQFDFGHVHTLHVDAAF---
          W+     SH  +  L    ++AWSIW   N +I S  V    + C       + D+ N      P  I   +S + PP     G +  +++D A    
Subjt:  DIWLN--FASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRC-----DWILDYANGVG-VEPIEIDHFKSIVLPPLQFDFGHVHTLHVDAAF---

Query:  --SPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQML
          S   G V+   +G+++A ++       S    EA A  +GL+    LG+  +++ SDS++ I+ L
Subjt:  --SPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQML

KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis]6.4e-7228.37Show/hide
Query:  DGLLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFY
        D L   W    L NEE+E ++ +D       +      LIGK+  +  +  +V+ + F K W++   + ++ ++ N F+  F   A + RV    PWFF 
Subjt:  DGLLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFY

Query:  KKILILQEPLTGVQP-PEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAM
          + ++  P  G  P  EM F    FWV F +L L   N+    KLG+ IG  ++ E +E+ + WG SLRV+I++D+ KP  RG  I L G     W  +
Subjt:  KKILILQEPLTGVQP-PEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAM

Query:  RYERLPELCSFCGIIGH------LFKDCNDFYKSDLKQDNPHQSTPE--KLAATVSGGSA---MEISPLPEAKA----------GEAVSKNLEGGVSFFP
        +YE++P  C  CG I H      + KD +  + S L+ ++P +   E  K  +T  G S+   ME + + +  A          G  +S ++        
Subjt:  RYERLPELCSFCGIIGH------LFKDCNDFYKSDLKQDNPHQSTPE--KLAATVSGGSA---MEISPLPEAKA----------GEAVSKNLEGGVSFFP

Query:  DSMDVTV-AIND---SNCRKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQK
        D + V + +IND   ++C K        +     DL    + GG  I             +E     A     LG         ++ G  ++ L P   +
Subjt:  DSMDVTV-AIND---SNCRKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQK

Query:  PAHLKGKNPKATAWKKKLPFWIKEKRVHLL----LTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSL
              + P     +     W K+ R   L    L +  +R +    L L         + L  + +L    + WN RGL +    + +SD ++   P +
Subjt:  PAHLKGKNPKATAWKKKLPFWIKEKRVHLL----LTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSL

Query:  CFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVT---WNDKVSWFIGLYGHPETHLRHHTWNLLRRLHNSDS
         FL ETK N  ++ ++K +L +  C  V   G  GG+ L W +  ++++++FS  HI  KVT    N +  W  G YG+ +   RH +WNLLR L     
Subjt:  CFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVT---WNDKVSWFIGLYGHPETHLRHHTWNLLRRLHNSDS

Query:  SAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIEL
          W++ GD NEILSN EK GG  + E  M  FR  +D+C    LG+ GN FTWCN+R   + +  RLDRFL+N+++ S +P + V +   A SDH PI L
Subjt:  SAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIEL

Query:  LLSGVESFYGRQGSRP--FQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKW
         L+        QG R   F+F   W    +C+++++++        + + +  K++ C   L  W
Subjt:  LLSGVESFYGRQGSRP--FQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKW

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.5e-8524.4Show/hide
Query:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKID-SGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYK
        LLEEWK F L +EE+E ++DVD +A           L+GKL  +  I   VM N  R AWK++ +  +V+ L  N+FLFSF +   R ++ K GPW F +
Subjt:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKID-SGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYK

Query:  KILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRY
         ++++ +P+  + P E+DF  +P WV FF L L C  R MA +LGN +G F++ + ++    WG +LRV++++DISKP RRGIK+NL+G  GG W+ ++Y
Subjt:  KILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRY

Query:  ERLPELCSFCGIIGHLFKDCNDF---YKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVT--------------
        ERLP+ C  CG+     K        Y+  +K   P    P++     SG ++   S  P     + V      G    P    VT              
Subjt:  ERLPELCSFCGIIGHLFKDCNDF---YKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVT--------------

Query:  --VAINDSNCRKNL-----------------------SSTFEDISVGKGDLF--LFSH--------------------------GGGHNIMKFGKDIPLF
          V I++   R N+                       S T  D+S G    F   + H                          GG  N + +  +    
Subjt:  --VAINDSNCRKNL-----------------------SSTFEDISVGKGDLF--LFSH--------------------------GGGHNIMKFGKDIPLF

Query:  KDLN-------EPVMDLASI--------------NEFLGAQV--SLSNF-------------------RTKEGKTKSSLGPTSQKPAHLKGKN-------
           +         +MD  S+              N F G Q+   L  F                    T       S+  +S    H    N       
Subjt:  KDLN-------EPVMDLASI--------------NEFLGAQV--SLSNF-------------------RTKEGKTKSSLGPTSQKPAHLKGKN-------

Query:  ---PKAT---AWKKKLP----------------------FWIKEKR-------------------VHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRS
            KA    A+ + LP                      FW +  R                   ++L+ T+        L          +  R++  +
Subjt:  ---PKAT---AWKKKLP----------------------FWIKEKR-------------------VHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRS

Query:  --------PALNYENLYWNVRG---LESSWAFREVSDELKRLNP---------------------SLCFLSETKCNVSILNKLKNRLS------------
                PAL Y+  YW+V G   LE+        D++K+ N                      SLC +S    + SI N+LKN +             
Subjt:  --------PALNYENLYWNVRG---LESSWAFREVSDELKRLNP---------------------SLCFLSETKCNVSILNKLKNRLS------------

Query:  --------YDGCFCVDSVGAKGGLGLLWKDDIQIDL-RSFS------IHHIDLKVTWND-----------KVSWFIGLYGHPETHLRHHTWNLLRRLHNS
                  G  C+ ++ +    GL+    +++DL ++F       +  I  K+ +N+            V + I L G P        +   R +   
Subjt:  --------YDGCFCVDSVGAKGGLGLLWKDDIQIDL-RSFS------IHHIDLKVTWND-----------KVSWFIGLYGHPETHLRHHTWNLLRRLHNS

Query:  DSSA----WVVGGDLNEILSNEEKEG---GLSREE--------LLMDD----FRLAMDDCAFMG--LGYFGNTFTWCNKRNYPNQVCL----------RL
        D  +     +    L+ ++++E   G   G+  EE        L  DD     R    +C  +   L  +G     C   N+     L          + 
Subjt:  DSSA----WVVGGDLNEILSNEEKEG---GLSREE--------LLMDD----FRLAMDDCAFMG--LGYFGNTFTWCNKRNYPNQVCL----------RL

Query:  DRFLANVEFLSIFPD------------SRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDH-WRL--HANC-------EKIVKESVNWEGSVTN
         + + NV+ +S F +               + L W K           G+ +F   +G        H WR   H N         K  K++   + S  +
Subjt:  DRFLANVEFLSIFPD------------SRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDH-WRL--HANC-------EKIVKESVNWEGSVTN

Query:  PATISLKLKVCARNLQKWGAVVNKGLSGSI--HKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWV
         ++   K  +  R+L   G  +  G   +I    DPWLPRP++F+            VA  I+  G WD+  ++      D ++I  +PIS     D+W+
Subjt:  PATISLKLKVCARNLQKWGAVVNKGLSGSI--HKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWV

Query:  WHYDKHGMFTVRSGYKYAILDRLIGKGSCSNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARS
        WHYDK G ++VRSGYK  +  +     + +N +   W  +WKL +P KI IF+W+S H+ +P   +L  R +   P C +C  ++ES+ HA F C RAR 
Subjt:  WHYDKHGMFTVRSGYKYAILDRLIGKGSCSNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARS

Query:  IW-DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGVEPIE--IDHFKSIVLPPLQF--
        IW   F F         +S  ++W +    L   +L    I  W IW D N+ IH K+V  +E++C+W+  + +      +       +S   P +Q+  
Subjt:  IW-DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGVEPIE--IDHFKSIVLPPLQF--

Query:  -DFGHVHTLHVDAA---FSPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDI
                L+ DAA    S +FG +I   S  ++A  +       S L AE   + +GL+        +L V SDSL  IQ+++       +  N++ +I
Subjt:  -DFGHVHTLHVDAA---FSPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDI

Query:  K---CCLSTFR-SVVSRSSNQAAHLLARDG--SSGVSLLRLNSFPDMLSHLYLGD
        +   CC +    S  SR  N+AAH LA+ G  S   +   L +FP  L  L   D
Subjt:  K---CCLSTFR-SVVSRSSNQAAHLLARDG--SSGVSLLRLNSFPDMLSHLYLGD

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248744.1e-8524.4Show/hide
Query:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKID-SGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYK
        LLEEWK F L +EE+E ++DVD +A           L+GKL  +  I   VM N  R AWK++ +  +V+ L  N+FLFSF +   R ++ K GPW F +
Subjt:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKID-SGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYK

Query:  KILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRY
         ++++ +P+  + P E+DF  +P WV FF L L C  R MA +LGN +G F++ + ++    WG +LRV++++DISKP RRGIK+NL+G  GG W+ ++Y
Subjt:  KILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRY

Query:  ERLPELCSFCGIIGHLFKDCNDF---YKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVT--------------
        ERLP+ C  CG+     K        Y+  +K   P    P++     SG ++   S  P     + V      G    P    VT              
Subjt:  ERLPELCSFCGIIGHLFKDCNDF---YKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVT--------------

Query:  --VAINDSNCRKNL-----------------------SSTFEDISVGKGDLF--LFSH--------------------------GGGHNIMKFGKDIPLF
          V I++   R N+                       S T  D+S G    F   + H                          GG  N + +  +    
Subjt:  --VAINDSNCRKNL-----------------------SSTFEDISVGKGDLF--LFSH--------------------------GGGHNIMKFGKDIPLF

Query:  KDLN-------EPVMDLASI--------------NEFLGAQV--SLSNF-------------------RTKEGKTKSSLGPTSQKPAHLKGKN-------
           +         +MD  S+              N F G Q+   L  F                    T       S+  +S    H    N       
Subjt:  KDLN-------EPVMDLASI--------------NEFLGAQV--SLSNF-------------------RTKEGKTKSSLGPTSQKPAHLKGKN-------

Query:  ---PKAT---AWKKKLP----------------------FWIKEKR-------------------VHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRS
            KA    A+ + LP                      FW +  R                   ++L+ T+        L          +  R++  +
Subjt:  ---PKAT---AWKKKLP----------------------FWIKEKR-------------------VHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRS

Query:  --------PALNYENLYWNVRG---LESSWAFREVSDELKRLNP---------------------SLCFLSETKCNVSILNKLKNRLS------------
                PAL Y+  YW+V G   LE+        D++K+ N                      SLC +S    + SI N+LKN +             
Subjt:  --------PALNYENLYWNVRG---LESSWAFREVSDELKRLNP---------------------SLCFLSETKCNVSILNKLKNRLS------------

Query:  --------YDGCFCVDSVGAKGGLGLLWKDDIQIDL-RSFS------IHHIDLKVTWND-----------KVSWFIGLYGHPETHLRHHTWNLLRRLHNS
                  G  C+ ++ +    GL+    +++DL ++F       +  I  K+ +N+            V + I L G P        +   R +   
Subjt:  --------YDGCFCVDSVGAKGGLGLLWKDDIQIDL-RSFS------IHHIDLKVTWND-----------KVSWFIGLYGHPETHLRHHTWNLLRRLHNS

Query:  DSSA----WVVGGDLNEILSNEEKEG---GLSREE--------LLMDD----FRLAMDDCAFMG--LGYFGNTFTWCNKRNYPNQVCL----------RL
        D  +     +    L+ ++++E   G   G+  EE        L  DD     R    +C  +   L  +G     C   N+     L          + 
Subjt:  DSSA----WVVGGDLNEILSNEEKEG---GLSREE--------LLMDD----FRLAMDDCAFMG--LGYFGNTFTWCNKRNYPNQVCL----------RL

Query:  DRFLANVEFLSIFPD------------SRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDH-WRL--HANC-------EKIVKESVNWEGSVTN
         + + NV+ +S F +               + L W K           G+ +F   +G        H WR   H N         K  K++   + S  +
Subjt:  DRFLANVEFLSIFPD------------SRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDH-WRL--HANC-------EKIVKESVNWEGSVTN

Query:  PATISLKLKVCARNLQKWGAVVNKGLSGSI--HKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWV
         ++   K  +  R+L   G  +  G   +I    DPWLPRP++F+            VA  I+  G WD+  ++      D ++I  +PIS     D+W+
Subjt:  PATISLKLKVCARNLQKWGAVVNKGLSGSI--HKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWV

Query:  WHYDKHGMFTVRSGYKYAILDRLIGKGSCSNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARS
        WHYDK G ++VRSGYK  +  +     + +N +   W  +WKL +P KI IF+W+S H+ +P   +L  R +   P C +C  ++ES+ HA F C RAR 
Subjt:  WHYDKHGMFTVRSGYKYAILDRLIGKGSCSNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARS

Query:  IW-DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGVEPIE--IDHFKSIVLPPLQF--
        IW   F F         +S  ++W +    L   +L    I  W IW D N+ IH K+V  +E++C+W+  + +      +       +S   P +Q+  
Subjt:  IW-DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYANGVGVEPIE--IDHFKSIVLPPLQF--

Query:  -DFGHVHTLHVDAA---FSPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDI
                L+ DAA    S +FG +I   S  ++A  +       S L AE   + +GL+        +L V SDSL  IQ+++       +  N++ +I
Subjt:  -DFGHVHTLHVDAA---FSPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDI

Query:  K---CCLSTFR-SVVSRSSNQAAHLLARDG--SSGVSLLRLNSFPDMLSHLYLGD
        +   CC +    S  SR  N+AAH LA+ G  S   +   L +FP  L  L   D
Subjt:  K---CCLSTFR-SVVSRSSNQAAHLLARDG--SSGVSLLRLNSFPDMLSHLYLGD

A0A7N2LIH6 Uncharacterized protein5.6e-7428.07Show/hide
Query:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKK
        L E WK+ ++   EDE  + +   +   A+E    C++ K+L +  +    +    R  WK   G+++  + +++FL  F     + +VM+  PW + K+
Subjt:  LLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKK

Query:  ILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYE
        ++++QE    + P E+  KW PFWV  F+L L C  R    ++G  IG   + +  E  + WG+ LRV+I  D +    RG K+++EG   GRWV  +YE
Subjt:  ILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYE

Query:  RLPELCSFCGIIGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNC--RKNLSSTF
        RLP  C  CG + H  KDC +    +   D   +     L       S  +   + E    E    + E           V   + +S    R+++S   
Subjt:  RLPELCSFCGIIGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNC--RKNLSSTF

Query:  EDISVGKGD---LFLFSHGG-GHNIMKFGK-----DIPLFKDLNEPVMDLASIN--EFLGAQVSLSNFRTK---EGKTKSSLGPTSQKPAHLKGKNPKAT
         + ++G+ D     L  HGG G   + + K     +   F    E   D  +I+  + L   V+  +   K   E          S++ +  K       
Subjt:  EDISVGKGD---LFLFSHGG-GHNIMKFGK-----DIPLFKDLNEPVMDLASIN--EFLGAQVSLSNFRTK---EGKTKSSLGPTSQKPAHLKGKNPKAT

Query:  AWKKKLPFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLR------------------------------SPALNYENLYWNVRGLESSWAF
           K+   W + +   L +T  Q +G     L     H    W+RL R                              +P  +   L WN RGL +S A 
Subjt:  AWKKKLPFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLR------------------------------SPALNYENLYWNVRGLESSWAF

Query:  REVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTW-NDKVSW-FIGLYGHPETHLRHH
        R ++DE+K+ NP L FL ETK +V  +   +N+L +     V S G  GGL LLWK+   I  +S S  HID+ V        W   G YGHP+T  R+ 
Subjt:  REVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTW-NDKVSW-FIGLYGHPETHLRHH

Query:  TWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQN
        +W LL  L+      W+V GD NEI+  +EK G   R+   MD FR  +  C  + LG+ G  FTWCN R    +  +RLDR +AN  +  +FP+++V +
Subjt:  TWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQN

Query:  LRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPA-TISLKLKVCARNLQKWGAVVNKGLSGSIHK
        +  + SDH  + L L+ V +   R+G + F F + W     C++IV+  + W+    + A  +  +L+ C + LQ+W    N+   G+++K
Subjt:  LRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPA-TISLKLKVCARNLQKWGAVVNKGLSGSIHK

A0A7N2R0C3 Reverse transcriptase domain-containing protein6.4e-7827.92Show/hide
Query:  WKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILIL
        W+R  +  EE+E  L  D      A E    C+  K++    +    +    R  WK +  +++ ++ + +FL  F  E  + RVM   PW + K++++ 
Subjt:  WKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILIL

Query:  QEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPE
        +E      P ++  KW PFWV  ++L L    +   + +G  IG F + + EE  + WG  LRV++ ID+++   RG KIN+E     RWV  +YERLP 
Subjt:  QEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPE

Query:  LCSFCGIIGHLFKDCND--------------------------------FYK----SDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSK--
         C  CG++ H  KDC +                                F K     ++K     ++   K    V  G A E   L     G++  +  
Subjt:  LCSFCGIIGHLFKDCND--------------------------------FYK----SDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSK--

Query:  -NLEGGVSFFPDSMDVTVAINDSNCRKNL--SSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPV--MDLASINEFLGAQ----VSLSNFRT
         +L GG          TV   ++   +++  S + E + VG+ +  + +  GG    + G+     ++LN+P+   +    NE +       + L   + 
Subjt:  -NLEGGVSFFPDSMDVTVAINDSNCRKNL--SSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPV--MDLASINEFLGAQ----VSLSNFRT

Query:  KEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREV
        K+G       P     A+  G  P +  WK+ +          +  +   ++ KR   L L  + + V   +  ++P      L WN RG+ S+ A R +
Subjt:  KEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREV

Query:  SDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTW-NDKVSW-FIGLYGHPETHLRHHTWN
        +DE+K  +P L FL+ETK +   +  L+ +L       V S G  GGL +LW++ + + L+S S  HID+ V   N  V W   G YGHP+  +R  +W 
Subjt:  SDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTW-NDKVSW-FIGLYGHPETHLRHHTWN

Query:  LLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRW
        LL  L    +  WVV GD NEIL+++EK G L R+   M+ FR  + +C  + LG+ G  FTWCN R    +  +RLDR +AN E++++FP+++V +   
Subjt:  LLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRW

Query:  AKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNP-ATISLKLKVCARNLQKWGAVVNKGLSGSIHK
        A SDH  + L +   E+   +   R F F + W     C ++++ +  W+    NP  TI  +LK C   LQ W    N+ + G+++K
Subjt:  AKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNP-ATISLKLKVCARNLQKWGAVVNKGLSGSIHK

A0A803QI56 Uncharacterized protein1.3e-9124.88Show/hide
Query:  EEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTGVQ
        +E+E SL     A  L E    WCL+G+ L    +    M N     W+   G+ VK L  N+FLF F  E    RV++  PW F +  LI +    G  
Subjt:  EEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTGVQ

Query:  PPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPELCSFCGII
        P  +    + FWV   +L+       +   LGN +G F   +       W + LRV++ I+I  P +R  K+  +G     +   +YE L   C   GI+
Subjt:  PPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPELCSFCGII

Query:  GHLFKDCNDFYKSDLKQ-DNPH----QSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNCRKNLSSTFEDISVGKGD
        GH  + C   + + L Q   P+    ++TP +    +  G     S       G A S +   G    P + + T+   +++ R++  +     ++G   
Subjt:  GHLFKDCNDFYKSDLKQ-DNPH----QSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNCRKNLSSTFEDISVGKGD

Query:  LFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSL-SNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQH
            +  G  N+M     +P  K+    ++D  S+ E +    +L SN   KE  T   +  T ++   +             L     +   H+ +T  
Subjt:  LFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSL-SNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQH

Query:  QIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLG
        +                       ++ P      L WN RGL +  A + + + + + NP+  FL ETKC+   +  +   L ++G F V++ G+ GGL 
Subjt:  QIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLG

Query:  LLWKDDIQIDLRSFSIHHIDLKVTWNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCA
        LLWK+     +  +S +HID  V  ++K +W   G YG PE + RH +W LLR L    +  W V GDLN I+ +E+K GG    + L+D F+ A++DC 
Subjt:  LLWKDDIQIDLRSFSIHHIDLKVTWNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCA

Query:  FMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWE
         + +   G+ FTW   R   N + +RLDR L N  +  IF  + + NL  + SDH PI L     E  +    +R F+F + W     C +IV++    E
Subjt:  FMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWE

Query:  GSVTNPATISLKLKVCARNLQKWGAVV-------------------NKGLSGSIH---------------------------------------------
        G    P     KL  CA  L  WG  V                   NK  + SI                                              
Subjt:  GSVTNPATISLKLKVCARNLQKWGAVV-------------------NKGLSGSIH---------------------------------------------

Query:  ------KDPWLPRPSSFRVFT-PGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGK
              K+PWLP      V +   + G+  V   ++   G+WD+  LNDL +  D ++I  IP++ S  +D   W Y+  G+++V+SGY   +L ++ G+
Subjt:  ------KDPWLPRPSSFRVFT-PGTQGKRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGK

Query:  GSCSNSQ-SDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCFDIWLN
           +    + +W   WK +IP K+   +W++  +CLP    L+ + +D +  CP+C  + ESV HAL  C++ + +WD     G    +     F  W  
Subjt:  GSCSNSQ-SDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCFDIWLN

Query:  FASHLNDNELETTC-IVAWSIWGDMNNWIHSKK---VPSMEYRCDWILDYANGVGVEPIEI--------DHFKSIVLPPLQFDFGHVHTLHVDAAFSPTF
         A    + E +    ++ W+IW   N+ +  KK     S+       L+         IE         D  +   +P +     +V     D+      
Subjt:  FASHLNDNELETTC-IVAWSIWGDMNNWIHSKK---VPSMEYRCDWILDYANGVGVEPIEI--------DHFKSIVLPPLQFDFGHVHTLHVDAAFSPTF

Query:  GGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDIKCCLSTFRSV----VSRSSNQA
        G V     G ++        G  S   AE+  + + L  ++R    ++ + +D L+++Q ++    +       I++ K  L   +++    V RS+N  
Subjt:  GGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLEVSNYIEDIKCCLSTFRSV----VSRSSNQA

Query:  AHLLAR
        AH  AR
Subjt:  AHLLAR

A0A803QN60 Uncharacterized protein5.1e-7524.67Show/hide
Query:  MADSFFHDGLLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMK
        MA S     +LE  K      EE+   L +  ++         WCL+GKLL       +V  N     W+   G+ VK L+ N+FL  F  E    RV+ 
Subjt:  MADSFFHDGLLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMK

Query:  QGPWFFYKKILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSG
          PW + +K  I      G  P  ++   +  WV   +L        +   LGN IG F + +       W + LRV++ I++ KP +R +KI+++  S 
Subjt:  QGPWFFYKKILILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSG

Query:  GRWVAMRYERLPELCSFCGIIGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPE-AKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNC
          W   +YE++P  C  CGIIGH  K C   +   L       S  +K A T    S      L   A   E+   + E GV+  P              
Subjt:  GRWVAMRYERLPELCSFCGIIGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPE-AKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNC

Query:  RKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKL
         +N+ +T E          L     GH+         +FKD                   SL N      K    +             N     +   +
Subjt:  RKNLSSTFEDISVGKGDLFLFSHGGGHNIMKFGKDIPLFKDLNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKL

Query:  PFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSY
           +  KR     T     G R                     P      L WN RGL +  A + + D + +  PSL FL ET CN S L  ++  L +
Subjt:  PFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLLRSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSY

Query:  DGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTWNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSR
        + CF V++ G  GGL LLWKD  ++ ++ FS HHID  V       W F G+YG P+  LR  TW+L R L N     W + GD+N + S+ EK+GG   
Subjt:  DGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTWNDKVSW-FIGLYGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSR

Query:  EELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWR
         + L++ F  A+DDC  M L   G  FTW   RN    +  RLD+ L    +LS+FP S + NL ++ SDH P+ L+L G   F      R F+  + W 
Subjt:  EELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQNLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWR

Query:  LHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWGAVVN-------KGLSGSIHKDPWLPRPSSF---RVFTPGTQGKRCVVAELISPAGTW---
            CE++V+    W+GS      I  K+  C   L +WG  +              I    W     S    + +      +   +  L    G W   
Subjt:  LHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWGAVVN-------KGLSGSIHKDPWLPRPSSF---RVFTPGTQGKRCVVAELISPAGTW---

Query:  -------------DIKKLNDLLLPVDVEVIR------------RIPISQSGRSDNWVWHYDK------------------------HGMFTVR----SGY
                     D+   + + +   ++ IR            RI   +  R+  +  H DK                          M  +R       
Subjt:  -------------DIKKLNDLLLPVDVEVIR------------RIPISQSGRSDNWVWHYDK------------------------HGMFTVR----SGY

Query:  KYAILDRLIGKG-------SCSNSQS----------------DWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHAL
         Y I  +++          + S +QS                +   +L +    RK     + +  +CL  KV+L  + +  E  CP+C    E+  H L
Subjt:  KYAILDRLIGKG-------SCSNSQS----------------DWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHAL

Query:  FWCSRARSIWDFFSFYGWQFCSKFMSCFDIW-LNFASHLNDNELETTCIVAWSIWGDMNNWI---HSKKVPSMEYRCDWILDY---ANGVG----VEPIE
          C+ A + W+   F G    S+  S    W ++    L D+      ++ W++W   N+ I    S+ V  +    +  LD    A G G    + P++
Subjt:  FWCSRARSIWDFFSFYGWQFCSKFMSCFDIW-LNFASHLNDNELETTCIVAWSIWGDMNNWI---HSKKVPSMEYRCDWILDY---ANGVG----VEPIE

Query:  IDHFKSIVLPPLQFDFGHVHTLHVDAAFSPT-----FGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKG
              + L P          L+VDAA S       FG V+ +  G ++A  AG + G  S   AE   + + L  ++       +V +DSL   + ++ 
Subjt:  IDHFKSIVLPPLQFDFGHVHTLHVDAAFSPT-----FGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKG

Query:  TTGIYLEVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDG-SSGVSLLRLNSFPDMLSHLYLGD
                 + +ED K    +  +V    V RS+N  AH +AR   S    +  +NS P  L  +   D
Subjt:  TTGIYLEVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDG-SSGVSLLRLNSFPDMLSHLYLGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding2.3e-1125.12Show/hide
Query:  NNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTG
        + E++E  + +           +  C+I K+L    I  SV+    R+ WK    + V  L +  F+  F  E      +  GPW      L++Q+  + 
Subjt:  NNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKILILQEPLTG

Query:  VQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYW--GESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPELCSF
          P   D    P WV   ++    ++R +  ++    G+    + + NT+ +  G   RV I ++++KP +  + IN     G R+  + YE L ++CS 
Subjt:  VQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYW--GESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPELCSF

Query:  CGIIGHLFKDC
        CGI GHL   C
Subjt:  CGIIGHLFKDC

AT3G09510.1 Ribonuclease H-like superfamily protein9.3e-2125.06Show/hide
Query:  LKVCARNLQKWGAVVNKGLSGSIHKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGT---WDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKH
        LK   R+L   G  +  GL   +   P  PRP +    T  T  K   +  L    G+   WD  K++  +   D   I RI +++S + D  +W+Y+  
Subjt:  LKVCARNLQKWGAVVNKGLSGSIHKDPWLPRPSSFRVFTPGTQGKRCVVAELISPAGT---WDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKH

Query:  GMFTVRSGYKYAILDRLIGKGSCS--NSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIW--
        G +TVRSGY     D      + +  +   D    +W L I  K+  F+W++    L     L  R +  +P CP C R+ ES++HALF C  A   W  
Subjt:  GMFTVRSGYKYAILDRLIGKGSCS--NSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIW--

Query:  -DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIV--AWSIWGDMNNWIH-------SKKVPSMEYRC-DWILDYANGVGVEPIEIDHFKSIVL-
         D          + F       LNF      ++      V   W IW   NN +        SK V S +    DW L+        P          + 
Subjt:  -DFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIV--AWSIWGDMNNWIH-------SKKVPSMEYRC-DWILDYANGVGVEPIEIDHFKSIVL-

Query:  ---PPLQFDFGHVHTLHVDAAF-----SPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYL
           PP  +        + DA F       T G +I +  G  ++  +       + L AE  A+   L+     G   + +  D  +LI ++ G +  + 
Subjt:  ---PPLQFDFGHVHTLHVDAAF-----SPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYL

Query:  EVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDG
         ++N++EDI    + F S+    + R  N+ AH+LA+ G
Subjt:  EVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDG

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.9e-0634.85Show/hide
Query:  SDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRAR
        ++W   +W L+I  KI + +WK+ ++ LP    L +R +  EP+C  C R  E+++H LF C  A+
Subjt:  SDWWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRAR

AT4G29090.1 Ribonuclease H-like superfamily protein6.9e-1626.1Show/hide
Query:  AVVNKGLSGSIHKDPWL---PRPSSFRVFTPGTQGKRCV-----VAELISPAGTWDIKKLNDLLLPVDVEVIRRI--PISQSGRS--DNWVWHYDKHGMF
        AVV  G    I +  WL   P  ++ R+     Q    V     V++LI  +G    K + ++L P   EV R++   +   GR   D++ W Y   G +
Subjt:  AVVNKGLSGSIHKDPWL---PRPSSFRVFTPGTQGKRCV-----VAELISPAGTWDIKKLNDLLLPVDVEVIRRI--PISQSGRS--DNWVWHYDKHGMF

Query:  TVRSGYKYAILDRLIGKGSCSNSQSD-----WWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDF
        TV+SG  Y +L ++I K S     S+      ++ +WK +   KI  F+WK   + LP   +L  R L  E  C  C   KE+V+H LF C+ AR  W  
Subjt:  TVRSGYKYAILDRLIGKGSCSNSQSD-----WWRHLWKLRIPRKIAIFMWKSCHDCLPNKVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDF

Query:  FSF---YGWQFC-SKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNN----WIHSKKVPSMEYRC-------DW-ILDYANGVGVEP-IEIDHFKS
         S     G ++  S +++ +  W+    + N    + + +V W +W    N        ++  + E          +W I   A   G +P +       
Subjt:  FSF---YGWQFC-SKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNN----WIHSKKVPSMEYRC-------DW-ILDYANGVGVEP-IEIDHFKS

Query:  IVLPPLQFDFGHVHTLHVDAAFSPT-----FGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYL
           PP Q+        + DA ++        G V+ ++ GE+    A     L SVL AE  A+   +  + R     ++  SDS  LI++L     I+ 
Subjt:  IVLPPLQFDFGHVHTLHVDAAFSPT-----FGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYL

Query:  EVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDGSSGVSLLRLNSFPDMLS
         +   I+D++  LS F  V    + R  N  A  +AR+     SL  LN  P + S
Subjt:  EVSNYIEDIKCCLSTFRSV----VSRSSNQAAHLLARDGSSGVSLLRLNSFPDMLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACAGTTTCTTCCATGATGGTCTCCTTGAGGAATGGAAAAGGTTTAGCCTAAATAACGAAGAAGACGAGAGGTCTCTTGATGTGGACCGAGCGGCCGCTATTCT
GGCCGAAGAGACATTCAATTGGTGTCTTATTGGAAAGCTTCTATGTGAATGCTTTATATTTGATTCTGTGATGTTTAATTTTTTTCGGAAGGCTTGGAAAATTGATAGTG
GTCTTAAGGTAAAACTTCTCAGCAAGAATATTTTCCTCTTCTCTTTTTGCCAGGAGGCTATTCGAACAAGAGTTATGAAGCAAGGGCCTTGGTTTTTTTATAAGAAAATC
CTTATCCTTCAAGAGCCATTGACAGGGGTGCAGCCACCGGAGATGGATTTCAAATGGGTGCCTTTTTGGGTACATTTTTTTCACCTATCCCTTTGTTGTTTCAATCGTTT
GATGGCCGAAAAATTGGGGAATGTTATTGGAATTTTTGATGATTTTGAGGGGGAAGAAAACACAATGTATTGGGGCGAGAGTTTGCGAGTTCAGATCCTTATTGACATTT
CAAAGCCAAACCGTAGGGGAATAAAGATTAACCTTGAAGGCACCTCTGGTGGCCGATGGGTGGCGATGCGTTATGAGCGCCTTCCGGAGTTGTGTTCTTTCTGTGGAATA
ATCGGACATCTTTTCAAGGATTGCAACGATTTTTATAAATCTGATCTCAAACAAGACAACCCACATCAGTCGACACCGGAAAAGCTAGCGGCGACAGTTTCAGGTGGTTC
AGCGATGGAGATTTCGCCGTTACCGGAAGCTAAGGCGGGGGAGGCCGTTTCAAAGAATCTTGAAGGAGGTGTTTCGTTCTTTCCAGATTCTATGGATGTTACGGTGGCTA
TTAATGACAGTAATTGCAGGAAAAATTTATCTTCAACTTTTGAGGATATCTCGGTGGGCAAGGGAGATTTATTTCTCTTTTCTCATGGTGGGGGCCACAATATTATGAAA
TTTGGAAAGGATATCCCCCTTTTTAAGGACCTAAATGAGCCAGTAATGGACCTTGCTTCAATCAATGAGTTTTTGGGGGCCCAGGTGTCTTTATCAAATTTTCGGACCAA
AGAGGGGAAGACTAAGTCAAGTCTGGGCCCCACTTCACAAAAACCTGCTCATCTGAAAGGTAAAAACCCAAAGGCAACTGCATGGAAGAAAAAGCTCCCTTTTTGGATAA
AAGAAAAGAGGGTCCACCTCTTGTTAACACAGCACCAGATAAGAGGCAAAAGATGGTTGAGGTTGAGGCTTCATCATATGCACCGAGTGGTGATTTGGCGGAGACTGCTT
CGCAGCCCCGCCTTGAATTATGAAAACCTTTATTGGAATGTCCGTGGATTGGAGAGTTCATGGGCATTCCGAGAGGTGTCGGACGAGCTCAAACGTCTAAATCCCTCGTT
ATGCTTTCTATCGGAAACAAAGTGTAATGTCTCTATCTTGAATAAATTGAAAAATCGATTAAGTTATGATGGTTGTTTTTGTGTGGATAGTGTGGGTGCAAAGGGTGGTT
TGGGTCTTCTATGGAAGGACGATATTCAAATTGATCTTCGGTCTTTTTCAATTCATCATATTGATCTTAAGGTGACTTGGAACGATAAGGTTTCGTGGTTTATTGGCCTG
TATGGTCATCCTGAAACTCACTTACGCCATCATACGTGGAATCTCTTAAGGAGGTTACATAATTCAGATAGTTCAGCTTGGGTGGTGGGGGGAGATTTGAATGAGATACT
CTCGAATGAGGAAAAGGAGGGGGGTCTGTCAAGAGAGGAGTTGTTGATGGATGATTTTCGTCTAGCAATGGATGATTGTGCTTTCATGGGCCTTGGCTATTTTGGCAATA
CTTTTACATGGTGCAACAAAAGGAATTATCCAAATCAGGTTTGTCTCCGCCTTGATCGTTTTTTAGCAAATGTTGAGTTCCTTTCCATTTTTCCTGACTCCAGGGTCCAA
AATTTGAGATGGGCCAAATCAGATCACAGACCTATTGAACTTTTGCTATCTGGGGTTGAGTCCTTCTATGGGAGACAAGGGAGTAGGCCTTTTCAGTTTAATGATCATTG
GCGCTTACATGCTAATTGTGAAAAAATTGTTAAGGAATCCGTGAATTGGGAAGGATCAGTAACGAATCCTGCTACAATTAGCTTAAAGCTTAAAGTTTGTGCCCGCAACC
TCCAAAAATGGGGTGCTGTAGTTAATAAAGGCCTTTCTGGTAGTATCCACAAAGACCCATGGTTACCTCGACCTTCTTCCTTTAGAGTCTTCACTCCTGGGACCCAAGGG
AAACGGTGTGTGGTGGCTGAGTTAATTTCTCCTGCAGGCACATGGGATATAAAAAAATTGAATGATTTACTTTTGCCGGTGGATGTTGAAGTAATTAGAAGAATTCCAAT
CAGTCAAAGTGGAAGAAGTGACAACTGGGTCTGGCATTATGATAAGCATGGGATGTTTACGGTTCGTAGTGGCTATAAGTATGCTATTCTCGATCGTCTTATTGGTAAAG
GCTCATGCTCTAATTCTCAGTCGGATTGGTGGCGTCACCTTTGGAAGCTTCGAATTCCTAGGAAGATTGCTATTTTTATGTGGAAATCGTGTCATGATTGTCTTCCTAAT
AAGGTCTCCTTACAGAATCGAATGCTGGATACTGAACCGTGGTGCCCTTTATGTAGGAGGAAAAAAGAATCTGTGAGTCACGCTTTATTCTGGTGCTCTAGAGCGAGAAG
CATTTGGGATTTTTTCTCTTTTTATGGTTGGCAATTTTGTTCAAAATTTATGTCCTGCTTTGATATTTGGTTGAATTTTGCTTCTCACCTTAATGACAATGAATTGGAAA
CAACATGTATTGTGGCATGGTCAATTTGGGGGGATATGAATAATTGGATTCATTCTAAGAAAGTCCCTTCTATGGAGTATCGATGTGATTGGATCCTAGACTATGCTAAC
GGTGTTGGCGTCGAGCCAATTGAAATAGATCATTTCAAATCTATCGTTTTGCCGCCTCTACAGTTTGATTTTGGGCATGTGCATACTTTACACGTTGATGCTGCTTTCTC
TCCTACTTTTGGTGGAGTGATTGGTTCGAAGTCTGGAGAAATTTTGGCTACTCTGGCTGGGCATCGTGAAGGGTTGTGTAGTGTGTTGGGAGCTGAGGCCACTGCTGTGT
ATAAGGGTCTCAGGCTGGTAGAGAGATTGGGTCTTCCTAATCTTCTGGTTCTTTCGGACTCTTTATCTCTTATTCAAATGCTTAAAGGGACTACTGGTATCTACTTGGAG
GTTTCTAATTATATTGAAGATATCAAATGTTGTTTATCCACTTTTCGGTCAGTTGTGTCAAGATCTTCCAATCAAGCGGCACACTTGTTGGCTCGTGATGGTTCGTCGGG
TGTGAGTCTTTTACGGCTCAATTCTTTTCCGGATATGTTGTCCCATTTGTATTTAGGGGATGTCCGTTGTTTGAACAACATTGAGACCATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACAGTTTCTTCCATGATGGTCTCCTTGAGGAATGGAAAAGGTTTAGCCTAAATAACGAAGAAGACGAGAGGTCTCTTGATGTGGACCGAGCGGCCGCTATTCT
GGCCGAAGAGACATTCAATTGGTGTCTTATTGGAAAGCTTCTATGTGAATGCTTTATATTTGATTCTGTGATGTTTAATTTTTTTCGGAAGGCTTGGAAAATTGATAGTG
GTCTTAAGGTAAAACTTCTCAGCAAGAATATTTTCCTCTTCTCTTTTTGCCAGGAGGCTATTCGAACAAGAGTTATGAAGCAAGGGCCTTGGTTTTTTTATAAGAAAATC
CTTATCCTTCAAGAGCCATTGACAGGGGTGCAGCCACCGGAGATGGATTTCAAATGGGTGCCTTTTTGGGTACATTTTTTTCACCTATCCCTTTGTTGTTTCAATCGTTT
GATGGCCGAAAAATTGGGGAATGTTATTGGAATTTTTGATGATTTTGAGGGGGAAGAAAACACAATGTATTGGGGCGAGAGTTTGCGAGTTCAGATCCTTATTGACATTT
CAAAGCCAAACCGTAGGGGAATAAAGATTAACCTTGAAGGCACCTCTGGTGGCCGATGGGTGGCGATGCGTTATGAGCGCCTTCCGGAGTTGTGTTCTTTCTGTGGAATA
ATCGGACATCTTTTCAAGGATTGCAACGATTTTTATAAATCTGATCTCAAACAAGACAACCCACATCAGTCGACACCGGAAAAGCTAGCGGCGACAGTTTCAGGTGGTTC
AGCGATGGAGATTTCGCCGTTACCGGAAGCTAAGGCGGGGGAGGCCGTTTCAAAGAATCTTGAAGGAGGTGTTTCGTTCTTTCCAGATTCTATGGATGTTACGGTGGCTA
TTAATGACAGTAATTGCAGGAAAAATTTATCTTCAACTTTTGAGGATATCTCGGTGGGCAAGGGAGATTTATTTCTCTTTTCTCATGGTGGGGGCCACAATATTATGAAA
TTTGGAAAGGATATCCCCCTTTTTAAGGACCTAAATGAGCCAGTAATGGACCTTGCTTCAATCAATGAGTTTTTGGGGGCCCAGGTGTCTTTATCAAATTTTCGGACCAA
AGAGGGGAAGACTAAGTCAAGTCTGGGCCCCACTTCACAAAAACCTGCTCATCTGAAAGGTAAAAACCCAAAGGCAACTGCATGGAAGAAAAAGCTCCCTTTTTGGATAA
AAGAAAAGAGGGTCCACCTCTTGTTAACACAGCACCAGATAAGAGGCAAAAGATGGTTGAGGTTGAGGCTTCATCATATGCACCGAGTGGTGATTTGGCGGAGACTGCTT
CGCAGCCCCGCCTTGAATTATGAAAACCTTTATTGGAATGTCCGTGGATTGGAGAGTTCATGGGCATTCCGAGAGGTGTCGGACGAGCTCAAACGTCTAAATCCCTCGTT
ATGCTTTCTATCGGAAACAAAGTGTAATGTCTCTATCTTGAATAAATTGAAAAATCGATTAAGTTATGATGGTTGTTTTTGTGTGGATAGTGTGGGTGCAAAGGGTGGTT
TGGGTCTTCTATGGAAGGACGATATTCAAATTGATCTTCGGTCTTTTTCAATTCATCATATTGATCTTAAGGTGACTTGGAACGATAAGGTTTCGTGGTTTATTGGCCTG
TATGGTCATCCTGAAACTCACTTACGCCATCATACGTGGAATCTCTTAAGGAGGTTACATAATTCAGATAGTTCAGCTTGGGTGGTGGGGGGAGATTTGAATGAGATACT
CTCGAATGAGGAAAAGGAGGGGGGTCTGTCAAGAGAGGAGTTGTTGATGGATGATTTTCGTCTAGCAATGGATGATTGTGCTTTCATGGGCCTTGGCTATTTTGGCAATA
CTTTTACATGGTGCAACAAAAGGAATTATCCAAATCAGGTTTGTCTCCGCCTTGATCGTTTTTTAGCAAATGTTGAGTTCCTTTCCATTTTTCCTGACTCCAGGGTCCAA
AATTTGAGATGGGCCAAATCAGATCACAGACCTATTGAACTTTTGCTATCTGGGGTTGAGTCCTTCTATGGGAGACAAGGGAGTAGGCCTTTTCAGTTTAATGATCATTG
GCGCTTACATGCTAATTGTGAAAAAATTGTTAAGGAATCCGTGAATTGGGAAGGATCAGTAACGAATCCTGCTACAATTAGCTTAAAGCTTAAAGTTTGTGCCCGCAACC
TCCAAAAATGGGGTGCTGTAGTTAATAAAGGCCTTTCTGGTAGTATCCACAAAGACCCATGGTTACCTCGACCTTCTTCCTTTAGAGTCTTCACTCCTGGGACCCAAGGG
AAACGGTGTGTGGTGGCTGAGTTAATTTCTCCTGCAGGCACATGGGATATAAAAAAATTGAATGATTTACTTTTGCCGGTGGATGTTGAAGTAATTAGAAGAATTCCAAT
CAGTCAAAGTGGAAGAAGTGACAACTGGGTCTGGCATTATGATAAGCATGGGATGTTTACGGTTCGTAGTGGCTATAAGTATGCTATTCTCGATCGTCTTATTGGTAAAG
GCTCATGCTCTAATTCTCAGTCGGATTGGTGGCGTCACCTTTGGAAGCTTCGAATTCCTAGGAAGATTGCTATTTTTATGTGGAAATCGTGTCATGATTGTCTTCCTAAT
AAGGTCTCCTTACAGAATCGAATGCTGGATACTGAACCGTGGTGCCCTTTATGTAGGAGGAAAAAAGAATCTGTGAGTCACGCTTTATTCTGGTGCTCTAGAGCGAGAAG
CATTTGGGATTTTTTCTCTTTTTATGGTTGGCAATTTTGTTCAAAATTTATGTCCTGCTTTGATATTTGGTTGAATTTTGCTTCTCACCTTAATGACAATGAATTGGAAA
CAACATGTATTGTGGCATGGTCAATTTGGGGGGATATGAATAATTGGATTCATTCTAAGAAAGTCCCTTCTATGGAGTATCGATGTGATTGGATCCTAGACTATGCTAAC
GGTGTTGGCGTCGAGCCAATTGAAATAGATCATTTCAAATCTATCGTTTTGCCGCCTCTACAGTTTGATTTTGGGCATGTGCATACTTTACACGTTGATGCTGCTTTCTC
TCCTACTTTTGGTGGAGTGATTGGTTCGAAGTCTGGAGAAATTTTGGCTACTCTGGCTGGGCATCGTGAAGGGTTGTGTAGTGTGTTGGGAGCTGAGGCCACTGCTGTGT
ATAAGGGTCTCAGGCTGGTAGAGAGATTGGGTCTTCCTAATCTTCTGGTTCTTTCGGACTCTTTATCTCTTATTCAAATGCTTAAAGGGACTACTGGTATCTACTTGGAG
GTTTCTAATTATATTGAAGATATCAAATGTTGTTTATCCACTTTTCGGTCAGTTGTGTCAAGATCTTCCAATCAAGCGGCACACTTGTTGGCTCGTGATGGTTCGTCGGG
TGTGAGTCTTTTACGGCTCAATTCTTTTCCGGATATGTTGTCCCATTTGTATTTAGGGGATGTCCGTTGTTTGAACAACATTGAGACCATCTAA
Protein sequenceShow/hide protein sequence
MADSFFHDGLLEEWKRFSLNNEEDERSLDVDRAAAILAEETFNWCLIGKLLCECFIFDSVMFNFFRKAWKIDSGLKVKLLSKNIFLFSFCQEAIRTRVMKQGPWFFYKKI
LILQEPLTGVQPPEMDFKWVPFWVHFFHLSLCCFNRLMAEKLGNVIGIFDDFEGEENTMYWGESLRVQILIDISKPNRRGIKINLEGTSGGRWVAMRYERLPELCSFCGI
IGHLFKDCNDFYKSDLKQDNPHQSTPEKLAATVSGGSAMEISPLPEAKAGEAVSKNLEGGVSFFPDSMDVTVAINDSNCRKNLSSTFEDISVGKGDLFLFSHGGGHNIMK
FGKDIPLFKDLNEPVMDLASINEFLGAQVSLSNFRTKEGKTKSSLGPTSQKPAHLKGKNPKATAWKKKLPFWIKEKRVHLLLTQHQIRGKRWLRLRLHHMHRVVIWRRLL
RSPALNYENLYWNVRGLESSWAFREVSDELKRLNPSLCFLSETKCNVSILNKLKNRLSYDGCFCVDSVGAKGGLGLLWKDDIQIDLRSFSIHHIDLKVTWNDKVSWFIGL
YGHPETHLRHHTWNLLRRLHNSDSSAWVVGGDLNEILSNEEKEGGLSREELLMDDFRLAMDDCAFMGLGYFGNTFTWCNKRNYPNQVCLRLDRFLANVEFLSIFPDSRVQ
NLRWAKSDHRPIELLLSGVESFYGRQGSRPFQFNDHWRLHANCEKIVKESVNWEGSVTNPATISLKLKVCARNLQKWGAVVNKGLSGSIHKDPWLPRPSSFRVFTPGTQG
KRCVVAELISPAGTWDIKKLNDLLLPVDVEVIRRIPISQSGRSDNWVWHYDKHGMFTVRSGYKYAILDRLIGKGSCSNSQSDWWRHLWKLRIPRKIAIFMWKSCHDCLPN
KVSLQNRMLDTEPWCPLCRRKKESVSHALFWCSRARSIWDFFSFYGWQFCSKFMSCFDIWLNFASHLNDNELETTCIVAWSIWGDMNNWIHSKKVPSMEYRCDWILDYAN
GVGVEPIEIDHFKSIVLPPLQFDFGHVHTLHVDAAFSPTFGGVIGSKSGEILATLAGHREGLCSVLGAEATAVYKGLRLVERLGLPNLLVLSDSLSLIQMLKGTTGIYLE
VSNYIEDIKCCLSTFRSVVSRSSNQAAHLLARDGSSGVSLLRLNSFPDMLSHLYLGDVRCLNNIETI