; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008053 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008053
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr9:10945633..10949312
RNA-Seq ExpressionLag0008053
SyntenyLag0008053
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96499.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.2e-6145.83Show/hide
Query:  MRLAQERMKKFADKHRREVEFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAATVHPVFHVSQLKKQVGASKAVQTNQP
        ++LAQERMKK AD  RREVEF+EGD+VFLKLRPYRQTSL ++RNEKLSPKYFGPYRV+ERIGKV Y+L+LP AA +HPVFHVSQLKK VG  + VQ   P
Subjt:  MRLAQERMKKFADKHRREVEFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAATVHPVFHVSQLKKQVGASKAVQTNQP

Query:  PLSDDFEWRSEPDDVFACHKNQDSGETEILVSWKHLPEHEATWLLLSNFREQFPEYHLEDKVQLTPDGDVRPPIILTYSRRDKKDNSGLQKAESAFLQGK
         ++ + EW + P++V++  KN  + E E L+SWK LP HEATW   ++ + QFP++HLEDKV L  + D RPPI+ TY R++KK +  +++ + A     
Subjt:  PLSDDFEWRSEPDDVFACHKNQDSGETEILVSWKHLPEHEATWLLLSNFREQFPEYHLEDKVQLTPDGDVRPPIILTYSRRDKKDNSGLQKAESAFLQGK

Query:  LISDNIIIGHECINAIK--NSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELG
        +     II  + ++  K  + +L    M   +L + +    +    + E+  +I    +  ++I+ + EG    I K +  GR +E G
Subjt:  LISDNIIIGHECINAIK--NSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELG

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]9.6e-1352.78Show/hide
Query:  ESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS
        +SAF+  +LI+DN+IIG+EC++ I+ SK     +VA+KLD+SKAYDRVEW +L + M  +GF+ +W+ LIMS
Subjt:  ESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.9e-6126.26Show/hide
Query:  SGLQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS----------------------
        S + + +SAFL  +LI+DNI++  E +NAIKN       + ++KLD+SKA DRVEW ++ E+M K+GF   W+ LIM+                      
Subjt:  SGLQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS----------------------

Query:  ----------------FAEGLSHLIS---------------------------------KANE----------------DGRMSEL--------------
                         +EGLS L+                                  +ANE                 G++  +              
Subjt:  ----------------FAEGLSHLIS---------------------------------KANE----------------DGRMSEL--------------

Query:  ----------------------------------------GKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKV
                                                G  PS  W+ + WG+ L+  GLR++IG G    + +DPW+P   +F P        E  V
Subjt:  ----------------------------------------GKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKV

Query:  ADYISPSGGWDIEKLNTAVINFDINTIRGIPIK-TNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALK
        ++ I+    WD++ LN      D++ I  IP+     +D LI + + +G YTV++GY     ++ +  SSSSS     W  LW LK+P KIK F WR   
Subjt:  ADYISPSGGWDIEKLNTAVINFDINTIRGIPIK-TNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALK

Query:  DSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHG--CFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGE
        +++P    L  + +     C IC+   E+I H LF C  AR +W+  ++N  L+ +     C  D  V + +  ++ ++       W+IWT+ N+ VHG 
Subjt:  DSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHG--CFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGE

Query:  SIPPSQVRSNWIKEYLDSF---WKANANDPRSKFVGSKT---ITSSSRNVCGL---LPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLD
            +++ S++   Y  +F    + + +D   +  GS T   +   S     L    P  G   LN DAA  S ++ +G+GAL+RD NG + AA S  + 
Subjt:  SIPPSQVRSNWIKEYLDSF---WKANANDPRSKFVGSKT---ITSSSRNVCGL---LPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLD

Query:  YYMDPLMAELKAILEGLSLAMGC-LNI-KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDH
           D    E  A+   L+L +   L I ++ TD    SN + K ++  S  + L+ +I SL+  F N+   ++ R  N++AD LAK+A  + ++  W D+
Subjt:  YYMDPLMAELKAILEGLSLAMGC-LNI-KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDH

Query:  FPSWLCNLVFDD
         P  + +++ +D
Subjt:  FPSWLCNLVFDD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]7.7e-8729.4Show/hide
Query:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVE----------------------------
        +  A+SAF+  + ISDN+IIGHEC++ I + K     M A+KLDLSKA+DRVEW YL  IM K+GFN  W++                            
Subjt:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVE----------------------------

Query:  ----------LIMSFAEGLSHLISKANEDGRMS-------------------------------------------------------------------
                  L +  AEGLS LI+  N  GR++                                                                   
Subjt:  ----------LIMSFAEGLSHLISKANEDGRMS-------------------------------------------------------------------

Query:  -----------------------------------------------ELG-----------------------KHP------------------------
                                                       E G                       +HP                        
Subjt:  -----------------------------------------------ELG-----------------------KHP------------------------

Query:  ---SYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKL
           SY WK  LWGR+L+ KGLR R+GNG     F DPW+P+  TFKP+  +    +T VA +I+  G WD+  ++ +  N D + I  +PI + NL D  
Subjt:  ---SYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKL

Query:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR
        + H+DK G Y+V+SGYKLYM +K N  S+S++     WN++WKL +P KIK F WR+  + IP   NL  RGI     C IC  + E+I H  F C RAR
Subjt:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR

Query:  EIWKLTYNNV-FLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSR
        +IW+  +  +  L  + +  F++ W  +      KDL+LA +T W IW D N  +HG+ + P + +  W+  +LDS  +A  ++        +T ++   
Subjt:  EIWKLTYNNV-FLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSR

Query:  NVCGLLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAVWSE
         V    PS    L LNTDAAC      +  G +IRD +  +VAA+S  + + + PL+AE++ ILEGL  A      +++V +D  +A   I  +     +
Subjt:  NVCGLLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAVWSE

Query:  VEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRD-SMCWVDHFPSWLCNLVFDDLLS-FAQVA
         +  V EI +L   F  I F +  R CN +A  LAK+       +  W+ +FP+WL +LV  D  S FA VA
Subjt:  VEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRD-SMCWVDHFPSWLCNLVFDDLLS-FAQVA

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]1.8e-6431.5Show/hide
Query:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-
        +++G +PSY+W+S+LWGR++IC G R+RIGNG+D ++ K  WIPK FTFKP+       E  V++ I+    WD E +       D + I  IP+   L 
Subjt:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-

Query:  NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDC
         D+LI HF K+G+YTVKSGY+  +KI+   + SSS      WN +W L +P KI+ F WRA K+ +P+  NL  R I     C +C   +EN+ H L DC
Subjt:  NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDC

Query:  PRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFV-HGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTIT
          A+++W+L+   + ++       +     +    S  D+ L  V  W  W   N+++  G+   P  V +   +  ++++ +   +   S     K + 
Subjt:  PRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFV-HGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTIT

Query:  SSSRNVCGLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAV
            N     P +GF  +NTDAA +S +  +GLGA+IRD+NG++ A +     ++     AE +A+  GL +A      ++ + +D Q   + +  +   
Subjt:  SSSRNVCGLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAV

Query:  WSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDDLLSFAQVA
         SE+  +V EI  L   F ++   Y  R CN  A SL K A +  +++ W   +P  +      D+  F  +A
Subjt:  WSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDDLLSFAQVA

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]4.8e-1245.12Show/hide
Query:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLS
        +   +SAF+  +LI+DNII+G+EC++ I++ K     +VA+KLD+SKAYDR+EW +L +IM ++GF+ +W+ LIM     +S
Subjt:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLS

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]1.3e-6231.77Show/hide
Query:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-
        +++G +PS++W+S+LWG ++I KG+R+RIG+GK   ++KD WIP+  TF+PI       ET VAD I     W +++L    +  DI  I  I + +   
Subjt:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-

Query:  NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDC
         D+++ HFDK G+Y+VKSGY+L +         SS+  +R+W   W L +P K+K F WRALK+ +P   NL  R       C  C  Q+E + H L +C
Subjt:  NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDC

Query:  PRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKA--------NANDPRSKF
          AR+IW L    V   +D +  F     ++ + SS  +  L  V  W IW+  NKF+        +  S ++    DS  KA        N +  + + 
Subjt:  PRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKA--------NANDPRSKF

Query:  VGSKTITSSSRNVCGLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLA--MGCLNIKVVTDCQMASNF
        +  +     S+NV           LN DAA S+  Q  GLGA++RD  GKI+A       +     +AE +AI  GL +A  +   ++ V +DC+     
Subjt:  VGSKTITSSSRNVCGLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLA--MGCLNIKVVTDCQMASNF

Query:  ITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLV
        +       +E+  ++ ++     +F  + F +IPR CN  A +LAK+A     +  WV  FP+ + N++
Subjt:  ITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLV

TrEMBL top hitse value%identityAlignment
A0A5D3BBV5 Ty3/gypsy retrotransposon protein2.1e-6145.83Show/hide
Query:  MRLAQERMKKFADKHRREVEFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAATVHPVFHVSQLKKQVGASKAVQTNQP
        ++LAQERMKK AD  RREVEF+EGD+VFLKLRPYRQTSL ++RNEKLSPKYFGPYRV+ERIGKV Y+L+LP AA +HPVFHVSQLKK VG  + VQ   P
Subjt:  MRLAQERMKKFADKHRREVEFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAATVHPVFHVSQLKKQVGASKAVQTNQP

Query:  PLSDDFEWRSEPDDVFACHKNQDSGETEILVSWKHLPEHEATWLLLSNFREQFPEYHLEDKVQLTPDGDVRPPIILTYSRRDKKDNSGLQKAESAFLQGK
         ++ + EW + P++V++  KN  + E E L+SWK LP HEATW   ++ + QFP++HLEDKV L  + D RPPI+ TY R++KK +  +++ + A     
Subjt:  PLSDDFEWRSEPDDVFACHKNQDSGETEILVSWKHLPEHEATWLLLSNFREQFPEYHLEDKVQLTPDGDVRPPIILTYSRRDKKDNSGLQKAESAFLQGK

Query:  LISDNIIIGHECINAIK--NSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELG
        +     II  + ++  K  + +L    M   +L + +    +    + E+  +I    +  ++I+ + EG    I K +  GR +E G
Subjt:  LISDNIIIGHECINAIK--NSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELG

A0A6J1DX30 uncharacterized protein LOC1110248743.7e-8729.4Show/hide
Query:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVE----------------------------
        +  A+SAF+  + ISDN+IIGHEC++ I + K     M A+KLDLSKA+DRVEW YL  IM K+GFN  W++                            
Subjt:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVE----------------------------

Query:  ----------LIMSFAEGLSHLISKANEDGRMS-------------------------------------------------------------------
                  L +  AEGLS LI+  N  GR++                                                                   
Subjt:  ----------LIMSFAEGLSHLISKANEDGRMS-------------------------------------------------------------------

Query:  -----------------------------------------------ELG-----------------------KHP------------------------
                                                       E G                       +HP                        
Subjt:  -----------------------------------------------ELG-----------------------KHP------------------------

Query:  ---SYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKL
           SY WK  LWGR+L+ KGLR R+GNG     F DPW+P+  TFKP+  +    +T VA +I+  G WD+  ++ +  N D + I  +PI + NL D  
Subjt:  ---SYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKL

Query:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR
        + H+DK G Y+V+SGYKLYM +K N  S+S++     WN++WKL +P KIK F WR+  + IP   NL  RGI     C IC  + E+I H  F C RAR
Subjt:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR

Query:  EIWKLTYNNV-FLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSR
        +IW+  +  +  L  + +  F++ W  +      KDL+LA +T W IW D N  +HG+ + P + +  W+  +LDS  +A  ++        +T ++   
Subjt:  EIWKLTYNNV-FLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSR

Query:  NVCGLLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAVWSE
         V    PS    L LNTDAAC      +  G +IRD +  +VAA+S  + + + PL+AE++ ILEGL  A      +++V +D  +A   I  +     +
Subjt:  NVCGLLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC--LNIKVVTDCQMASNFITKKAAVWSE

Query:  VEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRD-SMCWVDHFPSWLCNLVFDDLLS-FAQVA
         +  V EI +L   F  I F +  R CN +A  LAK+       +  W+ +FP+WL +LV  D  S FA VA
Subjt:  VEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRD-SMCWVDHFPSWLCNLVFDDLLS-FAQVA

A0A803PM52 Uncharacterized protein2.8e-6627.42Show/hide
Query:  SGLQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS----------------------
        S + + +SAFL  +LI+DNI++  E +NAIKN       + ++KLD+SKA DRVEW ++ E+M K+GF   W+ LIM+                      
Subjt:  SGLQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS----------------------

Query:  ----------------FAEGLSHLISKANEDGRM--------------------------------------------------------------SELG
                         +EGLS L+      G +                                                              +  G
Subjt:  ----------------FAEGLSHLISKANEDGRM--------------------------------------------------------------SELG

Query:  KHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIK-TNLNDKL
          PS  W+ + WG+ L+  GLR++IG G    + +DPW+P   +F P        E  V++ I+    WD++ LN      D++ I  IP+     +D L
Subjt:  KHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIK-TNLNDKL

Query:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR
        I + + +G YTV++GY     ++ +  SSSSS     W  LW LK+P KIK F WR   +++P    L  + +     C IC+   E+I H LF C  AR
Subjt:  ICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAR

Query:  EIWKLTYNNVFLEEDFHG--CFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSF---WKANANDPRSKFVGSKT--
         +W+  ++N  L+ +     C  D  V + +  ++ ++       W+IWT+ N+ VHG     +++ S++   Y  +F    + + +D   +  GS T  
Subjt:  EIWKLTYNNVFLEEDFHG--CFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSF---WKANANDPRSKFVGSKT--

Query:  -ITSSSRNVCGL---LPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC-LNI-KVVTDCQMASNFI
         +   S     L    P  G   LN DAA  S ++ +G+GAL+RD NG + AA S  +    D    E  A+   L+L +   L I ++ TD    SN +
Subjt:  -ITSSSRNVCGL---LPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC-LNI-KVVTDCQMASNFI

Query:  TKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD
         K ++  S  + L+ +I SL+  F N+   ++ R  N++AD LAK+A  + ++  W D+ P  + +++ +D
Subjt:  TKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD

A0A803QH76 Uncharacterized protein9.8e-6429.54Show/hide
Query:  NSKLNWGR---MVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKA----NEDGRMSELGKHPSYLWKSLLWGRELICKGLRYR
        NSK++W +   M + K D    +    +I+  + ++      +    I+ F + L   + KA    N D  M+  G  PS  W+S+  G+EL+ KGLR++
Subjt:  NSKLNWGR---MVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKA----NEDGRMSELGKHPSYLWKSLLWGRELICKGLRYR

Query:  IGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKLICHFDKTGKYTVKSGYKLYMKIKI
        IGNG+     +DPW+P   TF P     D   T V  YI+ +  WDI+ L     + D+  I  IP+     +DK++ H   +G YTV+SGY L + +  
Subjt:  IGNGKDTYMFKDPWIPKEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKT-NLNDKLICHFDKTGKYTVKSGYKLYMKIKI

Query:  NGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRW
            SSSS   + WN LW L++P K+K F WR + D++P  VNL HR I  S  C +C    E++ H LF C RA+ +W   + NVF+    +    D +
Subjt:  NGVSSSSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRW

Query:  VKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSS-------SRNVCGLLPSDGFWLLNTDA
          + A  +  DL + T   W IW++ NK +HG    P+ +  ++   YL  +  A      +  + S +  +S               P +G + LN DA
Subjt:  VKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSS-------SRNVCGLLPSDGFWLLNTDA

Query:  ACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC-LNIKVV-TDCQMASNFITKKAAV--WSEVEALVEEIWSLMGQFHN
        AC+      G GA++RD +G ++A  S        P   E+ A+   L  A+   L I  + TD  +  N + K ++    +    LVE++  L+  F  
Subjt:  ACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGC-LNIKVV-TDCQMASNFITKKAAV--WSEVEALVEEIWSLMGQFHN

Query:  IDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD
        +   ++ R  N +A +LA +A  L +   W+   PS + +++  D
Subjt:  IDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD

A0A803QH76 Uncharacterized protein3.3e-1144.74Show/hide
Query:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS
        + + +SAFL  +LI+DN+++  E ++ +KN K       A+KLD+SKA+DRVEW +L E+M+K+GF+  WV L+M+
Subjt:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMS

A0A803QH76 Uncharacterized protein7.1e-6227.81Show/hide
Query:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIM--------SF---------------
        + + +SAFL  +LI+DNI+I  E I+ +++ +       A+KLD+SKA+DRVEW YL  IM K+GF+  W+ LIM        SF               
Subjt:  LQKAESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIM--------SF---------------

Query:  ---------------AEGLSHLISKANEDGRMSEL--------GKHPSYLWKSLLW----------------------GRELICKGLRYRIGNGKDTYMF
                       +EGLS  +    + G +  L          H  +   SLL+                      G+EL+ KGLR++IG+G      
Subjt:  ---------------AEGLSHLISKANEDGRMSEL--------GKHPSYLWKSLLW----------------------GRELICKGLRYRIGNGKDTYMF

Query:  KDPWIPKEFTFKPIC-IDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSP
        KDPWIP    FKP+  +  D     V+ +I+ +  W++  L +     DI+ I  IP+      D+L+ H    G Y+VK+G+ L   ++    SS+S+ 
Subjt:  KDPWIPKEFTFKPIC-IDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNL-NDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSP

Query:  LNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQ
         +  W   W LK+P KI+ F W+  ++ +P  V L  R +  S +C +C S  E+I H LF C  A++IWK++   +   +  +    D    +     Q
Subjt:  LNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQ

Query:  KDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDP----RSKFVGSKTITSSSRNVCGLLPS-DGFWLLNTDAACSSLRQDSGL
         D        W IWTD NK VHG           +  ++ + F KA    P     S+   S + +++ ++V    P     + LN DAA +  ++  G+
Subjt:  KDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDP----RSKFVGSKTITSSSRNVCGLLPS-DGFWLLNTDAACSSLRQDSGL

Query:  GALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGCL--NIKVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVS
        GA++RD  G ++AA S  +         E KA+   ++           + TD    SN + +  +  S    L+ +I  L+  F  +   ++ R  N +
Subjt:  GALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGCL--NIKVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVS

Query:  ADSLAKYACDLRDSMCWVDHFP
        A  LAKYA  L +   W+   P
Subjt:  ADSLAKYACDLRDSMCWVDHFP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.0e-0435.71Show/hide
Query:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK
        +MKK+ D   +E+ EF+ GD+V +K    R  +    ++ KL+P + GP+ V+++ G   Y LDLP +        FHVS L+K
Subjt:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK

P0CT35 Transposon Tf2-2 polyprotein5.0e-0435.71Show/hide
Query:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK
        +MKK+ D   +E+ EF+ GD+V +K    R  +    ++ KL+P + GP+ V+++ G   Y LDLP +        FHVS L+K
Subjt:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK

P0CT36 Transposon Tf2-3 polyprotein5.0e-0435.71Show/hide
Query:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK
        +MKK+ D   +E+ EF+ GD+V +K    R  +    ++ KL+P + GP+ V+++ G   Y LDLP +        FHVS L+K
Subjt:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK

P0CT41 Transposon Tf2-12 polyprotein5.0e-0435.71Show/hide
Query:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK
        +MKK+ D   +E+ EF+ GD+V +K    R  +    ++ KL+P + GP+ V+++ G   Y LDLP +        FHVS L+K
Subjt:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK

Q9UR07 Transposon Tf2-11 polyprotein5.0e-0435.71Show/hide
Query:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK
        +MKK+ D   +E+ EF+ GD+V +K    R  +    ++ KL+P + GP+ V+++ G   Y LDLP +        FHVS L+K
Subjt:  RMKKFADKHRREV-EFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAA--TVHPVFHVSQLKK

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein3.7e-1523.24Show/hide
Query:  LWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWK---LTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLS
        +WKL +  KIKHF WR +  ++  N  L  R I+    C  C  + E I H +F+CP  + +W+   +   N +         ++R +++    +   L 
Subjt:  LWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWK---LTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLS

Query:  --LATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSK---FVGSKTITSSSRNVCGLLPSDGFWL-LNTDAACSSLRQDSGLGAL
          L     W +W   N F+  +        +   K   D+    NAN+        V +  I +S R+     P    W+  N D+  +     +  G  
Subjt:  --LATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANANDPRSK---FVGSKTITSSSRNVCGLLPSDGFWL-LNTDAACSSLRQDSGLGAL

Query:  IRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSL--AMGCLNIKVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADS
        IR+ NG IV   +  L      L AE    L  L +  A G   +   +D +     I       S +  L+ +I   M +       ++ R  N +AD+
Subjt:  IRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSL--AMGCLNIKVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADS

Query:  LAKYACDLRDSMCWVDHFPSWLCNLVF
        LA +              PSWL N ++
Subjt:  LAKYACDLRDSMCWVDHFPSWLCNLVF

AT3G09510.1 Ribonuclease H-like superfamily protein7.0e-3828.3Show/hide
Query:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGK------DTYMFKDPWIP--KEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRG
        +++ K  SY W SLL G  L+ KG R+ IG+G+      D  +   P  P   E T+K + I+ ++FE K + Y      WD  K++  V   D   I  
Subjt:  SELGKHPSYLWKSLLWGRELICKGLRYRIGNGK------DTYMFKDPWIP--KEFTFKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRG

Query:  IPI-KTNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRV--WNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQL
        I + K+   DK+I +++ TG+YTV+SGY L        + + + P   +     +W L I  K+KHF WRAL  ++     L  RG+ +   CP C+ + 
Subjt:  IPI-KTNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRV--WNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQL

Query:  ENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRWVKI-----DACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANA
        E+I+H LF CP A   W+L+ +++   +     F +    I     D   S     L     W IW   N  V       ++ R +  K  L +  KA  
Subjt:  ENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRWVKI-----DACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVRSNWIKEYLDSFWKANA

Query:  NDPRSKFVGSKTITSSSRNVC-----GLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGL--SLAMGCLNI
        +D  +     K   S +R +         P   +   N DA     + ++  G +IR+  G  ++  S  L +  +PL AE KA+L  L  +   G   +
Subjt:  NDPRSKFVGSKTITSSSRNVC-----GLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGL--SLAMGCLNI

Query:  KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWL
         +  DCQ   N I    +  S +   +E+I     +F +I F +I R  N  A  LAKY C            P WL
Subjt:  KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWL

AT3G25270.1 Ribonuclease H-like superfamily protein8.4e-1521.13Show/hide
Query:  LWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWK--------LTYNNVFLEEDFHGCFIDRWVKIDACSS
        +WKLK   KIKHF W+ L  ++    NL  R I    QC  C  + E   H  FDC  A+++W+        L    + +E           + + +C +
Subjt:  LWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWK--------LTYNNVFLEEDFHGCFIDRWVKIDACSS

Query:  QKD---LSLATVTYWTIWTDINKFVHGESI----PPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSRNVCGLLPSDGFWLLNTDAACSSLRQD
         +     +LA    W +W   N+ V  +         Q   N ++E+ D+           +   S+    +        P   +   N D A +   ++
Subjt:  QKD---LSLATVTYWTIWTDINKFVHGESI----PPSQVRSNWIKEYLDSFWKANANDPRSKFVGSKTITSSSRNVCGLLPSDGFWLLNTDAACSSLRQD

Query:  SGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLA--MGCLNIKVVTDCQMASNFITKKAAVWSEVEALVE-EIWSLMGQFHNIDFHYIPRL
        +  G L+RD+NG  + +         D L +E +A++  +  A   G   +    D +     +  +   +     + E   W    +F    F ++PR 
Subjt:  SGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLA--MGCLNIKVVTDCQMASNFITKKAAVWSEVEALVE-EIWSLMGQFHNIDFHYIPRL

Query:  CNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD
         N  AD LAK+      S  +  + P+++ + ++ D
Subjt:  CNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDD

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.4e-0834.69Show/hide
Query:  AESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELGKHP
        A+++F+ G++ +DNI+   E +++++  K   G M+ +KLDL KAYDR+ W YL + +I  GF   W+  I     G   +   A E GR ++  K P
Subjt:  AESAFLQGKLISDNIIIGHECINAIKNSKLNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELGKHP

AT4G29090.1 Ribonuclease H-like superfamily protein1.5e-3225.53Show/hide
Query:  LGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFE--------TKVADYISPSG-GWD---IEKLNTAVINFDINTI
        LG  PS++WKS+   +E++ +G R  +GNG+D  +++  W+  +     + + R   +         KV+D I  SG  W    IE L   V    I  +
Subjt:  LGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFTFKPICIDRDMFE--------TKVADYISPSG-GWD---IEKLNTAVINFDINTI

Query:  RGIPIKTNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSS----SSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICN
        R  P    + D     +  +G YTVKSGY +  +I IN  SS    S   LN ++  +WK +   KI+HF W+ L +S+P    L +R ++    C  C 
Subjt:  RGIPIKTNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSS----SSSPLNRVWNNLWKLKIPAKIKHFCWRALKDSIPNNVNLNHRGINVSVQCPICN

Query:  SQLENIDHCLFDCPRAREIWKLTYNNVFLEEDF-HGCFIDRWVKIDACSS----QKDLSLATVTYWTIWTDINKFV-HGESIPPSQVRSNWIKEYLDSFW
        S  E ++H LF C  AR  W ++   + L  ++    +++ +   +  +     +K   L     W +W + N+ V  G      +V     ++ L+  W
Subjt:  SQLENIDHCLFDCPRAREIWKLTYNNVFLEEDF-HGCFIDRWVKIDACSS----QKDLSLATVTYWTIWTDINKFV-HGESIPPSQVRSNWIKEYLDSFW

Query:  KANANDPRSKFVGSKTITSSSRNVCG-LLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEG-LSLAMGCLNI
        +      R++     T    +R+ CG   P    W+  NTDA  +   +  G+G ++R++ G++    +  L      L AEL+A+    LSL+    N 
Subjt:  KANANDPRSKFVGSKTITSSSRNVCG-LLPSDGFWL-LNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEG-LSLAMGCLNI

Query:  KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFP
         +          I     +W  ++  ++++  L+ QF  + F +IPR  N  A+ +A      R+S+ ++++ P
Subjt:  KVVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCTGGCTCAAGAGCGTATGAAAAAGTTCGCGGACAAGCATCGAAGGGAGGTAGAGTTTGAAGAAGGTGACATGGTGTTCTTAAAATTACGCCCGTACCGTCAGAC
TTCATTGGCACGACGGAGGAATGAAAAGCTCTCTCCAAAGTACTTTGGTCCTTATCGAGTGGTTGAGAGAATCGGCAAAGTAGTCTACAGATTGGATCTCCCTCAGGCGG
CGACAGTGCATCCAGTGTTTCACGTGTCACAGCTGAAGAAACAAGTAGGTGCATCAAAGGCGGTACAGACTAACCAACCGCCATTGTCTGATGACTTTGAATGGAGGTCA
GAACCAGATGATGTGTTTGCTTGCCATAAGAATCAGGATAGCGGGGAAACAGAAATCCTAGTCAGTTGGAAGCATTTACCCGAGCATGAAGCGACATGGTTACTACTGTC
GAATTTTCGAGAGCAATTTCCTGAATATCACCTTGAGGACAAGGTGCAACTGACCCCGGATGGTGATGTTAGGCCTCCAATTATCTTAACTTATTCTAGAAGGGACAAAA
AGGATAATTCGGGCCTGCAAAAAGCCGAGTCTGCTTTTCTCCAAGGGAAGCTTATATCAGACAACATCATCATTGGTCATGAGTGTATTAATGCCATAAAAAATAGCAAA
CTCAATTGGGGTAGGATGGTTGCTATCAAGCTTGATCTTAGTAAAGCATATGACCGGGTGGAGTGGATTTACCTAAGAGAAATTATGATTAAGATTGGGTTTAATATCAG
ATGGGTTGAACTAATAATGAGCTTTGCTGAAGGATTATCACATTTGATCTCAAAAGCTAATGAAGATGGTAGGATGTCTGAGTTAGGGAAGCACCCATCCTACCTTTGGA
AAAGCCTTTTATGGGGGAGAGAGTTGATTTGTAAAGGGCTTAGGTATAGGATTGGCAATGGGAAAGATACCTACATGTTTAAGGATCCATGGATTCCCAAAGAATTTACC
TTCAAACCTATTTGTATAGATAGAGATATGTTTGAGACTAAGGTAGCAGATTATATCTCCCCTTCGGGAGGTTGGGATATAGAGAAACTCAATACGGCAGTTATAAATTT
TGACATTAATACTATCAGAGGTATTCCTATAAAAACAAATTTAAATGACAAATTAATATGCCATTTTGATAAAACCGGGAAGTACACGGTCAAGAGTGGATATAAGCTCT
ATATGAAAATTAAAATTAATGGAGTTTCATCAAGTTCTTCCCCTTTAAATCGTGTATGGAATAACCTTTGGAAACTCAAAATTCCAGCCAAGATTAAACATTTTTGCTGG
AGAGCTCTTAAGGATTCGATACCTAACAATGTTAATCTAAACCATAGAGGTATTAATGTTTCTGTGCAGTGCCCAATTTGCAATTCCCAATTAGAAAATATTGATCATTG
TTTGTTTGACTGTCCAAGAGCAAGGGAGATATGGAAACTAACTTACAATAATGTTTTTTTGGAGGAGGACTTCCATGGATGCTTTATTGATCGATGGGTGAAAATTGATG
CCTGCTCGTCCCAGAAAGATCTAAGTCTCGCGACGGTTACCTACTGGACTATTTGGACAGACATAAACAAATTTGTTCATGGCGAGTCAATCCCCCCTTCCCAAGTCAGA
AGCAATTGGATAAAAGAGTACCTTGATTCTTTCTGGAAAGCAAATGCCAATGATCCTCGGTCAAAATTCGTTGGTAGCAAGACGATCACTTCCTCCTCTCGAAATGTATG
TGGGCTTCTTCCTTCGGATGGCTTTTGGTTGCTGAATACAGATGCTGCTTGTTCTTCCTTGAGGCAAGATTCAGGTCTGGGAGCTTTGATCAGAGACAAAAATGGAAAAA
TCGTGGCTGCTTCTTCAAATTTTTTGGATTATTACATGGATCCTCTTATGGCTGAGCTTAAAGCTATTCTTGAAGGATTGTCCCTTGCTATGGGATGCTTGAATATCAAA
GTAGTTACAGACTGCCAGATGGCATCTAATTTCATTACCAAAAAGGCGGCAGTGTGGTCTGAGGTGGAGGCCTTAGTGGAGGAGATTTGGAGTTTAATGGGGCAGTTTCA
TAACATAGATTTTCATTATATTCCAAGGCTCTGTAACGTGAGTGCTGATTCTTTGGCTAAGTATGCATGTGATTTGAGAGATTCTATGTGCTGGGTAGATCATTTTCCCA
GTTGGTTATGTAATTTGGTCTTTGATGACCTTCTTTCTTTTGCCCAAGTGGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGCCTGGCTCAAGAGCGTATGAAAAAGTTCGCGGACAAGCATCGAAGGGAGGTAGAGTTTGAAGAAGGTGACATGGTGTTCTTAAAATTACGCCCGTACCGTCAGAC
TTCATTGGCACGACGGAGGAATGAAAAGCTCTCTCCAAAGTACTTTGGTCCTTATCGAGTGGTTGAGAGAATCGGCAAAGTAGTCTACAGATTGGATCTCCCTCAGGCGG
CGACAGTGCATCCAGTGTTTCACGTGTCACAGCTGAAGAAACAAGTAGGTGCATCAAAGGCGGTACAGACTAACCAACCGCCATTGTCTGATGACTTTGAATGGAGGTCA
GAACCAGATGATGTGTTTGCTTGCCATAAGAATCAGGATAGCGGGGAAACAGAAATCCTAGTCAGTTGGAAGCATTTACCCGAGCATGAAGCGACATGGTTACTACTGTC
GAATTTTCGAGAGCAATTTCCTGAATATCACCTTGAGGACAAGGTGCAACTGACCCCGGATGGTGATGTTAGGCCTCCAATTATCTTAACTTATTCTAGAAGGGACAAAA
AGGATAATTCGGGCCTGCAAAAAGCCGAGTCTGCTTTTCTCCAAGGGAAGCTTATATCAGACAACATCATCATTGGTCATGAGTGTATTAATGCCATAAAAAATAGCAAA
CTCAATTGGGGTAGGATGGTTGCTATCAAGCTTGATCTTAGTAAAGCATATGACCGGGTGGAGTGGATTTACCTAAGAGAAATTATGATTAAGATTGGGTTTAATATCAG
ATGGGTTGAACTAATAATGAGCTTTGCTGAAGGATTATCACATTTGATCTCAAAAGCTAATGAAGATGGTAGGATGTCTGAGTTAGGGAAGCACCCATCCTACCTTTGGA
AAAGCCTTTTATGGGGGAGAGAGTTGATTTGTAAAGGGCTTAGGTATAGGATTGGCAATGGGAAAGATACCTACATGTTTAAGGATCCATGGATTCCCAAAGAATTTACC
TTCAAACCTATTTGTATAGATAGAGATATGTTTGAGACTAAGGTAGCAGATTATATCTCCCCTTCGGGAGGTTGGGATATAGAGAAACTCAATACGGCAGTTATAAATTT
TGACATTAATACTATCAGAGGTATTCCTATAAAAACAAATTTAAATGACAAATTAATATGCCATTTTGATAAAACCGGGAAGTACACGGTCAAGAGTGGATATAAGCTCT
ATATGAAAATTAAAATTAATGGAGTTTCATCAAGTTCTTCCCCTTTAAATCGTGTATGGAATAACCTTTGGAAACTCAAAATTCCAGCCAAGATTAAACATTTTTGCTGG
AGAGCTCTTAAGGATTCGATACCTAACAATGTTAATCTAAACCATAGAGGTATTAATGTTTCTGTGCAGTGCCCAATTTGCAATTCCCAATTAGAAAATATTGATCATTG
TTTGTTTGACTGTCCAAGAGCAAGGGAGATATGGAAACTAACTTACAATAATGTTTTTTTGGAGGAGGACTTCCATGGATGCTTTATTGATCGATGGGTGAAAATTGATG
CCTGCTCGTCCCAGAAAGATCTAAGTCTCGCGACGGTTACCTACTGGACTATTTGGACAGACATAAACAAATTTGTTCATGGCGAGTCAATCCCCCCTTCCCAAGTCAGA
AGCAATTGGATAAAAGAGTACCTTGATTCTTTCTGGAAAGCAAATGCCAATGATCCTCGGTCAAAATTCGTTGGTAGCAAGACGATCACTTCCTCCTCTCGAAATGTATG
TGGGCTTCTTCCTTCGGATGGCTTTTGGTTGCTGAATACAGATGCTGCTTGTTCTTCCTTGAGGCAAGATTCAGGTCTGGGAGCTTTGATCAGAGACAAAAATGGAAAAA
TCGTGGCTGCTTCTTCAAATTTTTTGGATTATTACATGGATCCTCTTATGGCTGAGCTTAAAGCTATTCTTGAAGGATTGTCCCTTGCTATGGGATGCTTGAATATCAAA
GTAGTTACAGACTGCCAGATGGCATCTAATTTCATTACCAAAAAGGCGGCAGTGTGGTCTGAGGTGGAGGCCTTAGTGGAGGAGATTTGGAGTTTAATGGGGCAGTTTCA
TAACATAGATTTTCATTATATTCCAAGGCTCTGTAACGTGAGTGCTGATTCTTTGGCTAAGTATGCATGTGATTTGAGAGATTCTATGTGCTGGGTAGATCATTTTCCCA
GTTGGTTATGTAATTTGGTCTTTGATGACCTTCTTTCTTTTGCCCAAGTGGCGTAA
Protein sequenceShow/hide protein sequence
MRLAQERMKKFADKHRREVEFEEGDMVFLKLRPYRQTSLARRRNEKLSPKYFGPYRVVERIGKVVYRLDLPQAATVHPVFHVSQLKKQVGASKAVQTNQPPLSDDFEWRS
EPDDVFACHKNQDSGETEILVSWKHLPEHEATWLLLSNFREQFPEYHLEDKVQLTPDGDVRPPIILTYSRRDKKDNSGLQKAESAFLQGKLISDNIIIGHECINAIKNSK
LNWGRMVAIKLDLSKAYDRVEWIYLREIMIKIGFNIRWVELIMSFAEGLSHLISKANEDGRMSELGKHPSYLWKSLLWGRELICKGLRYRIGNGKDTYMFKDPWIPKEFT
FKPICIDRDMFETKVADYISPSGGWDIEKLNTAVINFDINTIRGIPIKTNLNDKLICHFDKTGKYTVKSGYKLYMKIKINGVSSSSSPLNRVWNNLWKLKIPAKIKHFCW
RALKDSIPNNVNLNHRGINVSVQCPICNSQLENIDHCLFDCPRAREIWKLTYNNVFLEEDFHGCFIDRWVKIDACSSQKDLSLATVTYWTIWTDINKFVHGESIPPSQVR
SNWIKEYLDSFWKANANDPRSKFVGSKTITSSSRNVCGLLPSDGFWLLNTDAACSSLRQDSGLGALIRDKNGKIVAASSNFLDYYMDPLMAELKAILEGLSLAMGCLNIK
VVTDCQMASNFITKKAAVWSEVEALVEEIWSLMGQFHNIDFHYIPRLCNVSADSLAKYACDLRDSMCWVDHFPSWLCNLVFDDLLSFAQVA