CuGenDBv2

Sgr021074 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021074
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Integrase catalytic domain-containing protein
Genome location: tig00153640:283810..285998
RNA-Seq Expression: Sgr021074
Synteny: Sgr021074
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
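
The genome location field above follows a contig:start..end pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB, and the coordinates are assumed (not confirmed by this page) to be 1-based and inclusive.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return contig, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    contig, start, end = parse_location("tig00153640:283810..285998")
    print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span for this gene is 2189 bp; if the database used half-open coordinates instead, the value would be one smaller.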


Homology
GenBank top hits
KAA0057703.1 putative mitochondrial protein [Cucumis melo var. makuwa]; E-value: 7.6e-124; identity: 44.48%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENT--VDLFSDVVLPVSLSNPLLDDFSLHQPSISSDFPLSSD
        L +QRSKFD +A PC+FIGYPP MK Y+LYDI +  IF+SRD  F E  F   SI  ++++   + F ++VLP+ + N  +   ++ +  I +    SS 
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENT--VDLFSDVVLPVSLSNPLLDDFSLHQPSISSDFPLSSD

Query:  AQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANA--PLSSTINYPLHKYLSYDKFSPSHQHFL
          +   DA+ +  + +P+        V   +             LM RKSTR   PP+ L +YHC+LL +   P  +T  +PL+K LSY+K +P+H+ FL
Subjt:  AQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANA--PLSSTINYPLHKYLSYDKFSPSHQHFL

Query:  LNVSTVFEPQFFHQASKA---------DFSLFTRGSSPSFTTL---------------LVYADD--------IILTGPCQKE-IDSV-------------
         NVST +E  FFHQA K+         + +   R ++ S   L                 +AD         ++  G  Q+E ID +             
Subjt:  LNVSTVFEPQFFHQASKA---------DFSLFTRGSSPSFTTL---------------LVYADD--------IILTGPCQKE-IDSV-------------

Query:  ---------KAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQ
                 K       LLKDLG  KYFLGLELS S  GIYLSQRKYCLQ+LED+GFLA+ PT  PM P L LS+TSG+ L  +D++ Y R+IGRLLYLQ
Subjt:  ---------KAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQ

Query:  ISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEV
        ISRPD+++ VH LSQFL++P + H+ AVHHLL YLKGT+ QGI+L  S  F +K FVD DW  CLDT+ S+TGFC+FL  S+++WK+KKQ+T SRSS E 
Subjt:  ISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEV

Query:  EYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMS
        +YR L SV+SE+ W+  LL+DL I    P L+YCDN+A I IASNP FHERTKH  ID HF+ DK+  G  KL+ I+S  QLA MFTK + +S ++  M 
Subjt:  EYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMS

Query:  KMGVFNL
        KM + NL
Subjt:  KMGVFNL

XP_022899321.1 uncharacterized protein LOC111412620 [Olea europaea var. sylvestris]; E-value: 5.5e-114; identity: 62.46%
Query:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR
        SK+D+SLFT+G+  +F  LLVY DDI++T   Q EI+ +K  L SHF LKDLG +KYFL LE++ S++GI+LSQR+Y LQLLEDTGFLAS P ALPMDP+
Subjt:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR

Query:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS
        L LSS  GD   I D+++Y ++ GRLLYL ISRPDITF VH LSQF+S+P   H+ A HHLL Y+K + GQGIL     S  L+AF DADWGSCLDTR S
Subjt:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS

Query:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGF
        V GFCVFLGDS+I+WK+KKQ+TVSRSS E EYRALAS  SEL W+ QLL D   S   PT+++CDNQ A+++ASNP+FHERTKH  IDCHF+RDKV+DG 
Subjt:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGF

Query:  LKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        +KL+P+RS  QLAD+F KA+   +L SL+SKM V N   PS
Subjt:  LKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

XP_031744313.1 uncharacterized protein LOC116404977 [Cucumis sativus]; E-value: 2.1e-121; identity: 62.43%
Query:  SHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCL
        S Q FL   + +    F    SKAD+SLFT+G+  +F  LLVY DDI+LTGP    I+SVK  L +HF LKDLG  +YFLGLELS S++G+ LSQRKYCL
Subjt:  SHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCL

Query:  QLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSI
        Q+LEDTGFL S P   PMDP L L  + G+ L  +D+T Y R+IGRL+YLQISRPDI F VH LSQFL KP   H+ A HHLL YLKG+SGQG+L++   
Subjt:  QLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSI

Query:  SFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFH
        SF LKAFV ADWGSCLDTR SVTGFC+FLGDS+I+WKSKKQ TVSRSS E EYRALASVTSEL+W+ QLL D  +   +PT ++CDNQAAI I SNP FH
Subjt:  SFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFH

Query:  ERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        ERTKH  IDCHFVRDK+ +GFLK++PI +  QLADMFTKA+ +S LN  +SK+G+ ++  P+
Subjt:  ERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

XP_031745923.1 uncharacterized protein LOC116406346 [Cucumis sativus]; E-value: 5.5e-122; identity: 62.71%
Query:  SHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCL
        S Q FL   +T+    F    SKAD+SLFT+G+  +F  LLVY DDI+LTGP    I+SVK  L +HF LKDLG  +YFLGLELS S++G+ LSQRKYCL
Subjt:  SHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCL

Query:  QLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSI
        Q+LEDTGFL S     PMDP L L  + G+ L+ +D+T Y R+IGRL+YLQISRPDI F VH LSQFL KP   H+ A HHLL YLKG+ GQG+L++   
Subjt:  QLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSI

Query:  SFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFH
        SF LKAFVDADWGSCLDTR SVTGFC+FLGDS+I+WKSKKQ TVSRSS E EYRAL SVTSEL+W+ QLL D  I   +PT ++CDNQAAI IASNP FH
Subjt:  SFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFH

Query:  ERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        ERTKH  IDCHFVRDK+ +GFLK++PI +  QLADMFTKA+ +S LN  +SK+G+ ++  P+
Subjt:  ERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

XP_038896371.1 uncharacterized mitochondrial protein AtMg00810-like [Benincasa hispida]; E-value: 5.7e-119; identity: 60.38%
Query:  KFSPSHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQR
        +   + + + L  +       FHQ SK+D+SLFTRG    F  LLVY DDI+LTGP  + I SVK +L  HF LKDLG  KYFLGLELS SQQGI +SQR
Subjt:  KFSPSHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQR

Query:  KYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILL
        KYCLQ+LEDTGFL + P   PMDP L LS   G+ L  DD T Y R+I RL+YLQIS+PDI F +H LSQF+  P    ++A HHLL YLK + GQGIL+
Subjt:  KYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILL

Query:  RRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASN
            SF LKAFVD DWGSC DTR S+TGFC+FLG  +I+WKSKKQ+TVSRSS E EYRALASVTSEL+W++QLLKDL + + +PT ++CDNQA I IASN
Subjt:  RRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASN

Query:  PVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        P+FHERTKH  ID HFV+DK++ GFLK++PI+S  QLADMFTKA+ TSVL+ L+SK+G+ ++  P+
Subjt:  PVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

TrEMBL top hits
A0A2N9FBS5 Uncharacterized protein; E-value: 4.1e-115; identity: 35.95%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQ---SIVHDENTVDLFS-DVVLPVSLSNPLLDDFSLHQPSISSDFPLS
        L++ ++KFD +A PCVF+GYP G KGYKL D++    F+SRD IF+E  FPF    S++H  +++D  S    +P   S+P     S   PS+ S  PL+
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQ---SIVHDENTVDLFS-DVVLPVSLSNPLLDDFSLHQPSISSDFPLS

Query:  SDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANAPLSST----INYPLHKYLSYDKFSPSH
        S                 PD            + ++  PL     +L  R+S+R   PPSYL +YHC+L  + P S+       YP+   LSY   S SH
Subjt:  SDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANAPLSST----INYPLHKYLSYDKFSPSH

Query:  QHFLLNVSTVFEPQFFHQA---------------------------------------------------------------------------------
        + F L +ST  EP F+H+A                                                                                 
Subjt:  QHFLLNVSTVFEPQFFHQA---------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------SKADFSLFTRGSSPS
                                                                                             SK D+SLFT+    +
Subjt:  -------------------------------------------------------------------------------------SKADFSLFTRGSSPS

Query:  FTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDD
        F  LLVY DDI++       + S+   L  HF LKDLG  KYFLGLEL+ S +GI L QRKY L +L+D+GFL S P   PM+  L LS  +G+PL   D
Subjt:  FTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDD

Query:  STLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITW
         T+Y R+IG+LLYL ++RPDI++ V  LSQF+  PC+ H+ A H +L YLKG+ GQG+      +  LKAF D+DW  C DTR SVTGFC+FLGDS+++W
Subjt:  STLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITW

Query:  KSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIRSQFQLADM
        +SKKQS VSRSS E EYRA+A  T E+ W+  LLKD  +  P+P +L+CDNQAA++IASNPVFHERTKHI  DCHF+RDK+ DG LK + + S  QLAD+
Subjt:  KSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIRSQFQLADM

Query:  FTKAVTTSVLNSLMSKMGVFNLCAPS
        FTK +  +  + L+SK+G+ N+ +P+
Subjt:  FTKAVTTSVLNSLMSKMGVFNLCAPS

A0A2N9IB47 Integrase catalytic domain-containing protein; E-value: 3.1e-115; identity: 38.55%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENTVDLFSDVVLPVSLSN---PLLDDFSLHQPSISSDFPLSS
        L   R KF  RAK C+ +GY  G+KGY+LYD++  QIF+SRD +FYE  FPF +  H  +T    + +VLP  ++N   P+   FS    + +S     S
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENTVDLFSDVVLPVSLSN---PLLDDFSLHQPSISSDFPLSS

Query:  DAQRAGGDASIILENLLP-DDAQWAGGAVTPLVPEA--FSP--LVGDVISLMTRKSTRPRHPPSYLHNYHCN----LLANAPLS-------STINYPLHK
              G +       LP         AV   +P A   SP  ++ D  SL+ RKSTR   PPSYL  +HCN    L A +P S       ++ N+PL  
Subjt:  DAQRAGGDASIILENLLP-DDAQWAGGAVTPLVPEA--FSP--LVGDVISLMTRKSTRPRHPPSYLHNYHCN----LLANAPLS-------STINYPLHK

Query:  YLSYDKFSPSHQHFLLNVSTVFEPQFFHQA----------------------------------------------------------------------
        YL Y K +P +Q F+LN ST+ EP  FH+A                                                                      
Subjt:  YLSYDKFSPSHQHFLLNVSTVFEPQFFHQA----------------------------------------------------------------------

Query:  -------------------------------------------------SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKD
                                                         SKAD+SLFTR    SF  LLVY DDI++       +  +K  L + F LKD
Subjt:  -------------------------------------------------SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKD

Query:  LGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPC
        LG V+YFLGLE++ S QGI +SQRKY L++LED G L   PT  PMD  L LS   G  L   D T+Y R++GRL+YL ++RPDI F VH LSQF+  P 
Subjt:  LGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPC

Query:  SGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKD
          H  A  H+L Y+KG   QG+    +    +KAF D+DW  C DTR S TG+CVFLG S+++W+SKKQ+TVSRSS E EYRA+AS   E++W+  LL D
Subjt:  SGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKD

Query:  LWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        L I  P   LL+ D+QAAI+IA+NPVFHERTKH  IDCH VRDK+ +G ++ + + S+ Q+AD+ TKA+   + +SL  KMG+ NLC PS
Subjt:  LWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

A0A2N9IX68 Integrase catalytic domain-containing protein; E-value: 6.5e-113; identity: 37.34%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIV--------------------HDENTVDLFSDVVLPVSLSNPLLD
        L+  R+KFD RAK C F+GYP G+KGYKL D+   ++FISRD IF+E+ FPFQ+ +                       NT D+     +PVS       
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIV--------------------HDENTVDLFSDVVLPVSLSNPLLD

Query:  DFSLHQPSISS-----DFPLSSDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNL------LANA
         F    PS+SS     D P+ SD      D   +  + LP+ +       T L P + SPL         R+STR    P+YL +YHC L          
Subjt:  DFSLHQPSISS-----DFPLSSDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNL------LANA

Query:  PLSST-INYPLHKYLSYDKFSPSHQHFLLNVSTVFEPQFFHQA---------------------------------------------------------
        P +ST   YPL   LSYD  SP+H+ F L+V  + EP  F QA                                                         
Subjt:  PLSST-INYPLHKYLSYDKFSPSHQHFLLNVSTVFEPQFFHQA---------------------------------------------------------

Query:  ----------------------------------------------------------------------------------SKADFSLFTRGSSPSFTT
                                                                                          SK+D+SLFTR     F  
Subjt:  ----------------------------------------------------------------------------------SKADFSLFTRGSSPSFTT

Query:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTL
        LLVY DDI++       + ++K  L + F LKDLG +KYFLGLE++ S +GI L QRKY L +L D+G LAS P A PM+  L +S ++GD L  DD ++
Subjt:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTL

Query:  YHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSK
        Y R++GRLLYL ++RPDI++ V  LSQF+S+P + H++A + +L Y+KGTSGQG+      S  LKAF D+DW  C DTR S+TG+CV+LGDS+I+WKSK
Subjt:  YHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSK

Query:  KQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIRSQFQLADMFTK
        KQ TVSRSS E EYRA+ASV  EL+W+  LL +L    P   LL+CD+QAAI+IA+NPV+HERTKHI  DCH +R+K+  G ++ + + SQ QLAD+ TK
Subjt:  KQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIRSQFQLADMFTK

Query:  AVTTSVLNSLMSKMGVFNLCAPS
        A+ +   +SL+SKMGV N+ APS
Subjt:  AVTTSVLNSLMSKMGVFNLCAPS

A0A2N9IZK3 Uncharacterized protein; E-value: 1.1e-115; identity: 36.38%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSI------VHDEN---TVDLFSDVVLPVSLSNPLLDDFSLHQP-SIS
        L+  R+KFD RAKPCVF+GYP G+KGYKL D+T   + ISRD IF+E  FPF +         D N   +   FSD+ L  ++S P+    S  +P S+S
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSI------VHDEN---TVDLFSDVVLPVSLSNPLLDDFSLHQP-SIS

Query:  SDFPLSSDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANAPLSSTIN-------YPLHKYL
        +    S  A+                         +P +P    P   + +S   R+STR   PP+YL +YHC +  +AP +S+ +       YPL   L
Subjt:  SDFPLSSDAQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANAPLSSTIN-------YPLHKYL

Query:  SYDKFSPSHQHFLLNVSTVFEPQFFHQA------------------------------------------------------------------------
        SYD  SPSH+ F L+V+ + EP  F QA                                                                        
Subjt:  SYDKFSPSHQHFLLNVSTVFEPQFFHQA------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------SKADFSL
                                                                                                     SK+D+SL
Subjt:  ---------------------------------------------------------------------------------------------SKADFSL

Query:  FTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTS
        FTR    +F  LLVY DDI++       + ++K  L + F LKDLG +KYFLGLE++ S +GI L QRKY L +L D+G L S P   PM+ +L LS + 
Subjt:  FTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTS

Query:  GDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVF
        GD LS  D + Y R++GRLLYL ++RPDI++ V  LSQF++KP + H++A + +L Y+KGTSGQG+    +    LK+F D+DW SC DTR SVTG+CVF
Subjt:  GDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVF

Query:  LGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIR
        LG+S+I+WKSKKQ T+SRSS E EYRA+AS   EL+W+  LLK+L ++ P   LLYCD+QAA++IA+NPVFHERTKHI  DCH +R+K+ DG ++ + + 
Subjt:  LGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSDGFLKLMPIR

Query:  SQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS
        SQ QLAD+ TKA+ +   NSL+SKMGV N+ APS
Subjt:  SQFQLADMFTKAVTTSVLNSLMSKMGVFNLCAPS

A0A5D3DVP4 Putative mitochondrial protein; E-value: 3.7e-124; identity: 44.48%
Query:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENT--VDLFSDVVLPVSLSNPLLDDFSLHQPSISSDFPLSSD
        L +QRSKFD +A PC+FIGYPP MK Y+LYDI +  IF+SRD  F E  F   SI  ++++   + F ++VLP+ + N  +   ++ +  I +    SS 
Subjt:  LTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENT--VDLFSDVVLPVSLSNPLLDDFSLHQPSISSDFPLSSD

Query:  AQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANA--PLSSTINYPLHKYLSYDKFSPSHQHFL
          +   DA+ +  + +P+        V   +             LM RKSTR   PP+ L +YHC+LL +   P  +T  +PL+K LSY+K +P+H+ FL
Subjt:  AQRAGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANA--PLSSTINYPLHKYLSYDKFSPSHQHFL

Query:  LNVSTVFEPQFFHQASKA---------DFSLFTRGSSPSFTTL---------------LVYADD--------IILTGPCQKE-IDSV-------------
         NVST +E  FFHQA K+         + +   R ++ S   L                 +AD         ++  G  Q+E ID +             
Subjt:  LNVSTVFEPQFFHQASKA---------DFSLFTRGSSPSFTTL---------------LVYADD--------IILTGPCQKE-IDSV-------------

Query:  ---------KAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQ
                 K       LLKDLG  KYFLGLELS S  GIYLSQRKYCLQ+LED+GFLA+ PT  PM P L LS+TSG+ L  +D++ Y R+IGRLLYLQ
Subjt:  ---------KAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQ

Query:  ISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEV
        ISRPD+++ VH LSQFL++P + H+ AVHHLL YLKGT+ QGI+L  S  F +K FVD DW  CLDT+ S+TGFC+FL  S+++WK+KKQ+T SRSS E 
Subjt:  ISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEV

Query:  EYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMS
        +YR L SV+SE+ W+  LL+DL I    P L+YCDN+A I IASNP FHERTKH  ID HF+ DK+  G  KL+ I+S  QLA MFTK + +S ++  M 
Subjt:  EYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMS

Query:  KMGVFNL
        KM + NL
Subjt:  KMGVFNL

SwissProt top hits
P04146 Copia protein; E-value: 5.8e-42; identity: 30.92%
Query:  EPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTP
        E +F + +      +  +G+      +L+Y DD+++       +++ K  L   F + DL  +K+F+G+ +   +  IYLSQ  Y  ++L          
Subjt:  EPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTP

Query:  TALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQI-SRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLK--AFVDA
         + P+  +++    + D    D +T    +IG L+Y+ + +RPD+T  V+ LS++ SK  S     +  +L YLKGT    ++ +++++F  K   +VD+
Subjt:  TALPMDPRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQI-SRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLK--AFVDA

Query:  DWGSCLDTRHSVTGFCVFLGD-SMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHIDC-
        DW      R S TG+   + D ++I W +K+Q++V+ SS E EY AL     E +W+  LL  + I L  P  +Y DNQ  I IA+NP  H+R KHID  
Subjt:  DWGSCLDTRHSVTGFCVFLGD-SMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHIDC-

Query:  -HFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGV
         HF R++V +  + L  I ++ QLAD+FTK +  +    L  K+G+
Subjt:  -HFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; E-value: 5.8e-42; identity: 32.32%
Query:  PLHKYLSYDKFSPSHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLEL--S
        P   Y+ +D F  S  +    + T  +P  +          F R S  +F  LL+Y DD+++ G  +  I  +K  LS  F +KDLG  +  LG+++   
Subjt:  PLHKYLSYDKFSPSHQHFLLNVSTVFEPQFFHQASKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLEL--S

Query:  CSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDD-----STLYHRIIGRLLYLQI-SRPDITFLVHCLSQFLSKPCSGHMSAV
         + + ++LSQ KY  ++LE      + P + P+   L LS     P ++++        Y   +G L+Y  + +RPDI   V  +S+FL  P   H  AV
Subjt:  CSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDPLSIDD-----STLYHRIIGRLLYLQI-SRPDITFLVHCLSQFLSKPCSGHMSAV

Query:  HHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPL
          +L YL+GT+G  +    S    LK + DAD    +D R S TG+        I+W+SK Q  V+ S+ E EY A      E+IW+ + L++L +    
Subjt:  HHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPL

Query:  PTLLYCDNQAAIYIASNPVFHERTKHIDC--HFVRDKVSDGFLKLMPIRSQFQLADMFTKAV
          ++YCD+Q+AI ++ N ++H RTKHID   H++R+ V D  LK++ I +    ADM TK V
Subjt:  PTLLYCDNQAAIYIASNPVFHERTKHIDC--HFVRDKVSDGFLKLMPIRSQFQLADMFTKAV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; E-value: 1.1e-03; identity: 45.45%
Query:  QRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYE
        QR+K D ++ PC+FIGY     GY+L+D  K ++  SRD +F E
Subjt:  QRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYE

P92519 Uncharacterized mitochondrial protein AtMg00810; E-value: 3.6e-44; identity: 41.41%
Query:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLS-STSGDPLSIDDST
        LL+Y DDI+LTG     ++ +   LSS F +KDLG V YFLG+++     G++LSQ KY  Q+L + G L   P + P+  +L+ S ST+  P    D +
Subjt:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLS-STSGDPLSIDDST

Query:  LYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKS
         +  I+G L YL ++RPDI++ V+ + Q + +P       +  +L Y+KGT   G+ + ++    ++AF D+DW  C  TR S TGFC FLG ++I+W +
Subjt:  LYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKS

Query:  KKQSTVSRSSVEVEYRALASVTSELIW
        K+Q TVSRSS E EYRALA   +EL W
Subjt:  KKQSTVSRSSVEVEYRALASVTSELIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; E-value: 2.6e-66; identity: 41.62%
Query:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR
        S +D SLF      S   +LVY DDI++TG     + +    LS  F +KD   + YFLG+E      G++LSQR+Y L LL  T  + + P   PM P 
Subjt:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR

Query:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS
          LS  SG  L+  D T Y  I+G L YL  +RPDI++ V+ LSQF+  P   H+ A+  +L YL GT   GI L++  +  L A+ DADW    D   S
Subjt:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS

Query:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGF
          G+ V+LG   I+W SKKQ  V RSS E EYR++A+ +SE+ W+  LL +L I L  P ++YCDN  A Y+ +NPVFH R KH  ID HF+R++V  G 
Subjt:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKH--IDCHFVRDKVSDGF

Query:  LKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGV
        L+++ + +  QLAD  TK ++ +   +  SK+GV
Subjt:  LKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; E-value: 9.2e-64; identity: 40.41%
Query:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPM--D
        S +D SLF      S   +LVY DDI++TG     +      LS  F +K+   + YFLG+E     QG++LSQR+Y L LL  T  L + P A PM   
Subjt:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPM--D

Query:  PRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTR
        P+L+L S +     + D T Y  I+G L YL  +RPD+++ V+ LSQ++  P   H +A+  +L YL GT   GI L++  +  L A+ DADW    D  
Subjt:  PRLSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTR

Query:  HSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSD
         S  G+ V+LG   I+W SKKQ  V RSS E EYR++A+ +SEL W+  LL +L I L  P ++YCDN  A Y+ +NPVFH R KHI  D HF+R++V  
Subjt:  HSVTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDKVSD

Query:  GFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNL
        G L+++ + +  QLAD  TK ++     +   K+GV  +
Subjt:  GFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKMGVFNL

Arabidopsis top hits
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; E-value: 9.7e-77; identity: 49.83%
Query:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR
        S +D + F + ++  F  +LVY DDII+       +D +K+ L S F L+DLG +KYFLGLE++ S  GI + QRKY L LL++TG L   P+++PMDP 
Subjt:  SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPR

Query:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS
        ++ S+ SG      D+  Y R+IGRL+YLQI+R DI+F V+ LSQF   P   H  AV  +L Y+KGT GQG+         L+ F DA + SC DTR S
Subjt:  LSLSSTSGDPLSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHS

Query:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDK
          G+C+FLG S+I+WKSKKQ  VS+SS E EYRAL+  T E++W+AQ  ++L + L  PTLL+CDN AAI+IA+N VFHERTKHI  DCH VR++
Subjt:  VTGFCVFLGDSMITWKSKKQSTVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHI--DCHFVRDK

ATMG00240.1 Gag-Pol-related retrotransposon family protein; E-value: 4.6e-18; identity: 54.43%
Query:  LYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFC
        +YL I+RPD+TF V+ LSQF S   +  M AV+ +L Y+KGT GQG+    +    LKAF D+DW SC DTR SVTGFC
Subjt:  LYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein; E-value: 2.6e-45; identity: 41.41%
Query:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLS-STSGDPLSIDDST
        LL+Y DDI+LTG     ++ +   LSS F +KDLG V YFLG+++     G++LSQ KY  Q+L + G L   P + P+  +L+ S ST+  P    D +
Subjt:  LLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLS-STSGDPLSIDDST

Query:  LYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKS
         +  I+G L YL ++RPDI++ V+ + Q + +P       +  +L Y+KGT   G+ + ++    ++AF D+DW  C  TR S TGFC FLG ++I+W +
Subjt:  LYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKS

Query:  KKQSTVSRSSVEVEYRALASVTSELIW
        K+Q TVSRSS E EYRALA   +EL W
Subjt:  KKQSTVSRSSVEVEYRALASVTSELIW


Sequences
CDS sequence
ATGCCTTTATTGTCCAATAAGTGTCCTTTAACTGCTCAGCGTTCAAAGTTTGACCTTCGTGCCAAACCATGCGTCTTCATTGGTTATCCCCCCGGCATGAAAGGTTATAA
ACTTTATGATATCACCAAGTGTCAGATTTTTATATCACGGGATGCTATTTTCTATGAAGATTTCTTTCCCTTTCAGTCCATTGTTCATGACGAGAATACCGTGGATTTAT
TCTCTGATGTGGTTCTTCCAGTTTCATTATCAAATCCTTTACTTGATGATTTTTCTCTTCATCAGCCCTCAATTTCTTCAGATTTTCCTTTATCTAGTGATGCTCAGAGG
GCTGGTGGTGATGCTTCCATAATTCTAGAAAATCTTTTGCCTGATGATGCTCAATGGGCTGGTGGTGCTGTTACTCCTTTGGTTCCTGAAGCTTTTTCTCCTTTAGTTGG
TGACGTTATTTCTCTTATGACTCGCAAATCTACTAGGCCGCGACATCCACCCTCTTATTTGCATAATTATCATTGCAATTTACTAGCTAATGCTCCTTTATCATCTACGA
TCAACTATCCTTTACATAAATATTTGTCTTATGATAAGTTCTCTCCTTCTCATCAGCATTTTCTCCTTAATGTGTCCACTGTCTTTGAGCCTCAATTCTTTCATCAAGCT
TCTAAGGCAGATTTTTCTTTGTTCACTCGTGGTTCAAGTCCTTCTTTTACAACTTTGTTAGTCTATGCTGATGACATCATTTTGACCGGTCCTTGTCAGAAAGAGATAGA
CTCAGTTAAGGCTATACTTAGTTCTCATTTTCTGCTTAAGGATCTTGGTGTTGTTAAATATTTCCTTGGCTTGGAATTATCTTGTTCTCAACAGGGAATTTATCTTTCCC
AAAGAAAATATTGTCTTCAACTCTTGGAAGACACTGGGTTTTTGGCTTCTACGCCTACTGCCTTACCTATGGATCCAAGACTTTCTCTTAGTTCTACTAGTGGAGACCCC
TTGTCTATTGACGATTCCACTCTTTATCATCGTATTATTGGTCGTTTATTATATCTCCAGATTTCGAGGCCTGATATCACTTTTCTGGTTCATTGTCTTAGTCAATTTCT
TTCTAAGCCTTGCTCCGGCCATATGTCAGCTGTTCATCATCTATTGTGCTACCTTAAAGGTACTTCGGGTCAGGGTATTTTACTTCGCCGTTCTATTTCTTTTGGTCTTA
AGGCCTTTGTTGATGCTGATTGGGGTTCTTGCTTGGATACTCGGCATTCTGTAACGGGCTTTTGTGTTTTCTTGGGTGATTCTATGATTACCTGGAAATCTAAGAAACAA
TCCACTGTTTCTCGATCATCTGTTGAAGTCGAGTATCGTGCTTTAGCATCCGTTACCAGTGAATTGATTTGGGTTGCTCAATTGCTCAAGGATCTTTGGATTTCTTTGCC
TCTTCCAACTTTATTATATTGTGATAATCAGGCGGCCATCTATATTGCTTCTAATCCGGTTTTTCATGAACGTACGAAACACATCGATTGCCACTTTGTTCGGGATAAGG
TCTCAGATGGTTTCCTCAAACTTATGCCAATTCGCTCTCAATTTCAGCTTGCGGATATGTTCACTAAAGCTGTCACTACTTCTGTTTTGAATTCTTTAATGAGCAAGATG
GGTGTTTTTAATCTTTGTGCTCCATCTTGA
mRNA sequence
ATGCCTTTATTGTCCAATAAGTGTCCTTTAACTGCTCAGCGTTCAAAGTTTGACCTTCGTGCCAAACCATGCGTCTTCATTGGTTATCCCCCCGGCATGAAAGGTTATAA
ACTTTATGATATCACCAAGTGTCAGATTTTTATATCACGGGATGCTATTTTCTATGAAGATTTCTTTCCCTTTCAGTCCATTGTTCATGACGAGAATACCGTGGATTTAT
TCTCTGATGTGGTTCTTCCAGTTTCATTATCAAATCCTTTACTTGATGATTTTTCTCTTCATCAGCCCTCAATTTCTTCAGATTTTCCTTTATCTAGTGATGCTCAGAGG
GCTGGTGGTGATGCTTCCATAATTCTAGAAAATCTTTTGCCTGATGATGCTCAATGGGCTGGTGGTGCTGTTACTCCTTTGGTTCCTGAAGCTTTTTCTCCTTTAGTTGG
TGACGTTATTTCTCTTATGACTCGCAAATCTACTAGGCCGCGACATCCACCCTCTTATTTGCATAATTATCATTGCAATTTACTAGCTAATGCTCCTTTATCATCTACGA
TCAACTATCCTTTACATAAATATTTGTCTTATGATAAGTTCTCTCCTTCTCATCAGCATTTTCTCCTTAATGTGTCCACTGTCTTTGAGCCTCAATTCTTTCATCAAGCT
TCTAAGGCAGATTTTTCTTTGTTCACTCGTGGTTCAAGTCCTTCTTTTACAACTTTGTTAGTCTATGCTGATGACATCATTTTGACCGGTCCTTGTCAGAAAGAGATAGA
CTCAGTTAAGGCTATACTTAGTTCTCATTTTCTGCTTAAGGATCTTGGTGTTGTTAAATATTTCCTTGGCTTGGAATTATCTTGTTCTCAACAGGGAATTTATCTTTCCC
AAAGAAAATATTGTCTTCAACTCTTGGAAGACACTGGGTTTTTGGCTTCTACGCCTACTGCCTTACCTATGGATCCAAGACTTTCTCTTAGTTCTACTAGTGGAGACCCC
TTGTCTATTGACGATTCCACTCTTTATCATCGTATTATTGGTCGTTTATTATATCTCCAGATTTCGAGGCCTGATATCACTTTTCTGGTTCATTGTCTTAGTCAATTTCT
TTCTAAGCCTTGCTCCGGCCATATGTCAGCTGTTCATCATCTATTGTGCTACCTTAAAGGTACTTCGGGTCAGGGTATTTTACTTCGCCGTTCTATTTCTTTTGGTCTTA
AGGCCTTTGTTGATGCTGATTGGGGTTCTTGCTTGGATACTCGGCATTCTGTAACGGGCTTTTGTGTTTTCTTGGGTGATTCTATGATTACCTGGAAATCTAAGAAACAA
TCCACTGTTTCTCGATCATCTGTTGAAGTCGAGTATCGTGCTTTAGCATCCGTTACCAGTGAATTGATTTGGGTTGCTCAATTGCTCAAGGATCTTTGGATTTCTTTGCC
TCTTCCAACTTTATTATATTGTGATAATCAGGCGGCCATCTATATTGCTTCTAATCCGGTTTTTCATGAACGTACGAAACACATCGATTGCCACTTTGTTCGGGATAAGG
TCTCAGATGGTTTCCTCAAACTTATGCCAATTCGCTCTCAATTTCAGCTTGCGGATATGTTCACTAAAGCTGTCACTACTTCTGTTTTGAATTCTTTAATGAGCAAGATG
GGTGTTTTTAATCTTTGTGCTCCATCTTGA
Protein sequence
MPLLSNKCPLTAQRSKFDLRAKPCVFIGYPPGMKGYKLYDITKCQIFISRDAIFYEDFFPFQSIVHDENTVDLFSDVVLPVSLSNPLLDDFSLHQPSISSDFPLSSDAQR
AGGDASIILENLLPDDAQWAGGAVTPLVPEAFSPLVGDVISLMTRKSTRPRHPPSYLHNYHCNLLANAPLSSTINYPLHKYLSYDKFSPSHQHFLLNVSTVFEPQFFHQA
SKADFSLFTRGSSPSFTTLLVYADDIILTGPCQKEIDSVKAILSSHFLLKDLGVVKYFLGLELSCSQQGIYLSQRKYCLQLLEDTGFLASTPTALPMDPRLSLSSTSGDP
LSIDDSTLYHRIIGRLLYLQISRPDITFLVHCLSQFLSKPCSGHMSAVHHLLCYLKGTSGQGILLRRSISFGLKAFVDADWGSCLDTRHSVTGFCVFLGDSMITWKSKKQ
STVSRSSVEVEYRALASVTSELIWVAQLLKDLWISLPLPTLLYCDNQAAIYIASNPVFHERTKHIDCHFVRDKVSDGFLKLMPIRSQFQLADMFTKAVTTSVLNSLMSKM
GVFNLCAPS
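
The CDS and protein records above can be cross-checked by translating the nucleotide sequence with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the standard (NCBI table 1) code is assumed to apply to this nuclear gene, and only the first 27 and last 12 nucleotides of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'.

    Trailing bases that do not fill a complete codon are ignored, and
    codons with characters outside A/C/G/T are rendered as 'X'.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    cds_start = "ATGCCTTTATTGTCCAATAAGTGTCCT"  # first 27 nt of the CDS record
    cds_end = "GCTCCATCTTGA"                   # last 12 nt of the CDS record
    print(translate(cds_start))  # MPLLSNKCP
    print(translate(cds_end))    # APS*
```

Both fragments agree with the start (MPLLSNKCP) and end (APS) of the protein record; running `translate` on the full CDS and comparing the result, minus the trailing `*`, against the protein sequence would extend the same check to the whole record.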