CuGenDBv2

CSPI01G19040 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G19040
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr1:14446725..14450609
RNA-Seq Expression: CSPI01G19040
Synteny: CSPI01G19040
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] (e-value: 1.7e-115; identity: 31.16%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DFALW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SK L +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  -----VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS-------------------------------------------
             + ++     FLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                                            
Subjt:  -----VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS-------------------------------------------

Query:  ---SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
           SK+   +  +K  LS EF MKDLGE ++ILG+D+ RD+ + + +ISQ +Y  K
Subjt:  ---SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e-value: 1.9e-114; identity: 30.54%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DF+LW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SKSL +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG+DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------
                                         + ++    AFLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                
Subjt:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------

Query:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
                                       SK+  ++  +K  LS EF MKDLGE ++ILG+D+ RDK + + +ISQ +Y  K
Subjt:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

KAG8502437.1 hypothetical protein CXB51_000388 [Gossypium anomalum] (e-value: 1.9e-98; identity: 28.91%)
Query:  RVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNKMYLR
        + EI   D    FALW+ K++A+L Q     ALL   ++P+TLT  + +     A   L L+LS+ +++ V++E+T   +WK+LE +  +K L++K++++
Subjt:  RVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNKMYLR

Query:  EKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLFVKAN
        +  + ++++   S+ ++L  FK+I+S+ +++E +   ++   +L  SLP +Y   ++++ Y ++S   D +  +L + +    L  K    G GL V+  
Subjt:  EKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLFVKAN

Query:  Y-----------------------------------------VNPFR-KRDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKL
                                                  VN  +    W+LDSGCT+HM+  R WF TY+ +S   V MGNN +C IAG+G++ +K+
Subjt:  Y-----------------------------------------VNPFR-KRDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKL

Query:  KDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVGGKE--------------------------------------------
         D  V  L  VRHVP LKRNLISL  LDS    Y  + GV ++   S  V+ G ++                                            
Subjt:  KDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVGGKE--------------------------------------------

Query:  ----------------------------KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKF
                                    ++T K+IKYLR DNG EFC + FN  CK   I RH  VR+TPQQN +AER+NRTIMEKVRC LS+  L + F
Subjt:  ----------------------------KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKF

Query:  WVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSCGT---------------------------------TNKKFIISRDVHFRETEMFMQ
        W E A+ A + +NRSP  ++   TP+E W           +  C                                    N+K +ISRDV F ET M   
Subjt:  WVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSCGT---------------------------------TNKKFIISRDVHFRETEMFMQ

Query:  GKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQ-----TKIVKEQSNLSQYSLARDRQR-------------TLPVIMNLV--------
                         + L+++ N     +  K ++ +I+ E      TKI    ++  QYS+A++R R              +   +N+         
Subjt:  GKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQ-----TKIVKEQSNLSQYSLARDRQR-------------TLPVIMNLV--------

Query:  ---LLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG--------------------------------------------
               +SC D+ +W+ AM EE+ SL  N TW L+ LPK  K +  KW++K KEG                                            
Subjt:  ---LLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG--------------------------------------------

Query:  --------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS----------------SKEDLTHVKTLLSKEFYMKDLGES
                + +L  K AFLHG L+E IYM Q +GF V  KED  CLL+KS+Y  E S  S                 K ++  VK  LS+EF MKDLG +
Subjt:  --------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS----------------SKEDLTHVKTLLSKEFYMKDLGES

Query:  RKILGIDITRDKNQSIRSISQSTYYEK
        +KILG++I RD+  S   +SQ  Y EK
Subjt:  RKILGIDITRDKNQSIRSISQSTYYEK

RVW81650.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 2.4e-101; identity: 29.3%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M  A+ ++EKF GK DF LW+ K++ALL QQ    ALL    LP+T+   Q  E+   A+  +IL+L D V+R+V + ++  ++W KLESLY TK L+N+
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF
        ++ + K +T+KM P  S+ ++LD F KI+ D ++++  +SD+++A +L  SL  +Y  +K+++ YG+DS   D + S L  REL+ Q   KE +SG GL 
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF

Query:  VKANYVNPFRK-----------------------------------------RDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVT
        ++       +K                                         ++W+LDSGC +HM   +AWF  +KE     V +GNN  C I G G+V 
Subjt:  VKANYVNPFRK-----------------------------------------RDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVT

Query:  MKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVG--------------------------------------------
        +K  D    +L +VR++P LKRNLISLGMLD  G  +K +    +V   S  V+ G                                            
Subjt:  MKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVG--------------------------------------------

Query:  -------------------------GK------------------------------------------------------------------------E
                                 GK                                                                        E
Subjt:  -------------------------GK------------------------------------------------------------------------E

Query:  KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEK
         Q  +++K LR DNG EF    FN FC++ GI RH+ VRYT QQN +AER+NRTI+E+VRC LS   L + FW E A   V+ +NRSP ++L   TP+EK
Subjt:  KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEK

Query:  WVSSCG----------TTNKKFIISRDVHFRETEMFMQGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARD
        W               T        +DV F E +M  Q        ++  +   + E        + + KT          Q +IV E+ N         
Subjt:  WVSSCG----------TTNKKFIISRDVHFRETEMFMQGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARD

Query:  RQRTLPVIM---------NLVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTE------------LTTKAAFLHGY
         Q   P+I                 ++ N+A +W++A+ EE++SL+ N+TW L+  PK+ K + SKW++K K+G                  K AFLHG 
Subjt:  RQRTLPVIM---------NLVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTE------------LTTKAAFLHGY

Query:  LDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-----------------------------------------------SSKEDLTHVKTLLSKE
        LDE IYM  P+GF    K+   CLLKKS+YGL+QSPR                                                 K  L  VK +L  E
Subjt:  LDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-----------------------------------------------SSKEDLTHVKTLLSKE

Query:  FYMKDLGESRKILGIDITRDKNQSI
        F MKDLG +++ILG++I RD+++ I
Subjt:  FYMKDLGESRKILGIDITRDKNQSI

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e-value: 6.5e-115; identity: 30.63%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DFALW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SKSL +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG+DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------
                                         + ++    AFLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                
Subjt:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------

Query:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
                                       SK+  ++  +K  LS EF MKDLGE ++ILG+D+ RDK + + +ISQ +Y  K
Subjt:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

TrEMBL top hits (e-value, % identity, alignment)
A0A2K3N065 Copia LTR rider (e-value: 7.8e-98; identity: 27.51%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   + EIEKF G  DF LW+ K+KALL QQ   +AL   + +   LTA +   M   A+  ++L+L D V+RQV +E T   +W KLESLY TK L N+
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF
        +YL++  +++KM   K L + LD F K++ D ++++ K+ D+++A +L  +LP ++   K +L YG++S   + + S L +++L  +  HK    G GL 
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF

Query:  VKANY--------------------------------------VNPFR------------------------------KRDWVLDSGCTYHMTLFRAWFN
        VK  +                                      V P R                              +++W++DSGCT+HMT  +  F 
Subjt:  VKANY--------------------------------------VNPFR------------------------------KRDWVLDSGCTYHMTLFRAWFN

Query:  TYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVGGK---------------
           +    SV +GNN AC IAG+GSV  KL DE++ LL  VR+VP LKRNL+SLG  D  G  ++G+  + +V   SKEVL G K               
Subjt:  TYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVGGK---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFW
                                  E QT +++K LR DNG EFC E F+ FC  SGI RH+    TPQQN +AER NRTI+E+VRC L+   L++ FW
Subjt:  --------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFW

Query:  VEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC-----------------------------------GTTNKKFIISRDVHFRETEMFM
         E  + A Y +NR P T+L + TPEE W           V  C                                      +K+ I SRDV F E EM  
Subjt:  VEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC-----------------------------------GTTNKKFIISRDVHFRETEMFM

Query:  QGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARDRQRTLPV------IMNLVLLRKLSCNDA---------
        +   +  RSTE +    ++E        +       I  E++ E     + +     Y L+RDR R +          +L+    +S ++          
Subjt:  QGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARDRQRTLPV------IMNLVLLRKLSCNDA---------

Query:  --------RRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTELTT-------------------------------------------
                  W++AM++E+ SL  N TW LI  P   + ++ KWI+K+KEG+  +T+                                           
Subjt:  --------RRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTELTT-------------------------------------------

Query:  ---------KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR----------------------------------------------
                 K AFL+G LDETI M QP+G+  +GKED  C LK+S+YGL+QSPR                                              
Subjt:  ---------KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR----------------------------------------------

Query:  --SSKEDLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
          ++ ED+T VK  L+KEF MKDLG + +ILGIDI RD+ +S   +SQ  Y  K
Subjt:  --SSKEDLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

A0A438HB17 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.2e-101; identity: 29.3%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M  A+ ++EKF GK DF LW+ K++ALL QQ    ALL    LP+T+   Q  E+   A+  +IL+L D V+R+V + ++  ++W KLESLY TK L+N+
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF
        ++ + K +T+KM P  S+ ++LD F KI+ D ++++  +SD+++A +L  SL  +Y  +K+++ YG+DS   D + S L  REL+ Q   KE +SG GL 
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLF

Query:  VKANYVNPFRK-----------------------------------------RDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVT
        ++       +K                                         ++W+LDSGC +HM   +AWF  +KE     V +GNN  C I G G+V 
Subjt:  VKANYVNPFRK-----------------------------------------RDWVLDSGCTYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVT

Query:  MKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVG--------------------------------------------
        +K  D    +L +VR++P LKRNLISLGMLD  G  +K +    +V   S  V+ G                                            
Subjt:  MKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVG--------------------------------------------

Query:  -------------------------GK------------------------------------------------------------------------E
                                 GK                                                                        E
Subjt:  -------------------------GK------------------------------------------------------------------------E

Query:  KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEK
         Q  +++K LR DNG EF    FN FC++ GI RH+ VRYT QQN +AER+NRTI+E+VRC LS   L + FW E A   V+ +NRSP ++L   TP+EK
Subjt:  KQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEK

Query:  WVSSCG----------TTNKKFIISRDVHFRETEMFMQGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARD
        W               T        +DV F E +M  Q        ++  +   + E        + + KT          Q +IV E+ N         
Subjt:  WVSSCG----------TTNKKFIISRDVHFRETEMFMQGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARD

Query:  RQRTLPVIM---------NLVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTE------------LTTKAAFLHGY
         Q   P+I                 ++ N+A +W++A+ EE++SL+ N+TW L+  PK+ K + SKW++K K+G                  K AFLHG 
Subjt:  RQRTLPVIM---------NLVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEGVTE------------LTTKAAFLHGY

Query:  LDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-----------------------------------------------SSKEDLTHVKTLLSKE
        LDE IYM  P+GF    K+   CLLKKS+YGL+QSPR                                                 K  L  VK +L  E
Subjt:  LDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-----------------------------------------------SSKEDLTHVKTLLSKE

Query:  FYMKDLGESRKILGIDITRDKNQSI
        F MKDLG +++ILG++I RD+++ I
Subjt:  FYMKDLGESRKILGIDITRDKNQSI

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class (e-value: 8.3e-116; identity: 31.16%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DFALW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SK L +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  -----VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS-------------------------------------------
             + ++     FLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                                            
Subjt:  -----VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS-------------------------------------------

Query:  ---SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
           SK+   +  +K  LS EF MKDLGE ++ILG+D+ RD+ + + +ISQ +Y  K
Subjt:  ---SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

A0A5A7UB25 Putative gag-pol polyprotein (e-value: 9.2e-115; identity: 30.54%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DF+LW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SKSL +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG+DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------
                                         + ++    AFLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                
Subjt:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------

Query:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
                                       SK+  ++  +K  LS EF MKDLGE ++ILG+D+ RDK + + +ISQ +Y  K
Subjt:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

A0A5D3DNU1 Putative gag-pol polyprotein (e-value: 3.2e-115; identity: 30.63%)
Query:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK
        M   R E+ KF+G  DFALW+ KI+A+L Q K  K +LD   LP  +T  +  +M   AY T++L LSD V+R V E  TT ++WKKLESLY TK L NK
Subjt:  MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNK

Query:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------
        +Y++EKFF YKMD SKSL +NLD+F+KIV D  ++ +K+SD+N+A +L NSLPE Y+EVK ++KYG+DS    +++  L+TR L+I+             
Subjt:  MYLREKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQ-------------

Query:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM
                                  L HKE         ++S      +AN  + +   +                            W++DSGCT+HM
Subjt:  --------------------------LSHKE---------HQSGNGLFVKANYVNPFRKRD----------------------------WVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------
        T  R +   ++++    V +G+N  C++ G GSV +   D  V +L NVR+VP LKRNLISLG LD  GC  K + GV +V   S               
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDS---------------

Query:  ------------------------------------------------------------------------------------------KEVLVGGK--
                                                                                                  KEV +GG   
Subjt:  ------------------------------------------------------------------------------------------KEVLVGGK--

Query:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD
                                          E QT +++KYLR DNG EF    FN FCK  GITRH  V YTPQQN +AER NRTIME+ RC L++
Subjt:  ----------------------------------EKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSD

Query:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH
          L  KFW E A  A Y +NRSP T+L L TP+E W           V  C        G  NK                           K IISRDV 
Subjt:  VILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC--------GTTNK---------------------------KFIISRDVH

Query:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT
        F ETEM    K   K+ T          A++    I+L+N        + T+    E DG Q++           ++E S   +L  Y L RD   R+R 
Subjt:  FRETEMFMQGKGNTKRSTE---------ATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTK----------IVKEQS---NLSQYSLARD---RQRT

Query:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------
         P+      ++   L+C                  +  ++W +AM EE+ SL  N TW+L+P P   K I SKWIYK+K G                   
Subjt:  LPVIMNL--VLLRKLSC------------------NDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG-------------------

Query:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------
                                         + ++    AFLHG L+E IYM QPKG+EV+GKED+ C L KS+YGL+QSPR                
Subjt:  ---------------------------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS---------------

Query:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK
                                       SK+  ++  +K  LS EF MKDLGE ++ILG+D+ RDK + + +ISQ +Y  K
Subjt:  -------------------------------SKE--DLTHVKTLLSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEK

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein (e value: 4.5e-18; %identity: 22.7)
Query:  KGKCGVFQVFMDSKEVLVGGKEKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAA
        K K  VF +F D     V   E   + ++ YL  DNG E+       FC + GI+ H  V +TPQ N V+ER+ RTI EK R  +S   L++ FW E   
Subjt:  KGKCGVFQVFMDSKEVLVGGKEKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAA

Query:  YAVYTLNRSPHTSL--GLLTPEEKW---------VSSCGTT-----------------------------------NKKFIISRDVHFRETEM-------
         A Y +NR P  +L     TP E W         +   G T                                   N+KFI++RDV   ET M       
Subjt:  YAVYTLNRSPHTSL--GLLTPEEKW---------VSSCGTT-----------------------------------NKKFIISRDVHFRETEM-------

Query:  ----FMQG--KGNTKRSTEATKTYTQIELENAR---NGAQFTKKT-------------KVIDQEIDGE-----QTKIVKEQSNLSQYSLARDRQR-----
            F++   +   K     ++   Q E  N     +  QF K +             K+I  E   E       + +K+    ++Y L   ++R     
Subjt:  ----FMQG--KGNTKRSTEATKTYTQIELENAR---NGAQFTKKT-------------KVIDQEIDGE-----QTKIVKEQSNLSQYSLARDRQR-----

Query:  -----------------------------------------------TLPVI--------MNLVLLRKLSC--------------NDARRWIEAMNEEIN
                                                       T P I        +N V+L   +               +D   W EA+N E+N
Subjt:  -----------------------------------------------TLPVI--------MNLVLLRKLSC--------------NDARRWIEAMNEEIN

Query:  SLKVNDTWTLIPLPKECKPIASKWIYKLKEG---------------------------------------------------VTELTTKAAFLHGYLDET
        + K+N+TWT+   P+    + S+W++ +K                                                     V ++  K AFL+G L E 
Subjt:  SLKVNDTWTLIPLPKECKPIASKWIYKLKEG---------------------------------------------------VTELTTKAAFLHGYLDET

Query:  IYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR
        IYM  P+G  +    D  C L K+IYGL+Q+ R
Subjt:  IYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.7e-60; %identity: 23.12)
Query:  RVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNKMYLR
        + E+ KF+G   F+ W+ +++ LL QQ  HK L   S+ P T+ A    ++   A   + L+LSD+V+  +++E+T   IW +LESLY +K L+NK+YL+
Subjt:  RVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNKMYLR

Query:  EKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTL-------------------------
        ++ +   M    +   +L+ F  +++   +L  K+ ++++A +L NSLP +Y  +  ++ +GK + +   + S L                         
Subjt:  EKFFTYKMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTL-------------------------

Query:  ------------------------------------------RTRELKIQLSHKEHQSGNGLFVKAN-----YVNPFR--------KRDWVLDSGCTYHM
                                                    R+ K + S +++       V+ N     ++N           + +WV+D+  ++H 
Subjt:  ------------------------------------------RTRELKIQLSHKEHQSGNGLFVKAN-----YVNPFR--------KRDWVLDSGCTYHM

Query:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCE-----------------------------
        T  R  F  Y      +V MGN +   IAGIG + +K       +L++VRHVP L+ NLIS   LD  G E                             
Subjt:  TLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCE-----------------------------

Query:  --------------------------------------------------------------------------------YKGKCG--------------
                                                                                        Y   CG              
Subjt:  --------------------------------------------------------------------------------YKGKCG--------------

Query:  --------------------VFQVFMDSKEVLVGGKEKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQL
                            VFQVF     ++    E++T +++K LR+DNG E+    F  +C   GI   K V  TPQ N VAER+NRTI+EKVR  L
Subjt:  --------------------VFQVFMDSKEVLVGGKEKQTDKEIKYLRADNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQL

Query:  SDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC------------------------------------GTTNKKFIISR
            L + FW E    A Y +NRSP   L    PE  W           V  C                                        KK I SR
Subjt:  SDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKW-----------VSSC------------------------------------GTTNKKFIISR

Query:  DVHFRETEM-----------------FMQGKGNTKRSTEATKTYTQIELENAR------NGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARDRQR
        DV FRE+E+                 F+     +   T A  T  ++  +  +       G Q  +  + ++    GE+      +S   +    R    
Subjt:  DVHFRETEM-----------------FMQGKGNTKRSTEATKTYTQIELENAR------NGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARDRQR

Query:  TLPVIMN----LVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG----------------------------------
           +I +      L   LS  +  + ++AM EE+ SL+ N T+ L+ LPK  +P+  KW++KLK+                                   
Subjt:  TLPVIMN----LVLLRKLSCNDARRWIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLKEG----------------------------------

Query:  -----------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-------SSKEDLTHVKTL-------------
                         V +L  K AFLHG L+E IYM QP+GFEV GK+ + C L KS+YGL+Q+PR       S  +  T++KT              
Subjt:  -----------------VTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPR-------SSKEDLTHVKTL-------------

Query:  ----------------------------LSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEKPQLPRFEGK
                                    LSK F MKDLG +++ILG+ I R++      +SQ  Y E+  L RF  K
Subjt:  ----------------------------LSKEFYMKDLGESRKILGIDITRDKNQSIRSISQSTYYEKPQLPRFEGK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.0e-06; %identity: 25.34)
Query:  RWIEAMNEEINSLKVNDTWTLI-PLPKECKPIASKWIYKLKEG---------------------------------------------------VTELTT
        RW  AM  EIN+   N TW L+ P P     +  +WI+  K                                                     + +L  
Subjt:  RWIEAMNEEINSLKVNDTWTLI-PLPKECKPIASKWIYKLKEG---------------------------------------------------VTELTT

Query:  KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS
          AFL G L + +YM QP GF  + + +  C L+K++YGL+Q+PR+
Subjt:  KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.7e-07; %identity: 28.08)
Query:  RWIEAMNEEINSLKVNDTWTLIPLPKECKPIAS-KWIYKLK---EG------------------------------------------------VTELTT
        RW +AM  EIN+   N TW L+P P     I   +WI+  K   +G                                                + +L  
Subjt:  RWIEAMNEEINSLKVNDTWTLIPLPKECKPIAS-KWIYKLK---EG------------------------------------------------VTELTT

Query:  KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS
          AFL G L + +YM QP GF  + + D  C L+K+IYGL+Q+PR+
Subjt:  KAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRS

Arabidopsis top hits (e value, %identity, alignment)
AT1G48720.1 unknown protein (e value: 1.3e-07; %identity: 42.11)
Query:  QLPRFEGKTYRRWSQQMKVLYGSQDLWDIVDIGYSEPESENGLSAQQLNELKDARKK
        Q+P      Y  WS +MK + G+ D+W+IV+ G+ EPE+E  LS  Q + L+D+RK+
Subjt:  QLPRFEGKTYRRWSQQMKVLYGSQDLWDIVDIGYSEPESENGLSAQQLNELKDARKK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.7e-10; %identity: 29.93)
Query:  WIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLK----------------------EGVT-----------------------------ELTTKA
        W  AM++EI +++   TW +  LP   KPI  KW+YK+K                      EG+                              +L    
Subjt:  WIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLK----------------------EGVT-----------------------------ELTTKA

Query:  AFLHGYLDETIYMVQPKGFEVQGKEDL----YCLLKKSIYGLEQSPR
        AFL+G LDE IYM  P G+  +  + L     C LKKSIYGL+Q+ R
Subjt:  AFLHGYLDETIYMVQPKGFEVQGKEDL----YCLLKKSIYGLEQSPR

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.9e-04; %identity: 38.6)
Query:  LNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKWVSSCGT
        +NRTI+EKVR  L +  L + F  + A  AV+ +N+ P T++    P+E W  S  T
Subjt:  LNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKWVSSCGT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 5.0e-04; %identity: 32.35)
Query:  WIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLK---EGVTELTTKAAFLHGY-LDETIYMVQ
        W +AM EE+++L  N TW L+P P     +  KW++K K   +G  +         G+  +E IY V+
Subjt:  WIEAMNEEINSLKVNDTWTLIPLPKECKPIASKWIYKLK---EGVTELTTKAAFLHGY-LDETIYMVQ


Sequences
CDS sequence
ATGGTAGTGGCAAGAGTGGAAATCGAGAAGTTCGATGGAAAGAGAGACTTTGCCTTATGGAAGGCAAAGATTAAGGCCTTGTTAGGACAACAAAAAGCTCACAAAGCCCT
TCTAGATTCTTCAGAACTTCCAACAACCCTCACAGCAGTACAAAATGAAGAAATGAAATTAAATGCCTATGGAACACTAATACTAAACCTTAGTGACAATGTTATAAGGC
AAGTACTAGAAGAAGAGACAACACACAAAATTTGGAAGAAATTAGAAAGTCTATATGCCACTAAAGATCTCTCAAACAAGATGTATCTAAGGGAAAAATTCTTTACATAT
AAAATGGATCCTTCCAAAAGTTTAACAGACAACTTAGATAAGTTCAAGAAAATAGTTTCAGACTTTAAAAGTCTTGAAGACAAACTCAGTGATGATAATGAGGCATTCGT
TCTCCAAAATTCTCTACCCGAGGCATACAAAGAAGTGAAAAATTCACTAAAGTATGGTAAAGACTCAGCCAAAACAGATGTCATAATATCAACCCTAAGAACCAGAGAAT
TGAAAATACAGTTATCTCATAAGGAACATCAAAGTGGAAATGGTTTGTTTGTCAAAGCCAATTATGTTAACCCCTTCCGAAAACGTGATTGGGTCCTAGACTCAGGATGC
ACCTACCATATGACACTTTTTAGAGCATGGTTCAATACCTATAAAGAAATCAGTAGAGAGTCTGTGTTTATGGGGAATAATAATGCTTGTAACATTGCTGGAATTGGATC
GGTCACCATGAAACTAAAGGATGAGACTGTAAATCTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAACTTAATCTCCTTGGGAATGCTAGACTCTCTAGGAT
GCGAATACAAAGGAAAATGTGGAGTCTTCCAAGTTTTCATGGACTCTAAAGAAGTATTGGTTGGGGGAAAAGAAAAACAAACTGATAAAGAGATTAAATACCTTAGAGCT
GACAATGGTTCGGAGTTTTGTGGAGAGGTGTTCAATCATTTTTGCAAGGAAAGTGGAATCACAAGACACAAAAATGTGAGATACACACCTCAACAAAATATGGTGGCAGA
AAGACTTAACAGAACTATAATGGAAAAGGTAAGATGCCAACTATCAGATGTCATTCTGGAAGAAAAGTTTTGGGTTGAAGTTGCTGCCTATGCGGTGTACACATTGAATA
GAAGCCCCCATACCTCCTTAGGACTCTTAACACCTGAGGAGAAATGGGTTTCAAGTTGTGGCACCACTAACAAGAAGTTCATAATTAGTAGGGATGTTCATTTCAGAGAA
ACTGAGATGTTTATGCAAGGGAAAGGTAATACTAAAAGGAGCACTGAAGCCACAAAAACCTATACTCAGATTGAACTGGAGAATGCTAGAAACGGTGCTCAATTTACTAA
GAAAACTAAAGTTATTGATCAAGAAATCGATGGAGAACAAACTAAAATAGTTAAAGAACAATCTAACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAACGCTCC
CAGTGATAATGAACCTAGTTCTTTTGAGGAAGTTAAGCTGTAATGATGCTAGACGGTGGATTGAAGCCATGAATGAAGAAATAAATTCTCTAAAGGTAAATGACACATGG
ACCCTTATCCCTTTACCTAAGGAATGCAAACCAATAGCATCTAAGTGGATCTATAAACTCAAGGAAGGAGTCACTGAACTCACTACCAAGGCAGCCTTCCTTCATGGCTA
TCTAGATGAAACAATTTACATGGTTCAACCTAAAGGTTTTGAAGTTCAAGGTAAGGAAGACCTCTACTGCTTACTAAAGAAGTCAATATATGGACTGGAGCAATCACCTA
GAAGCTCTAAGGAAGATTTAACTCATGTCAAAACTCTTTTAAGTAAAGAATTTTACATGAAGGATTTAGGTGAATCAAGAAAGATCCTAGGAATTGACATCACAAGAGAC
AAAAACCAATCCATACGAAGCATAAGTCAATCAACCTACTATGAGAAGCCTCAACTTCCTCGTTTTGAGGGAAAAACTTATAGGCGGTGGAGCCAGCAAATGAAGGTTCT
TTATGGATCTCAAGATCTTTGGGATATTGTTGACATCGGATATTCAGAGCCAGAAAGTGAGAATGGTCTTTCAGCACAACAACTCAATGAGTTGAAAGATGCTAGAAAAA
AAGGATAA
mRNA sequence
ATGGTAGTGGCAAGAGTGGAAATCGAGAAGTTCGATGGAAAGAGAGACTTTGCCTTATGGAAGGCAAAGATTAAGGCCTTGTTAGGACAACAAAAAGCTCACAAAGCCCT
TCTAGATTCTTCAGAACTTCCAACAACCCTCACAGCAGTACAAAATGAAGAAATGAAATTAAATGCCTATGGAACACTAATACTAAACCTTAGTGACAATGTTATAAGGC
AAGTACTAGAAGAAGAGACAACACACAAAATTTGGAAGAAATTAGAAAGTCTATATGCCACTAAAGATCTCTCAAACAAGATGTATCTAAGGGAAAAATTCTTTACATAT
AAAATGGATCCTTCCAAAAGTTTAACAGACAACTTAGATAAGTTCAAGAAAATAGTTTCAGACTTTAAAAGTCTTGAAGACAAACTCAGTGATGATAATGAGGCATTCGT
TCTCCAAAATTCTCTACCCGAGGCATACAAAGAAGTGAAAAATTCACTAAAGTATGGTAAAGACTCAGCCAAAACAGATGTCATAATATCAACCCTAAGAACCAGAGAAT
TGAAAATACAGTTATCTCATAAGGAACATCAAAGTGGAAATGGTTTGTTTGTCAAAGCCAATTATGTTAACCCCTTCCGAAAACGTGATTGGGTCCTAGACTCAGGATGC
ACCTACCATATGACACTTTTTAGAGCATGGTTCAATACCTATAAAGAAATCAGTAGAGAGTCTGTGTTTATGGGGAATAATAATGCTTGTAACATTGCTGGAATTGGATC
GGTCACCATGAAACTAAAGGATGAGACTGTAAATCTCCTTAGAAATGTAAGACATGTTCCTCACCTTAAAAGAAACTTAATCTCCTTGGGAATGCTAGACTCTCTAGGAT
GCGAATACAAAGGAAAATGTGGAGTCTTCCAAGTTTTCATGGACTCTAAAGAAGTATTGGTTGGGGGAAAAGAAAAACAAACTGATAAAGAGATTAAATACCTTAGAGCT
GACAATGGTTCGGAGTTTTGTGGAGAGGTGTTCAATCATTTTTGCAAGGAAAGTGGAATCACAAGACACAAAAATGTGAGATACACACCTCAACAAAATATGGTGGCAGA
AAGACTTAACAGAACTATAATGGAAAAGGTAAGATGCCAACTATCAGATGTCATTCTGGAAGAAAAGTTTTGGGTTGAAGTTGCTGCCTATGCGGTGTACACATTGAATA
GAAGCCCCCATACCTCCTTAGGACTCTTAACACCTGAGGAGAAATGGGTTTCAAGTTGTGGCACCACTAACAAGAAGTTCATAATTAGTAGGGATGTTCATTTCAGAGAA
ACTGAGATGTTTATGCAAGGGAAAGGTAATACTAAAAGGAGCACTGAAGCCACAAAAACCTATACTCAGATTGAACTGGAGAATGCTAGAAACGGTGCTCAATTTACTAA
GAAAACTAAAGTTATTGATCAAGAAATCGATGGAGAACAAACTAAAATAGTTAAAGAACAATCTAACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAAGAACGCTCC
CAGTGATAATGAACCTAGTTCTTTTGAGGAAGTTAAGCTGTAATGATGCTAGACGGTGGATTGAAGCCATGAATGAAGAAATAAATTCTCTAAAGGTAAATGACACATGG
ACCCTTATCCCTTTACCTAAGGAATGCAAACCAATAGCATCTAAGTGGATCTATAAACTCAAGGAAGGAGTCACTGAACTCACTACCAAGGCAGCCTTCCTTCATGGCTA
TCTAGATGAAACAATTTACATGGTTCAACCTAAAGGTTTTGAAGTTCAAGGTAAGGAAGACCTCTACTGCTTACTAAAGAAGTCAATATATGGACTGGAGCAATCACCTA
GAAGCTCTAAGGAAGATTTAACTCATGTCAAAACTCTTTTAAGTAAAGAATTTTACATGAAGGATTTAGGTGAATCAAGAAAGATCCTAGGAATTGACATCACAAGAGAC
AAAAACCAATCCATACGAAGCATAAGTCAATCAACCTACTATGAGAAGCCTCAACTTCCTCGTTTTGAGGGAAAAACTTATAGGCGGTGGAGCCAGCAAATGAAGGTTCT
TTATGGATCTCAAGATCTTTGGGATATTGTTGACATCGGATATTCAGAGCCAGAAAGTGAGAATGGTCTTTCAGCACAACAACTCAATGAGTTGAAAGATGCTAGAAAAA
AAGGATAA
Protein sequence
MVVARVEIEKFDGKRDFALWKAKIKALLGQQKAHKALLDSSELPTTLTAVQNEEMKLNAYGTLILNLSDNVIRQVLEEETTHKIWKKLESLYATKDLSNKMYLREKFFTY
KMDPSKSLTDNLDKFKKIVSDFKSLEDKLSDDNEAFVLQNSLPEAYKEVKNSLKYGKDSAKTDVIISTLRTRELKIQLSHKEHQSGNGLFVKANYVNPFRKRDWVLDSGC
TYHMTLFRAWFNTYKEISRESVFMGNNNACNIAGIGSVTMKLKDETVNLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKCGVFQVFMDSKEVLVGGKEKQTDKEIKYLRA
DNGSEFCGEVFNHFCKESGITRHKNVRYTPQQNMVAERLNRTIMEKVRCQLSDVILEEKFWVEVAAYAVYTLNRSPHTSLGLLTPEEKWVSSCGTTNKKFIISRDVHFRE
TEMFMQGKGNTKRSTEATKTYTQIELENARNGAQFTKKTKVIDQEIDGEQTKIVKEQSNLSQYSLARDRQRTLPVIMNLVLLRKLSCNDARRWIEAMNEEINSLKVNDTW
TLIPLPKECKPIASKWIYKLKEGVTELTTKAAFLHGYLDETIYMVQPKGFEVQGKEDLYCLLKKSIYGLEQSPRSSKEDLTHVKTLLSKEFYMKDLGESRKILGIDITRD
KNQSIRSISQSTYYEKPQLPRFEGKTYRRWSQQMKVLYGSQDLWDIVDIGYSEPESENGLSAQQLNELKDARKKG