; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21010 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21010
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:17104476..17107556
RNA-Seq ExpressionCSPI03G21010
SyntenyCSPI03G21010
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032332.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.5e-20844.85Show/hide
Query:  RSL-VKSQGAEESKNNLADCLSNL----KLEVQRILLSFGSVFESRNQLTPPRNHD------PGARAVNVRPY------------PLNLGTIPDKYPIPV
        RSL V++ G EE       CL N     K  +  ++  +  VF+   +L P R  +       G   +NVRPY             +N  TIPDK+PIPV
Subjt:  RSL-VKSQGAEESKNNLADCLSNL----KLEVQRILLSFGSVFESRNQLTPPRNHD------PGARAVNVRPY------------PLNLGTIPDKYPIPV

Query:  VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVV
        V+EL DEL GA +FSKIDLKSGYHQIR+   D+ KTA RTHEGHYEFLVMPFGL NAPATFQ+ MN + +P+LR+FVLVFFDDIL+Y+++ +EH   L +
Subjt:  VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVV

Query:  VLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNA
        VL+ L  H+L AN KKCQFA ++IEYLGH+IS +GVA DP KI A+          E +GFLGLTGYYR+FV NYG+IA PLTQLLKKG F W E A  A
Subjt:  VLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNA

Query:  FQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ
        F +LK AM+S+PVL +PDF K    +       +G   + D               R + VYERELMAIV AVQ+W PYLL   F V+TD+K+LKFLL+Q
Subjt:  FQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ

Query:  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTP-EGYSLHREVLCYHGRLV
        R I  +YQ+WI KLLGY F + YK G+EN+AA+ALSR P   +L  +S+   ++  +   +V+ +     I   L +G+    +GYSL + +L Y  RLV
Subjt:  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTP-EGYSLHREVLCYHGRLV

Query:  LSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVI
        + + S   P++L  FHNS +GGH G L TY+R+A E+YW+GMKA I+ +  EC  CQ+ K ++L+PAGLL  L IP  +W +ISMDF+EGLPKS  Y+VI
Subjt:  LSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVI

Query:  LVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNTP
        LVVVDRLSKY HF+PLKHPF AKVVA +F++EI+ LHG P SIVSDRD                              GQTEVVNRGVETYLRCF    P
Subjt:  LVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNTP

Query:  KQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA--------------------
        K+W KWL W E  YNTT   +I MTPF+  YGR PP+I+      S  + V+ +++ERD +L  L+ +L  AQ++M  YA                    
Subjt:  KQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA--------------------

Query:  ------------------------------------KLTDG---RPLF----------------PIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLI
                                            +L D     P+F                P  + +  + + +  P EV+  RKT    ++ EVL+
Subjt:  ------------------------------------KLTDG---RPLF----------------PIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLI

Query:  FWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNK
         W       A+WE    ++E+F   HLEDKV L G  NDRP I +VYSRR K
Subjt:  FWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNK

KFK28310.1 hypothetical protein AALP_AA8G499800 [Arabis alpina]4.9e-21247.89Show/hide
Query:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI
        LN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYHQI V+  +V KTA RTH+GHYEFLVMPFGL NAP TFQ+ MN++ R +LRKFVLVFFDDIL+
Subjt:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI

Query:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL
        Y+ SL+EH + L VVL+ L   +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ AM           L+GFLGLTGYYR+FV  YGSIA PLT LL
Subjt:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL

Query:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV
        KK KFQW+E A  AF++LK AM +VPVL + DF ++   +     + LG              QAL D  + K+VYERELMAIV A+QKW  YLLG+ F+
Subjt:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV

Query:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS
        VRTD+KSLKFLLEQR +  EYQ+W+TK+LG++F I YK G+ENKAA+ALSR+  + +L  +SV + +      ++V  N     I+  ++    T  GYS
Subjt:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS

Query:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF
        + +  L Y+G+LVL ++S  I +LL EFHNS +GGH G L T + L    YW+GM A I+T+VAEC VCQ+ KY +L P+GLLQ LPIP +VWEDIS+DF
Subjt:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF

Query:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG
        +EGLPKSE +D ILVVVDRL+KYAHFI L+HPFGAK +AAVF++EI+ LHG P ++VSDRD                              GQTEV NRG
Subjt:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG

Query:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRM-----------
        +ET LRCF  + PK+WA +L WAE  YN++ H+AI MTPF+A YGR PPS+L     S+    ++  +KERD ++ ILK N+LKAQQ M           
Subjt:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRM-----------

Query:  --------------------------------------------------------------VKYAKLTDGRPLFP--IPKDLAADLSMQLSPVEVLGVR
                                                                      V   KL  G    P  +P  L A+  ++  P   +GVR
Subjt:  --------------------------------------------------------------VKYAKLTDGRPLFP--IPKDLAADLSMQLSPVEVLGVR

Query:  KTREKGDKLEVLIFWSDGTPE-SATWEQATLIQEQFLEFHLEDKVALWGAG-----NDRPHIVKVYSRRNKSGKK
             G + EVLI W  G PE  +TWE   +IQEQF EF LEDK     AG     +++  +V  Y RR K GK+
Subjt:  KTREKGDKLEVLIFWSDGTPE-SATWEQATLIQEQFLEFHLEDKVALWGAG-----NDRPHIVKVYSRRNKSGKK

PKU67907.1 putative mitochondrial protein [Dendrobium catenatum]3.0e-21749.65Show/hide
Query:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI
        LN  T+PDK+PIPV+DEL+DEL GA +F+KIDLKSGYHQIR+R  DV KTA RTHEGHYEFLVMPFGL NAPATFQ+ MN I +P+LR+FVLVFFDDILI
Subjt:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI

Query:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL
        Y+++ EEHL+ L VVL  L  H+L ANFKKC FA  ++EYLGH+IS +GVAAD +KI AM           L+GFLGLTGYYR+FV  Y +IA PLT+LL
Subjt:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL

Query:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV
        KK  F W E A  AF++LK AM +VPVL +PDF +V   +       LG              Q L    R K+VYERELMAIV A+QKW PYLLG+ F+
Subjt:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV

Query:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS
        VRTD++SLK+LLEQR +  E+QRW++KLLGYDF I Y+ G+ENKAA+ALSR    F+   +SV   ++     ++   NE L  I+  +++ + +  GYS
Subjt:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS

Query:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF
        +  E L Y GR V+   S  IP LL EFH S IGGH G   TY R+A E+YW+GM+  I   VA+C  CQ+ K++++ P GLLQ L +P++VWE+I+MDF
Subjt:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF

Query:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG
        I+GLPKSE Y VI VVVDRLSKYAHFIPL+HP+ A  VA  F+REI+ LHG P SIVSDRD                              GQTEVVNR 
Subjt:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG

Query:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVK---------
        +ETYLRCF   TPKQWAKWLAWAE  YNT+ H+A  MTPF+  YGR PP ++     S+PV+ VD  +KERD +L  LK +LL+AQQ M K         
Subjt:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVK---------

Query:  --------YAKLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLGVR
                Y KL   R                                          P+F                 +P  L  D+ + L P+ V GVR
Subjt:  --------YAKLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLGVR

Query:  KTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRP
           EKG+K EV I WS      ATWE    I  QF  FHLEDKV LW  GN  P
Subjt:  KTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRP

TYK10423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-20845.53Show/hide
Query:  KLEVQRI---LLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTAL
        K E++R+   +L+ G +  S +  + P    +  D G R  V+ R   LN  T+PDK+PIP++DELLDEL GA+IFSKIDLKSGYHQIRVR  D+ KTA 
Subjt:  KLEVQRI---LLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTAL

Query:  RTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAA
        RTHEGHYEFLVMPFGL NAPATFQ+ MN + RPYLRKF+LVFFDDIL+Y++ +E HL+ L +V + L  H L AN +KC FA DRIEYLGH +S+ GV A
Subjt:  RTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAA

Query:  DPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL--
        D  KI AM          EL+GFLGLTGYYR+FVANYG+IA PLT+L KK  F+W+E A  AF+QLK AM+++PVL +PDF   L +++      +GL  
Subjt:  DPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL--

Query:  -------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALS
                     Q L +T R K+VYERELMAIV AV+KW  YLLG  FVV TD+K+L+ +LEQR I    Q+W+ KL+G+DF I Y+ G ENKAA+ALS
Subjt:  -------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALS

Query:  RLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREV
        R+P   EL  I+V S L+ ++   +V  +E L +I   ++        Y++ +  L Y GRLV+S  S  IP +L  FH+S +GGH G L TY+R+A E+
Subjt:  RLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREV

Query:  YWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLH
        +W GMK  I+ YV  C VCQQ K  +L+PAGLLQ LPIP+R+WEDISMDF+EGLP+S+ +D ILVVVDRLSKYAHFI L HPF AKVVA VF++E++ LH
Subjt:  YWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLH

Query:  GCPRSIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPS
        G PRSIVSDRD                              GQTEVVN+ +E YLRC      K W+  +AWAE  YNT   ++I  TP+   YG+PPP 
Subjt:  GCPRSIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPS

Query:  ILPCVKE-SSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYAK-------------------------------------------------------
        I+   +  ++P + V+  ++ RD +L++LK +L  AQ+RM K+A                                                        
Subjt:  ILPCVKE-SSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYAK-------------------------------------------------------

Query:  --------------------LTDGRPLFPIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGA
                            + D  P++P    +   + + L P  VL  R+  E     E L+ W D     ATWE   +++ QF   HLEDKVAL   
Subjt:  --------------------LTDGRPLFPIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGA

Query:  GNDRPHIVKVYSRRNKSG
        G  RP I +VY RR K G
Subjt:  GNDRPHIVKVYSRRNKSG

XP_010541181.1 PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana]1.9e-20843.47Show/hide
Query:  ILQGDRS-LVKSQGAEESKNNLADCLSNLKLEVQRILLSFGSVFESRNQLTP------PRNHDPGARAVNVRPY--------------------------
        +  GD+S LVK +  EE         SNL  ++Q +L  FG VFE   +L P      P N   G   V+VRPY                          
Subjt:  ILQGDRS-LVKSQGAEESKNNLADCLSNLKLEVQRILLSFGSVFESRNQLTP------PRNHDPGARAVNVRPY--------------------------

Query:  -------------------------PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPAT
                                  LN  T+ DK+PIP++D+LLDEL GA +FSK+DL+SGYHQIR++  D+ KTA RTH+GHYEFLVMPFGL NAPAT
Subjt:  -------------------------PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPAT

Query:  FQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQG
        FQ+ MN+I RPYLRKFVLVFFDDIL+Y+ SL++H   L  VL  L  HKL AN KKC+F   +I+YLGHIIS +GV+ DP K  AM          EL+G
Subjt:  FQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQG

Query:  FLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPK---------------VLCWKLMFQVLILGLQALPDTHRF
        FLGLTGYYR+FV NYG+IA PLT LLKK  F W+E A +AF++LK AM S PVLG+PDF +               VL  K   + +    QAL +  R 
Subjt:  FLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPK---------------VLCWKLMFQVLILGLQALPDTHRF

Query:  KAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLP--PVFE---LGLISVVSGL
        K VYERELMA+V ++Q+W  YLLG+ F+V TD+K+LKFLLEQR +  EYQRW+TKLLGYDF I Y+ G+ENKAA+ LSR+P   + E   +GL   +   
Subjt:  KAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLP--PVFE---LGLISVVSGL

Query:  NPSIFIDQVAGNEA-LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAEC
           + +++  G ++ L  I   L  G+     Y L + +L Y  RLV+S+ S  IP +LAEFH+S +GGH G L T +R+    +W GMKA I+ YVAEC
Subjt:  NPSIFIDQVAGNEA-LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAEC

Query:  SVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----
        +VCQ  KY +L PAGLLQ LPIP+ +WEDISMDFIEGLP+S  Y+V+LVVVDRLSKYAHFI LKHPF A VVA VF++E++ LHG P+SIVSDRD     
Subjt:  SVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----

Query:  ------------------------HGQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDH
                                 GQTEV+NR +ETYLRC+A + P++W ++L+WAE  YNT+ HTA+  TPF+  YGR PP++L   + S+   E++ 
Subjt:  ------------------------HGQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDH

Query:  LMKERDGILRILKANLLKAQQRM-----------------------------------------------------------VKYAKLTDGRPLFPI---
         ++ERD ++  +K  L  AQQRM                                                           +K  K  +  P+F I   
Subjt:  LMKERDGILRILKANLLKAQQRM-----------------------------------------------------------VKYAKLTDGRPLFPI---

Query:  -------------PKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNKS
                     P+ L  DL +Q+ P ++   R T+E   +LEVL+ W D     +TWE A    +QF  F LEDK+   G   D+    +VY R  K 
Subjt:  -------------PKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNKS

Query:  GKK
        GK+
Subjt:  GKK

TrEMBL top hitse value%identityAlignment
A0A087GEK8 Uncharacterized protein2.4e-21247.89Show/hide
Query:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI
        LN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYHQI V+  +V KTA RTH+GHYEFLVMPFGL NAP TFQ+ MN++ R +LRKFVLVFFDDIL+
Subjt:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI

Query:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL
        Y+ SL+EH + L VVL+ L   +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ AM           L+GFLGLTGYYR+FV  YGSIA PLT LL
Subjt:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL

Query:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV
        KK KFQW+E A  AF++LK AM +VPVL + DF ++   +     + LG              QAL D  + K+VYERELMAIV A+QKW  YLLG+ F+
Subjt:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV

Query:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS
        VRTD+KSLKFLLEQR +  EYQ+W+TK+LG++F I YK G+ENKAA+ALSR+  + +L  +SV + +      ++V  N     I+  ++    T  GYS
Subjt:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS

Query:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF
        + +  L Y+G+LVL ++S  I +LL EFHNS +GGH G L T + L    YW+GM A I+T+VAEC VCQ+ KY +L P+GLLQ LPIP +VWEDIS+DF
Subjt:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF

Query:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG
        +EGLPKSE +D ILVVVDRL+KYAHFI L+HPFGAK +AAVF++EI+ LHG P ++VSDRD                              GQTEV NRG
Subjt:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG

Query:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRM-----------
        +ET LRCF  + PK+WA +L WAE  YN++ H+AI MTPF+A YGR PPS+L     S+    ++  +KERD ++ ILK N+LKAQQ M           
Subjt:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRM-----------

Query:  --------------------------------------------------------------VKYAKLTDGRPLFP--IPKDLAADLSMQLSPVEVLGVR
                                                                      V   KL  G    P  +P  L A+  ++  P   +GVR
Subjt:  --------------------------------------------------------------VKYAKLTDGRPLFP--IPKDLAADLSMQLSPVEVLGVR

Query:  KTREKGDKLEVLIFWSDGTPE-SATWEQATLIQEQFLEFHLEDKVALWGAG-----NDRPHIVKVYSRRNKSGKK
             G + EVLI W  G PE  +TWE   +IQEQF EF LEDK     AG     +++  +V  Y RR K GK+
Subjt:  KTREKGDKLEVLIFWSDGTPE-SATWEQATLIQEQFLEFHLEDKVALWGAG-----NDRPHIVKVYSRRNKSGKK

A0A2I0VWY1 Putative mitochondrial protein1.4e-21749.65Show/hide
Query:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI
        LN  T+PDK+PIPV+DEL+DEL GA +F+KIDLKSGYHQIR+R  DV KTA RTHEGHYEFLVMPFGL NAPATFQ+ MN I +P+LR+FVLVFFDDILI
Subjt:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI

Query:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL
        Y+++ EEHL+ L VVL  L  H+L ANFKKC FA  ++EYLGH+IS +GVAAD +KI AM           L+GFLGLTGYYR+FV  Y +IA PLT+LL
Subjt:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL

Query:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV
        KK  F W E A  AF++LK AM +VPVL +PDF +V   +       LG              Q L    R K+VYERELMAIV A+QKW PYLLG+ F+
Subjt:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL-------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV

Query:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS
        VRTD++SLK+LLEQR +  E+QRW++KLLGYDF I Y+ G+ENKAA+ALSR    F+   +SV   ++     ++   NE L  I+  +++ + +  GYS
Subjt:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYS

Query:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF
        +  E L Y GR V+   S  IP LL EFH S IGGH G   TY R+A E+YW+GM+  I   VA+C  CQ+ K++++ P GLLQ L +P++VWE+I+MDF
Subjt:  LHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDF

Query:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG
        I+GLPKSE Y VI VVVDRLSKYAHFIPL+HP+ A  VA  F+REI+ LHG P SIVSDRD                              GQTEVVNR 
Subjt:  IEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRG

Query:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVK---------
        +ETYLRCF   TPKQWAKWLAWAE  YNT+ H+A  MTPF+  YGR PP ++     S+PV+ VD  +KERD +L  LK +LL+AQQ M K         
Subjt:  VETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVK---------

Query:  --------YAKLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLGVR
                Y KL   R                                          P+F                 +P  L  D+ + L P+ V GVR
Subjt:  --------YAKLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLGVR

Query:  KTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRP
           EKG+K EV I WS      ATWE    I  QF  FHLEDKV LW  GN  P
Subjt:  KTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRP

A0A5A7SMR6 Ty3/gypsy retrotransposon protein1.2e-20844.85Show/hide
Query:  RSL-VKSQGAEESKNNLADCLSNL----KLEVQRILLSFGSVFESRNQLTPPRNHD------PGARAVNVRPY------------PLNLGTIPDKYPIPV
        RSL V++ G EE       CL N     K  +  ++  +  VF+   +L P R  +       G   +NVRPY             +N  TIPDK+PIPV
Subjt:  RSL-VKSQGAEESKNNLADCLSNL----KLEVQRILLSFGSVFESRNQLTPPRNHD------PGARAVNVRPY------------PLNLGTIPDKYPIPV

Query:  VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVV
        V+EL DEL GA +FSKIDLKSGYHQIR+   D+ KTA RTHEGHYEFLVMPFGL NAPATFQ+ MN + +P+LR+FVLVFFDDIL+Y+++ +EH   L +
Subjt:  VDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVV

Query:  VLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNA
        VL+ L  H+L AN KKCQFA ++IEYLGH+IS +GVA DP KI A+          E +GFLGLTGYYR+FV NYG+IA PLTQLLKKG F W E A  A
Subjt:  VLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNA

Query:  FQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ
        F +LK AM+S+PVL +PDF K    +       +G   + D               R + VYERELMAIV AVQ+W PYLL   F V+TD+K+LKFLL+Q
Subjt:  FQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGLQALPDTH-------------RFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQ

Query:  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTP-EGYSLHREVLCYHGRLV
        R I  +YQ+WI KLLGY F + YK G+EN+AA+ALSR P   +L  +S+   ++  +   +V+ +     I   L +G+    +GYSL + +L Y  RLV
Subjt:  RAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTP-EGYSLHREVLCYHGRLV

Query:  LSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVI
        + + S   P++L  FHNS +GGH G L TY+R+A E+YW+GMKA I+ +  EC  CQ+ K ++L+PAGLL  L IP  +W +ISMDF+EGLPKS  Y+VI
Subjt:  LSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVI

Query:  LVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNTP
        LVVVDRLSKY HF+PLKHPF AKVVA +F++EI+ LHG P SIVSDRD                              GQTEVVNRGVETYLRCF    P
Subjt:  LVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRD-----------------------------HGQTEVVNRGVETYLRCFAMNTP

Query:  KQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA--------------------
        K+W KWL W E  YNTT   +I MTPF+  YGR PP+I+      S  + V+ +++ERD +L  L+ +L  AQ++M  YA                    
Subjt:  KQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA--------------------

Query:  ------------------------------------KLTDG---RPLF----------------PIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLI
                                            +L D     P+F                P  + +  + + +  P EV+  RKT    ++ EVL+
Subjt:  ------------------------------------KLTDG---RPLF----------------PIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLI

Query:  FWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNK
         W       A+WE    ++E+F   HLEDKV L G  NDRP I +VYSRR K
Subjt:  FWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNK

A0A5D3BEL2 Ty3/gypsy retrotransposon protein3.6e-20846.67Show/hide
Query:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI
        LN  T+PDK+PIPVV+EL DEL GA++F+KIDLK+GYHQIR+   D+ KTA RTHEGHYEFLVMPFGL NAPATFQS MN I RPYLR+FVLVFFDDILI
Subjt:  LNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILI

Query:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL
        Y+++LE+HL+ +  V   L  H+L AN KKC F + ++EYLGH+IS+ GV  DP KI A+          E +GFLGLTGYYRKFV +YG++A PLTQLL
Subjt:  YNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLL

Query:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKL------MFQVLILGLQ-------ALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV
        KKG F WN  A+ AF++LK AM+++PVL +P F K    +       +  VLI   +        L    R + VYERELMA+V AVQ+W PYLLG  F+
Subjt:  KKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKL------MFQVLILGLQ-------ALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFV

Query:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEG--
        VRTD+KSLKFLLEQR +  +YQRW+ KLLGY F +EYK G+ENKAA+ALSR+ P  ++  I+V   L+  I  ++V  +  L  I ++ +NG  T +   
Subjt:  VRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEG--

Query:  YSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISM
        + +H  +L Y  RLV+S+ S  IP +L  +H+S +GGH G L TY+R++ E+YW+GMKA ++ Y AEC +CQQ K L L+PAGLL  L IP  +W DISM
Subjt:  YSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISM

Query:  DFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH-----------------------------GQTEVVN
        DF+EGLPK+  ++VI VVVDRLSKY HFIPLKHP+ AK VA +F++E++ LHG P SIVSDRD                              GQTEVVN
Subjt:  DFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH-----------------------------GQTEVVN

Query:  RGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA-----
        RGVE YLRCF  + PK+W KW+ WAE  YNTT   A+ MTPF+  YGR PP +L    + +P   +D  +KERD ++  L+ NL  AQ++M KYA     
Subjt:  RGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYA-----

Query:  ------------KLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLG
                    K+   R                                          P+F                P  + L  +   +  PVE L 
Subjt:  ------------KLTDGR------------------------------------------PLF----------------PIPKDLAADLSMQLSPVEVLG

Query:  VRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNKSGKKK
         R  R K  + EV+I W   +    TWE    + +++ +FHLEDKV+L G  N RP I+  YSR+NK   +K
Subjt:  VRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNKSGKKK

A0A5D3CEX8 Ty3/gypsy retrotransposon protein2.7e-20845.53Show/hide
Query:  KLEVQRI---LLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTAL
        K E++R+   +L+ G +  S +  + P    +  D G R  V+ R   LN  T+PDK+PIP++DELLDEL GA+IFSKIDLKSGYHQIRVR  D+ KTA 
Subjt:  KLEVQRI---LLSFGSVFESRNQLTPP----RNHDPGAR-AVNVRPYPLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTAL

Query:  RTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAA
        RTHEGHYEFLVMPFGL NAPATFQ+ MN + RPYLRKF+LVFFDDIL+Y++ +E HL+ L +V + L  H L AN +KC FA DRIEYLGH +S+ GV A
Subjt:  RTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAA

Query:  DPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL--
        D  KI AM          EL+GFLGLTGYYR+FVANYG+IA PLT+L KK  F+W+E A  AF+QLK AM+++PVL +PDF   L +++      +GL  
Subjt:  DPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGL--

Query:  -------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALS
                     Q L +T R K+VYERELMAIV AV+KW  YLLG  FVV TD+K+L+ +LEQR I    Q+W+ KL+G+DF I Y+ G ENKAA+ALS
Subjt:  -------------QALPDTHRFKAVYERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALS

Query:  RLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREV
        R+P   EL  I+V S L+ ++   +V  +E L +I   ++        Y++ +  L Y GRLV+S  S  IP +L  FH+S +GGH G L TY+R+A E+
Subjt:  RLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREV

Query:  YWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLH
        +W GMK  I+ YV  C VCQQ K  +L+PAGLLQ LPIP+R+WEDISMDF+EGLP+S+ +D ILVVVDRLSKYAHFI L HPF AKVVA VF++E++ LH
Subjt:  YWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLH

Query:  GCPRSIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPS
        G PRSIVSDRD                              GQTEVVN+ +E YLRC      K W+  +AWAE  YNT   ++I  TP+   YG+PPP 
Subjt:  GCPRSIVSDRDH-----------------------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPS

Query:  ILPCVKE-SSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYAK-------------------------------------------------------
        I+   +  ++P + V+  ++ RD +L++LK +L  AQ+RM K+A                                                        
Subjt:  ILPCVKE-SSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYAK-------------------------------------------------------

Query:  --------------------LTDGRPLFPIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGA
                            + D  P++P    +   + + L P  VL  R+  E     E L+ W D     ATWE   +++ QF   HLEDKVAL   
Subjt:  --------------------LTDGRPLFPIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGTPESATWEQATLIQEQFLEFHLEDKVALWGA

Query:  GNDRPHIVKVYSRRNKSG
        G  RP I +VY RR K G
Subjt:  GNDRPHIVKVYSRRNKSG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.5e-8630.64Show/hide
Query:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL
        PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +E+LVMP+G+  APA FQ  +N IL       V+ + DDIL
Subjt:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL

Query:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL
        I++KS  EH++ +  VL+ L    L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG   Y RKF+     +  PL  L
Subjt:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL

Query:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY
        LKK  +++W  T   A + +K  ++S PVL   DF K +  +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Subjt:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY

Query:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA
        L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+ALSR       +P   E   I+ V+ ++    F +QV     
Subjt:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA

Query:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY
         ++  L+L+N     E   +   +    G L+ S+D   +P        ++ ++H      H G       + R   W+G++  I+ YV  C  CQ  K 
Subjt:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY

Query:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------
         +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Subjt:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------

Query:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI
                         GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A  MTPFE  + R  P++ P ++  S  ++ D   +E   +
Subjt:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI

Query:  LRILKANLLKAQQRMVKY
         + +K +L     +M KY
Subjt:  LRILKANLLKAQQRMVKY

P0CT35 Transposon Tf2-2 polyprotein2.5e-8630.64Show/hide
Query:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL
        PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +E+LVMP+G+  APA FQ  +N IL       V+ + DDIL
Subjt:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL

Query:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL
        I++KS  EH++ +  VL+ L    L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG   Y RKF+     +  PL  L
Subjt:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL

Query:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY
        LKK  +++W  T   A + +K  ++S PVL   DF K +  +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Subjt:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY

Query:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA
        L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+ALSR       +P   E   I+ V+ ++    F +QV     
Subjt:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA

Query:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY
         ++  L+L+N     E   +   +    G L+ S+D   +P        ++ ++H      H G       + R   W+G++  I+ YV  C  CQ  K 
Subjt:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY

Query:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------
         +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Subjt:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------

Query:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI
                         GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A  MTPFE  + R  P++ P ++  S  ++ D   +E   +
Subjt:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI

Query:  LRILKANLLKAQQRMVKY
         + +K +L     +M KY
Subjt:  LRILKANLLKAQQRMVKY

P0CT36 Transposon Tf2-3 polyprotein2.5e-8630.64Show/hide
Query:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL
        PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +E+LVMP+G+  APA FQ  +N IL       V+ + DDIL
Subjt:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL

Query:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL
        I++KS  EH++ +  VL+ L    L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG   Y RKF+     +  PL  L
Subjt:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL

Query:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY
        LKK  +++W  T   A + +K  ++S PVL   DF K +  +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Subjt:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY

Query:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA
        L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+ALSR       +P   E   I+ V+ ++    F +QV     
Subjt:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA

Query:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY
         ++  L+L+N     E   +   +    G L+ S+D   +P        ++ ++H      H G       + R   W+G++  I+ YV  C  CQ  K 
Subjt:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY

Query:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------
         +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Subjt:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------

Query:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI
                         GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A  MTPFE  + R  P++ P ++  S  ++ D   +E   +
Subjt:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI

Query:  LRILKANLLKAQQRMVKY
         + +K +L     +M KY
Subjt:  LRILKANLLKAQQRMVKY

P0CT37 Transposon Tf2-4 polyprotein2.5e-8630.64Show/hide
Query:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL
        PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +E+LVMP+G+  APA FQ  +N IL       V+ + DDIL
Subjt:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL

Query:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL
        I++KS  EH++ +  VL+ L    L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG   Y RKF+     +  PL  L
Subjt:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL

Query:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY
        LKK  +++W  T   A + +K  ++S PVL   DF K +  +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Subjt:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY

Query:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA
        L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+ALSR       +P   E   I+ V+ ++    F +QV     
Subjt:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA

Query:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY
         ++  L+L+N     E   +   +    G L+ S+D   +P        ++ ++H      H G       + R   W+G++  I+ YV  C  CQ  K 
Subjt:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY

Query:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------
         +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Subjt:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------

Query:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI
                         GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A  MTPFE  + R  P++ P ++  S  ++ D   +E   +
Subjt:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI

Query:  LRILKANLLKAQQRMVKY
         + +K +L     +M KY
Subjt:  LRILKANLLKAQQRMVKY

P0CT41 Transposon Tf2-12 polyprotein2.5e-8630.64Show/hide
Query:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL
        PLN    P+ YP+P++++LL ++ G+TIF+K+DLKS YH IRVR  D HK A R   G +E+LVMP+G+  APA FQ  +N IL       V+ + DDIL
Subjt:  PLNLGTIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDIL

Query:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL
        I++KS  EH++ +  VL+ L    L+ N  KC+F   +++++G+ IS  G       I  +          EL+ FLG   Y RKF+     +  PL  L
Subjt:  IYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLGHIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQL

Query:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY
        LKK  +++W  T   A + +K  ++S PVL   DF K +  +     + +G  L    D  ++                 +V ++E++AI+++++ W  Y
Subjt:  LKKG-KFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILG--LQALPDTHRF----------------KAVYERELMAIVRAVQKWCPY

Query:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA
        L    +PF + TD ++L  +   E         RW   L  ++F I Y+ G  N  A+ALSR       +P   E   I+ V+ ++    F +QV     
Subjt:  LLG--KPFVVRTDKKSL--KFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSR-------LPPVFELGLISVVSGLN-PSIFIDQVAGNEA

Query:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY
         ++  L+L+N     E   +   +    G L+ S+D   +P        ++ ++H      H G       + R   W+G++  I+ YV  C  CQ  K 
Subjt:  LNSIRLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIP-------LLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKY

Query:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------
         +  P G LQ +P  +R WE +SMDFI  LP+S  Y+ + VVVDR SK A  +P      A+  A +F + +I   G P+ I++D DH            
Subjt:  LSLTPAGLLQALPIPDRVWEDISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDH------------

Query:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI
                         GQTE  N+ VE  LRC     P  W   ++  + SYN  +H+A  MTPFE  + R  P++ P ++  S  ++ D   +E   +
Subjt:  -----------------GQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTAILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGI

Query:  LRILKANLLKAQQRMVKY
         + +K +L     +M KY
Subjt:  LRILKANLLKAQQRMVKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.4e-2650.79Show/hide
Query:  LQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLG--HIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQ
        +  L +VL+    H+  AN KKC F   +I YLG  HIIS +GV+ADP K+ AM          EL+GFLGLTGYYR+FV NYG I  PLT+LLKK   +
Subjt:  LQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLG--HIISSDGVAADPTKIMAM----------ELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQ

Query:  WNETAKNAFQQLKFAMMSVPVLGIPD
        W E A  AF+ LK A+ ++PVL +PD
Subjt:  WNETAKNAFQQLKFAMMSVPVLGIPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGGACAGAAGCCTCGTAAAATCACAGGGAGCGGAAGAGTCAAAGAATAATTTAGCGGACTGTCTGAGTAA
TCTAAAACTCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCAGAAATCAGTTGACACCACCTCGCAATCATGATCCAGGGGCCCGGGCGGTGAATG
TGCGGCCTTATCCACTGAATCTTGGCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACCATATTTTCAAAAATTGAT
CTAAAGTCAGGCTATCACCAGATTAGAGTGCGAGTCGCCGATGTTCACAAAACGGCGCTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAA
GAATGCTCCAGCCACTTTTCAATCAGGAATGAATGATATTCTCCGCCCATACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAACAAGTCACTAG
AAGAACATTTGCAGCAATTGGTCGTGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTTAAAAAATGTCAATTCGCAGTAGATCGAATTGAATATTTGGGT
CACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAATGGCTATGGAATTACAAGGTTTCTTGGGTCTTACAGGTTATTATCGGAAATTTGTGGCAAACTA
TGGTTCTATTGCACTACCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAAAAAATGCGTTCCAACAATTAAAATTTGCTATGATGTCGGTAC
CGGTGTTAGGTATTCCAGATTTTCCCAAGGTTTTGTGCTGGAAACTGATGTTTCAGGTGTTGATATTGGGGCTTCAAGCTTTGCCCGATACCCATAGGTTTAAAGCTGTG
TATGAACGGGAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGTGCCCATATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATAAAAAAAGTTTGAAGTTTCTTCT
TGAACAACGAGCCATTGGGGGAGAATATCAGAGATGGATCACTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAATAAAGCGGCTAATGCTT
TATCACGATTACCACCGGTATTTGAATTGGGCCTCATAAGTGTTGTGAGCGGGCTCAATCCTTCGATTTTCATTGATCAAGTGGCTGGGAATGAAGCTCTGAATAGTATT
CGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCACTACATAGAGAGGTTCTATGCTATCACGGCCGGTTGGTTCTTTCGGAGGATTCCCCTACTATCCC
CTTATTATTAGCGGAGTTTCATAACAGCCCGATAGGGGGACATCATGGCGCGTTGACAACATATCAAAGGCTAGCAAGGGAGGTATATTGGCGAGGTATGAAAGCACACA
TCCGCACATATGTTGCCGAATGTTCGGTTTGTCAACAAGCCAAATATTTGTCATTAACTCCAGCCGGTCTTCTTCAAGCATTACCCATTCCGGACCGTGTATGGGAGGAT
ATTAGTATGGATTTTATTGAGGGTCTTCCTAAATCTGAGTGCTATGATGTCATATTGGTGGTTGTGGATAGACTTTCCAAGTATGCACATTTCATTCCTTTGAAGCATCC
TTTTGGTGCAAAAGTTGTAGCTGCTGTTTTTATGCGGGAAATCATTCATTTGCATGGATGCCCCCGAAGTATAGTGTCGGATCGTGATCATGGTCAAACCGAAGTTGTCA
ATCGAGGGGTGGAGACGTATCTACGATGTTTTGCTATGAACACACCTAAGCAGTGGGCTAAATGGCTAGCTTGGGCTGAATTGAGCTATAATACAACTGTCCATACAGCC
ATATTGATGACCCCCTTCGAGGCTCACTATGGTCGACCACCTCCCTCTATATTACCATGCGTGAAGGAGTCGAGCCCGGTGAATGAAGTAGATCATTTGATGAAGGAAAG
AGATGGAATTCTGAGAATTTTGAAGGCCAATTTATTGAAGGCACAACAGCGAATGGTCAAATACGCCAAATTGACCGACGGGAGGCCTCTATTTCCAATTCCAAAAGATT
TGGCTGCTGATTTGTCGATGCAATTGTCTCCGGTAGAGGTGTTGGGAGTTCGCAAAACCAGAGAAAAGGGGGACAAGTTAGAAGTTTTGATTTTCTGGAGTGACGGTACA
CCTGAAAGTGCTACATGGGAACAAGCCACTCTTATTCAAGAACAATTTCTTGAATTCCACCTTGAGGACAAGGTGGCTCTTTGGGGGGCGGGCAATGATAGACCCCACAT
AGTAAAGGTGTATTCACGTAGGAATAAAAGTGGGAAGAAGAAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGGACAGAAGCCTCGTAAAATCACAGGGAGCGGAAGAGTCAAAGAATAATTTAGCGGACTGTCTGAGTAA
TCTAAAACTCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCAGAAATCAGTTGACACCACCTCGCAATCATGATCCAGGGGCCCGGGCGGTGAATG
TGCGGCCTTATCCACTGAATCTTGGCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACCATATTTTCAAAAATTGAT
CTAAAGTCAGGCTATCACCAGATTAGAGTGCGAGTCGCCGATGTTCACAAAACGGCGCTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAA
GAATGCTCCAGCCACTTTTCAATCAGGAATGAATGATATTCTCCGCCCATACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAACAAGTCACTAG
AAGAACATTTGCAGCAATTGGTCGTGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTTAAAAAATGTCAATTCGCAGTAGATCGAATTGAATATTTGGGT
CACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAATGGCTATGGAATTACAAGGTTTCTTGGGTCTTACAGGTTATTATCGGAAATTTGTGGCAAACTA
TGGTTCTATTGCACTACCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAAAAAATGCGTTCCAACAATTAAAATTTGCTATGATGTCGGTAC
CGGTGTTAGGTATTCCAGATTTTCCCAAGGTTTTGTGCTGGAAACTGATGTTTCAGGTGTTGATATTGGGGCTTCAAGCTTTGCCCGATACCCATAGGTTTAAAGCTGTG
TATGAACGGGAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGTGCCCATATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATAAAAAAAGTTTGAAGTTTCTTCT
TGAACAACGAGCCATTGGGGGAGAATATCAGAGATGGATCACTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAATAAAGCGGCTAATGCTT
TATCACGATTACCACCGGTATTTGAATTGGGCCTCATAAGTGTTGTGAGCGGGCTCAATCCTTCGATTTTCATTGATCAAGTGGCTGGGAATGAAGCTCTGAATAGTATT
CGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCACTACATAGAGAGGTTCTATGCTATCACGGCCGGTTGGTTCTTTCGGAGGATTCCCCTACTATCCC
CTTATTATTAGCGGAGTTTCATAACAGCCCGATAGGGGGACATCATGGCGCGTTGACAACATATCAAAGGCTAGCAAGGGAGGTATATTGGCGAGGTATGAAAGCACACA
TCCGCACATATGTTGCCGAATGTTCGGTTTGTCAACAAGCCAAATATTTGTCATTAACTCCAGCCGGTCTTCTTCAAGCATTACCCATTCCGGACCGTGTATGGGAGGAT
ATTAGTATGGATTTTATTGAGGGTCTTCCTAAATCTGAGTGCTATGATGTCATATTGGTGGTTGTGGATAGACTTTCCAAGTATGCACATTTCATTCCTTTGAAGCATCC
TTTTGGTGCAAAAGTTGTAGCTGCTGTTTTTATGCGGGAAATCATTCATTTGCATGGATGCCCCCGAAGTATAGTGTCGGATCGTGATCATGGTCAAACCGAAGTTGTCA
ATCGAGGGGTGGAGACGTATCTACGATGTTTTGCTATGAACACACCTAAGCAGTGGGCTAAATGGCTAGCTTGGGCTGAATTGAGCTATAATACAACTGTCCATACAGCC
ATATTGATGACCCCCTTCGAGGCTCACTATGGTCGACCACCTCCCTCTATATTACCATGCGTGAAGGAGTCGAGCCCGGTGAATGAAGTAGATCATTTGATGAAGGAAAG
AGATGGAATTCTGAGAATTTTGAAGGCCAATTTATTGAAGGCACAACAGCGAATGGTCAAATACGCCAAATTGACCGACGGGAGGCCTCTATTTCCAATTCCAAAAGATT
TGGCTGCTGATTTGTCGATGCAATTGTCTCCGGTAGAGGTGTTGGGAGTTCGCAAAACCAGAGAAAAGGGGGACAAGTTAGAAGTTTTGATTTTCTGGAGTGACGGTACA
CCTGAAAGTGCTACATGGGAACAAGCCACTCTTATTCAAGAACAATTTCTTGAATTCCACCTTGAGGACAAGGTGGCTCTTTGGGGGGCGGGCAATGATAGACCCCACAT
AGTAAAGGTGTATTCACGTAGGAATAAAAGTGGGAAGAAGAAAGAATGA
Protein sequenceShow/hide protein sequence
MEFSLGEFLVILQGDRSLVKSQGAEESKNNLADCLSNLKLEVQRILLSFGSVFESRNQLTPPRNHDPGARAVNVRPYPLNLGTIPDKYPIPVVDELLDELFGATIFSKID
LKSGYHQIRVRVADVHKTALRTHEGHYEFLVMPFGLKNAPATFQSGMNDILRPYLRKFVLVFFDDILIYNKSLEEHLQQLVVVLETLVVHKLVANFKKCQFAVDRIEYLG
HIISSDGVAADPTKIMAMELQGFLGLTGYYRKFVANYGSIALPLTQLLKKGKFQWNETAKNAFQQLKFAMMSVPVLGIPDFPKVLCWKLMFQVLILGLQALPDTHRFKAV
YERELMAIVRAVQKWCPYLLGKPFVVRTDKKSLKFLLEQRAIGGEYQRWITKLLGYDFVIEYKKGMENKAANALSRLPPVFELGLISVVSGLNPSIFIDQVAGNEALNSI
RLSLINGQPTPEGYSLHREVLCYHGRLVLSEDSPTIPLLLAEFHNSPIGGHHGALTTYQRLAREVYWRGMKAHIRTYVAECSVCQQAKYLSLTPAGLLQALPIPDRVWED
ISMDFIEGLPKSECYDVILVVVDRLSKYAHFIPLKHPFGAKVVAAVFMREIIHLHGCPRSIVSDRDHGQTEVVNRGVETYLRCFAMNTPKQWAKWLAWAELSYNTTVHTA
ILMTPFEAHYGRPPPSILPCVKESSPVNEVDHLMKERDGILRILKANLLKAQQRMVKYAKLTDGRPLFPIPKDLAADLSMQLSPVEVLGVRKTREKGDKLEVLIFWSDGT
PESATWEQATLIQEQFLEFHLEDKVALWGAGNDRPHIVKVYSRRNKSGKKKE