; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G15470 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G15470
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr4:12853638..12855104
RNA-Seq ExpressionCSPI04G15470
SyntenyCSPI04G15470
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]1.4e-13149.8Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC     + +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+D    VKRC ICQT+KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD
        N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVVDRFSK THF+ CKKT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+ PF +VYT  P+  +DL  L     ++  AE M E I   
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
         + V   L+ T    K   DK+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++IS  FNVAD+  YHA
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

CAA7021913.1 unnamed protein product [Microthlaspi erraticum]7.7e-13048.81Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW++FLQ+F F+I+HKSG  NKVADALSR+ SLL+  ++E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC  +  + +FH+ DG+LFK ++LCIP +S  E L++E H  GL+GH G+DKTI  L  +YY P LRKD    V+RC ICQ +KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD
        N GLY PLPIP  IW+DLS+DFVL LP+ QR  ++V VVVDRFSK THF+ C+KT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        T+LK S+TAH Q+DGQ EVTNRTLGN+I  + G K +QWDLAL Q EFA+N   + +TG+ PF +VYT  P+  +DL  L     ++  AE M + I   
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA-----P
         + V   L+ T    K+  DK+RR   F EGD VM+ L+K RFP GTY KL+ R+ GP KIL K  DNAY ++LPDD++IS  FNVAD+  YHA     P
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA-----P

Query:  DEFS
        DE S
Subjt:  DEFS

CAA7028195.1 unnamed protein product [Microthlaspi erraticum]1.4e-13149.8Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC     + +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+D    VKRC ICQT+KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD
        N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVVDRFSK THF+ CKKT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+ PF +VYT  P+  +DL  L     ++  AE M E I   
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
         + V   L+ T    K   DK+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++IS  FNVAD+  YHA
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]6.5e-12948.99Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F F+I+HKSG  NKVADALSR+ SLL+  + E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC     + +FHI +G+LFK ++LCIP +S RE L++E H  GL+GH G+DKTI  L  +YY P LRKD    V+RC +CQ +KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD
        N GLY PL +P  IW+DLS+DFVL LP+ QR  ++V VVVD+FSK THF+ C+KT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        T+LK S+TAH Q+DGQTEVTNRTLGN+I  + G K +QWDLAL Q EFA+N+  + +TG+ PF +VYT  P+  +DL  L     ++  A+ M + I + 
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
         + V   L+ T    K+  DKK+R   FKEGD VM+ LKK RFP GTY KL+ R+ GP KIL K  DNAY ++LPDD+ IS  FNVAD+  YHA
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense]1.8e-13149.08Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+IRAL+ WE YL+ +EF+L T+H ++KYL +Q+S + + ARW ++LQ+F FV+KHKSG  NKVADALSR+ SLL     E+I F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        Y DD DF ++W  C+      EFH+ +G+LF   +LCIP +S RE L++E H  GL GH G+DKTI  ++ +YY PQL++D  NFV++C + QT+KG + 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD
        N GLY PLP+P AIWEDL++DFVL LP+ QR  ++V VVVDRFSK  HF+PC+KT+DA                                       ++D
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        T LKFS+TAH QTDGQTE  NRTLGNLI  + G K +QWD+ALAQ+EFA+NN  + +TG+ PF +VY + P+ ALDLA L     ++  AE M E+++++
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADL
          +V   L++    YK   D KRRE  F EGD VM+ L+K RFP G+YNKLK R+ GP K++ K  +NAY I+LP +++IS  FNVADL
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADL

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein2.7e-14451.31Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        W+ YE E++A++RAL+ WE YL+ +EF++ ++H ++K++  Q S +++  RWI+FLQRF FV+KHK+G++NKVADALSR+ +LL++ SSE+ +F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        Y++D DF + W KC     + EFHI DG+LFK  +LCIP TS RE +L++ HS GL GH G+DKTI ++  +YY PQL++D   FV++CPICQT KG + 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD
        N GLY PLP+P  IWEDL++DF+L LP+ QR  ++V VVVDRFSK  HF+PCKKT+DA                                       K D
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        T+L++S+TAH QTDGQTEVTNRTLGNLI C SG + +QWD+ L Q EFA+N M NRST + PFEIVYTK P+ ALDLA L      +  AEN  +R   +
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHAPDE
         ++V ++L+K    YK   DK RR   F EGDLVM+ L+KNRFP GTYNKLK+R+ GP ++  K  DNAY +ELPDD+ IS  FNVADL  YH PDE
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHAPDE

A0A6D2HLB5 Reverse transcriptase6.8e-13249.8Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC     + +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+D    VKRC ICQT+KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD
        N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVVDRFSK THF+ CKKT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+ PF +VYT  P+  +DL  L     ++  AE M E I   
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
         + V   L+ T    K   DK+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++IS  FNVAD+  YHA
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

A0A6D2IKM3 Reverse transcriptase6.8e-13249.8Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D +F ++W KC     + +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+D    VKRC ICQT+KG S 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD
        N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVVDRFSK THF+ CKKT DA                                          
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAK---------------------------------------LD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+ PF +VYT  P+  +DL  L     ++  AE M E I   
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
         + V   L+ T    K   DK+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++IS  FNVAD+  YHA
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

A0A6N2KHU6 Reverse transcriptase5.2e-13248.99Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS Y+QE YA++RAL+ WE YL+ ++F+L T+H ++KYL +QK+ + + ARW ++LQ+F F++KHKS   NKVADALSR+ +LL     EV+ F+ L  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        YE D DF  +W KC       EFHIVD +LF+  +LC+P +S RE L++E H  GL+GH G+DKTI  ++ +YY PQL++D  N V++C +CQT+KG + 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD
        N GLY PLP+P AIWEDLS+DFVL LP+ QR  ++V VVVDRFSK  HF+PC+KT+DA                                       ++D
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTLKFS+TAH QTDGQTE  NRTLGNLI  + G K +QWD++LAQ+EFA+NN  + +TGR PF +VY KSP+ ALDLA L      +  AEN+ E  +++
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA
          +V    ++    YK   D KRRE  F EGD VM+ L+K RFP  TYNKLK R+ GP  I+ K  DNAY ++L  D+HIS  FNVADL  + A
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHA

A0A6N2LVR1 Uncharacterized protein1.1e-13749.7Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL
        WS YE E+YA+ RA++ WE YL+ +EF+L ++H ++K++  Q + N++ ARW++F+QRF+F +KHKSG+ NKVADALSRK SLL+   +EVI F+ +  L
Subjt:  WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYL

Query:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST
        Y  D DF   W KC   L     H  DG+LF+  +LCIP +S RE ++ E H  GL GH G+DKT+ +   +YY PQL++D  N VKRCP CQ +KG + 
Subjt:  YEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTST

Query:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD
        N GLY PLPIP   WEDLS+DF+L LP+ QR  ++V VVVDRFSK  HF+ CKKT+DA                                       + D
Subjt:  NAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA---------------------------------------KLD

Query:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL
        TTL FS+T+H QTDGQTEV NRTLGNLI CLSG + +QWDL LAQ+EFA+N+M NRSTG+ PF++VY + P+ ALDL  L     +N  AE+M +R++ +
Subjt:  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNL

Query:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHAPDE
         ++V ++L+ +   YK   DKKRR   FKEGDLVM++L+K R P GT +KL D++ GP +IL K  DNAY+++LP D+ ISP FNVADL  YH PDE
Subjt:  HKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHAPDE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.6e-4530.17Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI
        +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FLQ F+F I ++ G  N +ADALSR       +   S    I
Subjt:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI

Query:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR
         F             +   Y +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H G +    I+  ++    +R
Subjt:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR

Query:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------
        K    +V+ C  CQ  K  S N   Y PL PIP +   WE LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Subjt:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------

Query:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L
                             K +  +KFS     QTDGQTE TN+T+  L+ C+  +    W   ++  + ++NN  + +T   PFEIV+  SP L+ L
Subjt:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L

Query:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL
        +L +     D     EN  E IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL     GP  +L K G N Y+++L
Subjt:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL

Query:  PDDLH--ISPVFNVADLKTYHAPDEFS
        PD +    S  F+V+ L+ Y    E +
Subjt:  PDDLH--ISPVFNVADLKTYHAPDEFS

P0CT41 Transposon Tf2-12 polyprotein4.6e-4530.17Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI
        +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FLQ F+F I ++ G  N +ADALSR       +   S    I
Subjt:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI

Query:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR
         F             +   Y +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H G +    I+  ++    +R
Subjt:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR

Query:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------
        K    +V+ C  CQ  K  S N   Y PL PIP +   WE LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Subjt:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------

Query:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L
                             K +  +KFS     QTDGQTE TN+T+  L+ C+  +    W   ++  + ++NN  + +T   PFEIV+  SP L+ L
Subjt:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L

Query:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL
        +L +     D     EN  E IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL     GP  +L K G N Y+++L
Subjt:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL

Query:  PDDLH--ISPVFNVADLKTYHAPDEFS
        PD +    S  F+V+ L+ Y    E +
Subjt:  PDDLH--ISPVFNVADLKTYHAPDEFS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.4e-4627.22Show/hide
Query:  EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD
        E E+  +I+AL  +   L  K F L T+H S+  LQ +    +   RW+  L  +DF +++ +G +N VADA+SR    ++  +S  I  +     Y+ D
Subjt:  EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD

Query:  -------------TDFN---------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGHFGQDKTIEILSSKYYLPQLR
                     T  N         + + K +   ET  + + + D  ++ +++L +P    + A+++  H   L  GHFG   T+  +S  YY P+L+
Subjt:  -------------TDFN---------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGHFGQDKTIEILSSKYYLPQLR

Query:  KDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA--------------------
             +++ C  CQ  K       GL  PLPI    W D+S+DFV  LP      N ++VVVDRFSKR HF+  +KT DA                    
Subjt:  KDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA--------------------

Query:  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLA
                           +L      S+  H QTDGQ+E T +TL  L+     +  + W + L Q EF +N+   R+ G+ PFEI     P    +  
Subjt:  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLA

Query:  NLLSNADINNEAENMIERIQNLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELP
         + S+ ++N  +   +E  ++L     Q  E L+   +  + + +++R+ +    GD V++H +   F  G Y K++   +GP +++ K  DNAY+++L 
Subjt:  NLLSNADINNEAENMIERIQNLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELP

Query:  DDLHISPVFNVADLKTYH
               V NV  LK+ +
Subjt:  DDLHISPVFNVADLKTYH

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.2e-4727.29Show/hide
Query:  EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD
        E E+  +I+AL  +   L  K F L T+H S+  LQ +    +   RW+  L  +DF +++ +G +N VADA+SR    ++  +S  I  +     Y+ D
Subjt:  EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD

Query:  -------------TDFN---------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGHFGQDKTIEILSSKYYLPQLR
                     T  N         + + K +   ET  + + + D  ++ +++L +P    + A+++  H   L  GHFG   T+  +S  YY P+L+
Subjt:  -------------TDFN---------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGHFGQDKTIEILSSKYYLPQLR

Query:  KDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA--------------------
             +++ C  CQ  K       GL  PLPI    W D+S+DFV  LP      N ++VVVDRFSKR HF+  +KT DA                    
Subjt:  KDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA--------------------

Query:  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLA
                           +L      S+  H QTDGQ+E T +TL  L+   + +  + W + L Q EF +N+   R+ G+ PFEI     P    +  
Subjt:  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLA

Query:  NLLSNADINNEAENMIERIQNLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELP
         + S+ ++N  +   +E  ++L     Q  E L+   +  + + +++R+ +    GD V++H +   F  G Y K++   +GP +++ K  DNAY+++L 
Subjt:  NLLSNADINNEAENMIERIQNLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELP

Query:  DDLHISPVFNVADLKTY-HAPDEF
               V NV  LK + + PD +
Subjt:  DDLHISPVFNVADLKTY-HAPDEF

Q9UR07 Transposon Tf2-11 polyprotein4.6e-4530.17Show/hide
Query:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI
        +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FLQ F+F I ++ G  N +ADALSR       +   S    I
Subjt:  WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVI

Query:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR
         F             +   Y +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H G +    I+  ++    +R
Subjt:  AF-----------KHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLR

Query:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------
        K    +V+ C  CQ  K  S N   Y PL PIP +   WE LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Subjt:  KDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------

Query:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L
                             K +  +KFS     QTDGQTE TN+T+  L+ C+  +    W   ++  + ++NN  + +T   PFEIV+  SP L+ L
Subjt:  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-L

Query:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL
        +L +     D     EN  E IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL     GP  +L K G N Y+++L
Subjt:  DLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIEL

Query:  PDDLH--ISPVFNVADLKTYHAPDEFS
        PD +    S  F+V+ L+ Y    E +
Subjt:  PDDLH--ISPVFNVADLKTYHAPDEFS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAGCCCTTATGAGCAAGAAATGTATGCCCTCATTCGAGCATTAAGGCAGTGGGAAGATTACCTACTATCTAAAGAGTTTCTCCTATTAACAAATCATTTCTCCAT
AAAATACCTCCAAGCTCAGAAATCTACCAACAAGATACCCGCTAGATGGATCTCCTTTTTACAGAGATTTGACTTTGTTATCAAGCATAAAAGCGGAAAAGAAAACAAAG
TAGCTGATGCACTAAGTAGAAAGCATTCCTTACTTTCTATATCATCATCCGAGGTGATAGCATTCAAACACTTACCATACTTATATGAAGATGACACTGACTTCAACAAG
ATGTGGTACAAATGCATTCATCACCTCGAAACAAGAGAATTTCATATTGTTGATGGATTTCTATTCAAAGAAGAAAAATTGTGCATACCCCATACTTCCCCAAGAGAAGC
TCTACTAAAGGAGGCACACTCCGAAGGACTAGCTGGTCACTTTGGGCAAGACAAAACAATTGAAATACTTTCCTCCAAATACTATTTGCCACAATTGCGAAAAGACACAA
ATAACTTTGTAAAGCGATGCCCTATATGCCAAACAACTAAAGGGACCAGTACTAATGCTGGATTATACAACCCCTTACCGATTCCTACAGCTATATGGGAGGATTTATCA
GTAGACTTCGTATTGGAATTACCTAAGGCACAAAGACAGCATAATACTGTAATGGTAGTTGTAGACAGATTCAGCAAAAGGACCCACTTTTTACCCTGCAAGAAGACTAA
TGATGCTAAACTAGATACCACACTTAAATTCAGTACTACAGCACACTCACAAACAGATGGTCAAACAGAAGTCACTAACAGAACCTTGGGGAATCTAATTTGTTGCCTAA
GTGGCTCAAAGCATAGACAGTGGGATCTAGCGTTAGCTCAATCTGAATTCGCGTTCAATAATATGAAAAATAGATCAACCGGGAGACGTCCCTTTGAAATTGTCTATACT
AAAAGTCCTAGACTTGCACTTGATCTTGCCAATTTGCTATCAAATGCAGATATCAACAATGAAGCAGAAAATATGATTGAAAGAATACAAAATCTGCACAAACAGGTACA
TGAACACCTACAGAAAACAACTTTATCTTATAAACAAGATAAAGACAAGAAAAGAAGAGAAGTCAAATTCAAAGAAGGAGATCTGGTGATGATACATCTGAAGAAAAACC
GGTTCCCAACAGGGACATACAATAAACTAAAAGACAGACAACTTGGGCCTTGCAAAATACTAGCAAAGTATGGTGATAATGCCTATAAAATTGAACTGCCAGACGACCTA
CACATCAGCCCTGTTTTCAATGTAGCAGACCTGAAGACCTACCATGCCCCAGATGAATTCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGAGCCCTTATGAGCAAGAAATGTATGCCCTCATTCGAGCATTAAGGCAGTGGGAAGATTACCTACTATCTAAAGAGTTTCTCCTATTAACAAATCATTTCTCCAT
AAAATACCTCCAAGCTCAGAAATCTACCAACAAGATACCCGCTAGATGGATCTCCTTTTTACAGAGATTTGACTTTGTTATCAAGCATAAAAGCGGAAAAGAAAACAAAG
TAGCTGATGCACTAAGTAGAAAGCATTCCTTACTTTCTATATCATCATCCGAGGTGATAGCATTCAAACACTTACCATACTTATATGAAGATGACACTGACTTCAACAAG
ATGTGGTACAAATGCATTCATCACCTCGAAACAAGAGAATTTCATATTGTTGATGGATTTCTATTCAAAGAAGAAAAATTGTGCATACCCCATACTTCCCCAAGAGAAGC
TCTACTAAAGGAGGCACACTCCGAAGGACTAGCTGGTCACTTTGGGCAAGACAAAACAATTGAAATACTTTCCTCCAAATACTATTTGCCACAATTGCGAAAAGACACAA
ATAACTTTGTAAAGCGATGCCCTATATGCCAAACAACTAAAGGGACCAGTACTAATGCTGGATTATACAACCCCTTACCGATTCCTACAGCTATATGGGAGGATTTATCA
GTAGACTTCGTATTGGAATTACCTAAGGCACAAAGACAGCATAATACTGTAATGGTAGTTGTAGACAGATTCAGCAAAAGGACCCACTTTTTACCCTGCAAGAAGACTAA
TGATGCTAAACTAGATACCACACTTAAATTCAGTACTACAGCACACTCACAAACAGATGGTCAAACAGAAGTCACTAACAGAACCTTGGGGAATCTAATTTGTTGCCTAA
GTGGCTCAAAGCATAGACAGTGGGATCTAGCGTTAGCTCAATCTGAATTCGCGTTCAATAATATGAAAAATAGATCAACCGGGAGACGTCCCTTTGAAATTGTCTATACT
AAAAGTCCTAGACTTGCACTTGATCTTGCCAATTTGCTATCAAATGCAGATATCAACAATGAAGCAGAAAATATGATTGAAAGAATACAAAATCTGCACAAACAGGTACA
TGAACACCTACAGAAAACAACTTTATCTTATAAACAAGATAAAGACAAGAAAAGAAGAGAAGTCAAATTCAAAGAAGGAGATCTGGTGATGATACATCTGAAGAAAAACC
GGTTCCCAACAGGGACATACAATAAACTAAAAGACAGACAACTTGGGCCTTGCAAAATACTAGCAAAGTATGGTGATAATGCCTATAAAATTGAACTGCCAGACGACCTA
CACATCAGCCCTGTTTTCAATGTAGCAGACCTGAAGACCTACCATGCCCCAGATGAATTCAGTTAG
Protein sequenceShow/hide protein sequence
MWSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNK
MWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLS
VDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAKLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYT
KSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDL
HISPVFNVADLKTYHAPDEFS