; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G30790 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G30790
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr1:25287502..25289421
RNA-Seq ExpressionCSPI01G30790
SyntenyCSPI01G30790
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]1.8e-21456.35Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++ +K+ AI  WP P S+ E+++F GL +FYR+F+R+FS++ AP+T+CLKKG F W   Q +SF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  P+ +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE LI++ H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +RRD+   VKRC ICQ +KG S+N  LY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMTHF+ACKKT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVPK+I+SDRD KFLSHFW TL + F TTLK S+TA+PQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  AE MAE I    + V   L  T    K AADK+RR   F +GD VMV L+K RFP GTY KL+  + GPF +L K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  ++I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

CAA7021913.1 unnamed protein product [Microthlaspi erraticum]7.7e-21356.19Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++  K+ AI  W  P S+ E+++F GLA+F R+F+R+FS++ AP+T+CLKKG F+W   Q ESF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  PI +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW++FLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLLT L++EI+ F+ L +LYE D +FK++W KC     + D+H+ +GYLFKG++LCIP +SL E LI+E H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +R+D+   V+RC ICQ +KG S+N  LY PLPIP  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMTHF+AC+KT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+E+VRLHGVPKSIVSDRD KFLSHFW TL + F T+LK S+TA+PQ+DGQ EVTNRTLGN++R + G KPKQWDLAL Q EFA+N   + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  AE MA++I +  + V   L  T    K+AADK+RR   F +GD VMV L+K RFP GTY KL+ R+ GPF IL+K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  ++I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

CAA7028195.1 unnamed protein product [Microthlaspi erraticum]1.8e-21456.35Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++ +K+ AI  WP P S+ E+++F GL +FYR+F+R+FS++ AP+T+CLKKG F W   Q +SF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  P+ +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE LI++ H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +RRD+   VKRC ICQ +KG S+N  LY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMTHF+ACKKT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVPK+I+SDRD KFLSHFW TL + F TTLK S+TA+PQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  AE MAE I    + V   L  T    K AADK+RR   F +GD VMV L+K RFP GTY KL+  + GPF +L K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  ++I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]3.7e-21556.83Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++  K+ AI  WP P +  E+++F GLA+FYR+F+R+FS++ AP+T+CLKKG F W P Q ESF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  PI +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLLT L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI EGYLFKG++LCIP +SLRE LI+E H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +R+D+   V+RC +CQ +KG S+N  LY PL +P  IW+DLS+DFV+GLP+TQR  DS+ V+VD+FSKMTHF+AC+KT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVPKSIVSDRD KFLSHFW TL + F T+LK S+TA+PQ+DGQTEVTNRTLGN++R + G KPKQWDLAL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  A+ MA++I    + V   L  T    K+AADKK+R   F +GD VMV LKK RFP GTY KL+ R+ GPF IL+K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  + I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

XP_024641774.2 uncharacterized protein LOC112422671 [Medicago truncatula]1.9e-21156.89Show/hide
Query:  IAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPF
        + FLGFI+ +  I ++ +K+ AI  WP P S+ E+++F GLA+FYR+FIR+FS++ AP+T+CLKKG FKW   Q +SF  IKKKL ++P+L LPDF   F
Subjt:  IAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPF

Query:  EVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKH
        +V  DA   GIGAVL+Q+  PI +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+H
Subjt:  EVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKH

Query:  QSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKT
        +SG  NKVADALSR+ SLL  L+ E++ F+ L +LYE D +FK+++ KC       D+HI EGYLFKG+QLCIP +SLRE LI++ HSGGL+GH G++KT
Subjt:  QSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKT

Query:  LEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLF
        +  + +R+YWP +R+D    VK+C  CQ +KG S+N  LY PLPIP  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKM+HF+ACK+T DA  IA LF
Subjt:  LEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLF

Query:  FKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEV
        F+EVVRLHGVP SI SDRD KFLSHFW TL K F T+L  S+TA+PQTDGQTEVTNRTLGN++RC+ G KPKQWDLAL Q EFA+N+  + +TGK+PF +
Subjt:  FKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEV

Query:  VYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKY
        VYT +PR   DL  LP    ++  AE MAE I  +   V   L  T    K AADK+RR   FN GD VMV L+K RFP GTY KL+ R+ GPF +  K 
Subjt:  VYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKY

Query:  GDNAFKIDLP--LHIHPVFNVADLKPYHAPD
         DNA+ + LP  ++I   FNVAD+  YHA +
Subjt:  GDNAFKIDLP--LHIHPVFNVADLKPYHAPD

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein6.6e-22657.69Show/hide
Query:  IAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPF
        + FLGFII    I ++ +K+ AI  WPTP ++ +I++F GLA+FYR+FIRNFSS+ AP+TDC+KKG F+W   Q+ SF  IK+KL+++P+L LP F   F
Subjt:  IAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPF

Query:  EVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKH
        +V  DA  TGIGAVL+Q+G P+E+FS KL+ +RQ W+TYE EL+A+VRALK WEHYL+ +EFV+ +DH +LK++  Q ++SRMH RWI+FLQRF FV+KH
Subjt:  EVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKH

Query:  QSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKT
        ++G++NKVADALSR+ +LL ++SSEI +F+ L +LY+ D DF+  W KC     + ++HI +GYLFKG QLCIP TSLRE ++++ HSGGL GH G++KT
Subjt:  QSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKT

Query:  LEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLF
        + ++ +RYYWP+++RD   FV++CPICQ  KG ++N  LY+PLP+P  IWEDL++DF++GLP+TQR  DS+ V+VDRFSKM HF+ CKKT+DA ++ANLF
Subjt:  LEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLF

Query:  FKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEV
        F+E+VRLHGVPKSI SDRDVKFLSHFWRTL +KFDT+L++S+TA+PQTDGQTEVTNRTLGNL+RC SG +PKQWD+ L Q EFA+N M NRST K+PFE+
Subjt:  FKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEV

Query:  VYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKY
        VYTK P+   DL  LP     +  AE  A+    + +EV  +L +  + YK AADK RR   F +GDLVMV L+K+RFP GTYNKLK+R+ GPF +  K 
Subjt:  VYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKY

Query:  GDNAFKIDLP--LHIHPVFNVADLKPYHAPD
         DNA+ ++LP  + I   FNVADL  YH PD
Subjt:  GDNAFKIDLP--LHIHPVFNVADLKPYHAPD

A0A6D2HLB5 Reverse transcriptase8.9e-21556.35Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++ +K+ AI  WP P S+ E+++F GL +FYR+F+R+FS++ AP+T+CLKKG F W   Q +SF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  P+ +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE LI++ H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +RRD+   VKRC ICQ +KG S+N  LY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMTHF+ACKKT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVPK+I+SDRD KFLSHFW TL + F TTLK S+TA+PQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  AE MAE I    + V   L  T    K AADK+RR   F +GD VMV L+K RFP GTY KL+  + GPF +L K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  ++I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

A0A6D2IKM3 Reverse transcriptase8.9e-21556.35Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++ +K+ AI  WP P S+ E+++F GL +FYR+F+R+FS++ AP+T+CLKKG F W   Q +SF  IK+KL ++P+L LPDF   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        F+V  DA   GIGAVL+Q+  P+ +FS KLS +RQ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L +QK I++MHARW+SFLQ+F F+I+
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H+SG  NKVADALSR+ SLL  L+ EI+ F+ L +LYE D +FK++W KC+    + D+HI +G+LFKG++LCIP +SLRE LI++ H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +RYYWP +RRD+   VKRC ICQ +KG S+N  LY PLP+P  IW+DLS+DFV+GLP+TQR  DS+ V+VDRFSKMTHF+ACKKT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVPK+I+SDRD KFLSHFW TL + F TTLK S+TA+PQTDGQTEVTNRTLGN++R + G +PKQWDLAL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P+   DL  LP    ++  AE MAE I    + V   L  T    K AADK+RR   F +GD VMV L+K RFP GTY KL+  + GPF +L K
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA
          DNA+ +DLP  ++I   FNVAD+  YHA
Subjt:  YGDNAFKIDLP--LHIHPVFNVADLKPYHA

A0A6N2LVR1 Uncharacterized protein5.0e-22657.8Show/hide
Query:  MKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDF
        M   + FLGF++    I ++ +K+ AI  WPTP +I E+++F GLA+FYR+F+R+FS + AP+T+C+KKG F W    + SF  IK+KL S+P+L LPDF
Subjt:  MKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDF
           FEV  DA   GIGAVL+Q+  P+ ++S KLS +R+ WSTYE ELYA+ RA+K WEHYL+ +EF+L +DH +LK++  Q N++RMHARW++F+QRF+F
Subjt:  SSPFEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDF

Query:  VIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFG
         +KH+SG+ NKVADALSRK SLLT L +E+I F+ + DLY GD DF + W KC   L  +  H  +GYLF+G QLCIP +SLRE +I E H GGL GH G
Subjt:  VIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFG

Query:  QNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYI
        ++KT+ +  +RYYWP+++RD  N VKRCP CQ +KG ++N  LY PLPIP   WEDLS+DF++GLP+TQR  DS+ V+VDRFSKM HF+ACKKT+DA+++
Subjt:  QNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYI

Query:  ANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKS
        ANLFFKEVVRLHGVPKSI SDRD KFLSHFWRTL ++FDTTL FS+T++PQTDGQTEV NRTLGNL+RCLSG +PKQWDL LAQAEFA+N+M NRSTGK+
Subjt:  ANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKS

Query:  PFEVVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPI
        PF+VVY + P+   DL  LP    +N  AE MA+ ++ + +EV  +L  + + YK AADKKRR   F +GDLVMV+L+K R P GT +KL D++ GP+ I
Subjt:  PFEVVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPI

Query:  LEKYGDNAFKIDLP--LHIHPVFNVADLKPYHAPD
        L+K  DNA+++DLP  + I P FNVADL  YH PD
Subjt:  LEKYGDNAFKIDLP--LHIHPVFNVADLKPYHAPD

M5WCC7 Reverse transcriptase9.8e-21456.48Show/hide
Query:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP
        ++ FLGF++ +  I ++ +KI+AI  WP P ++ E+++F GLA+FYR+F+R+FSS+ AP+T+CLKKG F W   Q+ SF DIK+KL ++P+L LP+F   
Subjt:  EIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSP

Query:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK
        FEV  DA   G+GAVL+Q   P+ +FS KLS +RQ WSTY+QE YA+VRALKQWEHYL+ KEFVL TDH +LKY+ +QKNI +MHARW++FLQ+F FVIK
Subjt:  FEVAVDACYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIK

Query:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK
        H SGK N+VADALSR+ SLL  L+ E++ F+ L +LYEGD DF +IW KC+N     DY + EGYLFKG QLCIP +SLRE LI++ H GGL+GH G++K
Subjt:  HQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNK

Query:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL
        T+  + +R+YWP+++RD    V++C  CQ +KG  +N  LY PLP+P  IW+DL++DFV+GLP+TQR  DS+ V+VDRFSKM HF+AC+KT DA  IA L
Subjt:  TLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANL

Query:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE
        FF+EVVRLHGVP SI SDRD KFLSHFW TL + F TTL  S+TA+PQTDGQTEVTNRTLGN++R + G KPKQWD AL Q EFA+N+  + +TGKSPF 
Subjt:  FFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFE

Query:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK
        +VYT +P    DL  LP     +  A+ +AE +  +  EV   L QT   YK AADK RR   F +GD VM+ L+K RFP GTY+KLK ++ GP+ +L++
Subjt:  VVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEK

Query:  YGDNAFKIDLP--LHIHPVFNVADL
          DNA+ I+LP  + I  +FNVADL
Subjt:  YGDNAFKIDLP--LHIHPVFNVADL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.0e-9033.84Show/hide
Query:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF
        + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DF
Subjt:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW
        S    +  DA    +GAVL+Q+      +P+ Y+SAK+S ++  +S  ++E+ A++++LK W HYL S  + F +LTDH +L  +     +  ++  ARW
Subjt:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW

Query:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----
          FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  DT       K  N L+ +D  + E    K     
Subjt:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----

Query:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL
           +Q+ +P+ T L   +IK+ H  G   H G      II +R+ W  IR+    +V+ C  CQ  K  S+N + Y PL PIP S   WE LS+DF+  L
Subjt:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL

Query:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN
        P++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  
Subjt:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN

Query:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ
        LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N++     E I ++ + V +HL       KK  D K ++
Subjt:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ

Query:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN
           F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I  +F+
Subjt:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN

P0CT35 Transposon Tf2-2 polyprotein5.0e-9033.84Show/hide
Query:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF
        + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DF
Subjt:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW
        S    +  DA    +GAVL+Q+      +P+ Y+SAK+S ++  +S  ++E+ A++++LK W HYL S  + F +LTDH +L  +     +  ++  ARW
Subjt:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW

Query:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----
          FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  DT       K  N L+ +D  + E    K     
Subjt:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----

Query:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL
           +Q+ +P+ T L   +IK+ H  G   H G      II +R+ W  IR+    +V+ C  CQ  K  S+N + Y PL PIP S   WE LS+DF+  L
Subjt:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL

Query:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN
        P++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  
Subjt:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN

Query:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ
        LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N++     E I ++ + V +HL       KK  D K ++
Subjt:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ

Query:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN
           F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I  +F+
Subjt:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN

P0CT41 Transposon Tf2-12 polyprotein5.0e-9033.84Show/hide
Query:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF
        + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DF
Subjt:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW
        S    +  DA    +GAVL+Q+      +P+ Y+SAK+S ++  +S  ++E+ A++++LK W HYL S  + F +LTDH +L  +     +  ++  ARW
Subjt:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW

Query:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----
          FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  DT       K  N L+ +D  + E    K     
Subjt:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----

Query:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL
           +Q+ +P+ T L   +IK+ H  G   H G      II +R+ W  IR+    +V+ C  CQ  K  S+N + Y PL PIP S   WE LS+DF+  L
Subjt:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL

Query:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN
        P++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  
Subjt:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN

Query:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ
        LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N++     E I ++ + V +HL       KK  D K ++
Subjt:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ

Query:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN
           F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I  +F+
Subjt:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.3e-9032.25Show/hide
Query:  KEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDF
        +E  FLG+ I    I+    K  AI  +PTP ++K+ Q FLG+ ++YR+FI N S +A P  L  C K    +WT  Q ++ + +K  L +SP+L   + 
Subjt:  KEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAP--LTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQGHP------IEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISF
         + + +  DA   GIGAVL +  +       + YFS  L ++++ +   E EL  +++AL  + + L  K F L TDH SL  LQ +   +R   RW+  
Subjt:  SSPFEVAVDACYTGIGAVLAQQGHP------IEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISF

Query:  LQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHIVEGYLF
        L  +DF +++ +G +N VADA+SR    +T  +S  I  +     Y+ D                       + F+    K   +     +Y + +  ++
Subjt:  LQRFDFVIKHQSGKENKVADALSRKGSLLTILSSEIIAFKHLPDLYEGD-----------------------TDFKDIWYKCS-NFLDADDYHIVEGYLF

Query:  KGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNAR-LYSPLPIPTSIWEDLSIDFVIGLPKT
          ++L +P    + A+++  H   L  GHFG   TL  IS  YYWP+++     +++ C  CQ  K        L  PLPI    W D+S+DFV GLP T
Subjt:  KGEQLCIPHTSLREALIKEAHSGGL-AGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNAR-LYSPLPIPTSIWEDLSIDFVIGLPKT

Query:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLR
            + I+V+VDRFSK  HF+A +KT DA  + +L F+ +   HG P++I SDRDV+  +  ++ L K+       S+  +PQTDGQ+E T +TL  LLR
Subjt:  QRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGNLLR

Query:  CLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA-------D
          + +  + W + L Q EF +N+   R+ GKSPFE+          DL  LP T  + ++ E  A +    +L K +    IQT +  + A        +
Subjt:  CLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIK--KLHKEVHDHLIQTTDSYKKAA-------D

Query:  KKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLH--IHPVFNVADLKPY-HAPDRF
        ++R+    N GD V+VH + + F  G Y K++   +GPF +++K  DNA+++DL  H   H V NV  LK + + PD +
Subjt:  KKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLH--IHPVFNVADLKPY-HAPDRF

Q9UR07 Transposon Tf2-11 polyprotein5.0e-9033.84Show/hide
Query:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF
        + ++ F+G+ I +   +   + I+ +  W  P + KE++ FLG  ++ RKFI   S L  PL + LKK   +KWTP Q ++ E+IK+ L S P+L+  DF
Subjt:  KKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKG-NFKWTPLQQESFEDIKKKLTSSPILKLPDF

Query:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW
        S    +  DA    +GAVL+Q+      +P+ Y+SAK+S ++  +S  ++E+ A++++LK W HYL S  + F +LTDH +L  +     +  ++  ARW
Subjt:  SSPFEVAVDACYTGIGAVLAQQG-----HPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLS--KEFVLLTDHFSL--KYLQAQKNISRMHARW

Query:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----
          FLQ F+F I ++ G  N +ADALSR            + + +  ++   I   FK+ +   Y  DT       K  N L+ +D  + E    K     
Subjt:  ISFLQRFDFVIKHQSGKENKVADALSR------------KGSLLTILSSEIIA--FKH-LPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKG----

Query:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL
           +Q+ +P+ T L   +IK+ H  G   H G      II +R+ W  IR+    +V+ C  CQ  K  S+N + Y PL PIP S   WE LS+DF+  L
Subjt:  ---EQLCIPH-TSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPICQRTKGSSKNARLYSPL-PIPTS--IWEDLSIDFVIGL

Query:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN
        P++   ++++ V+VDRFSKM   V C K+  A   A +F + V+   G PK I++D D  F S  W+    K++  +KFS    PQTDGQTE TN+T+  
Subjt:  PKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDTTLKFSTTANPQTDGQTEVTNRTLGN

Query:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ
        LLRC+  + P  W   ++  + ++NN  + +T  +PFE+V+   P L+  +L +     D N++     E I ++ + V +HL       KK  D K ++
Subjt:  LLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLT-FDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQTTDSYKKAADKKRRQ

Query:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN
           F  GDLVMV   K+ F     NKL     GPF +L+K G N +++DLP  I  +F+
Subjt:  -AHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFN

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.6e-1940.19Show/hide
Query:  EIAFLG--FIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFS
        +IA+LG   II    +S +P K+EA+  WP P +  E++ FLGL  +YR+F++N+  +  PLT+ LKK + KWT +   +F+ +K  +T+ P+L LPD  
Subjt:  EIAFLG--FIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFS

Query:  SPFEVAV
         PF   V
Subjt:  SPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAGAAATTGCATTCCTCGGCTTTATAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGCCTCCATTAA
AGAAATACAAGCCTTCCTCGGCCTGGCTTCGTTTTACAGGAAATTCATCAGAAATTTCAGCTCTTTAGCCGCACCACTAACTGACTGTCTAAAGAAAGGAAACTTCAAAT
GGACCCCGTTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAGCTCACATCCAGCCCTATCCTTAAATTACCAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCA
TGCTACACAGGGATTGGAGCTGTCCTAGCTCAGCAAGGACATCCTATCGAATACTTCAGTGCAAAACTCAGCACCTCAAGACAGACCTGGAGCACATACGAACAAGAGCT
GTATGCCCTTGTCCGAGCACTAAAACAATGGGAACACTACCTGCTTTCTAAAGAATTTGTACTCCTAACTGACCATTTCTCACTAAAATACCTTCAAGCTCAAAAGAATA
TCAGTAGGATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACTTCGTGATCAAACACCAATCAGGCAAAGAGAACAAGGTGGCCGATGCTCTAAGCAGAAAAGGC
TCCCTACTCACAATACTGTCCTCGGAAATCATAGCATTCAAACATTTACCCGACTTATACGAAGGTGATACTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTT
AGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGAACAATTATGCATCCCGCACACCTCACTACGAGAAGCCTTAATAAAAGAGGCACATTCTGGAG
GGCTAGCTGGACATTTCGGACAGAATAAGACATTGGAGATCATCTCCAAACGATACTACTGGCCGGAAATAAGAAGGGATTCCAATAATTTCGTAAAGAGATGCCCCATT
TGCCAAAGAACCAAAGGCTCCAGCAAGAATGCAAGATTATACTCGCCACTACCCATCCCGACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAA
AACACAAAGACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAGCAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCT
TCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTATCAGACAGAGATGTCAAGTTCCTAAGCCATTTTTGGCGAACATTAGGGAAGAAGTTTGACACA
ACACTGAAGTTCAGCACCACAGCCAACCCACAAACAGATGGACAAACTGAAGTAACAAACAGGACCCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACA
ATGGGATCTAGCATTGGCTCAAGCTGAATTCGCCTTCAATAATATGAAGAACAGATCAACAGGAAAATCCCCCTTCGAAGTAGTTTATACCAAACTACCACGATTAACCT
TTGATCTCACTACACTCCCCACAACCGTGGATCTCAACAACGAAGCAGAATGCATGGCAGAAAATATCAAAAAACTACACAAGGAAGTCCATGATCATCTTATACAGACA
ACAGACTCCTACAAAAAGGCAGCAGATAAAAAAAGAAGACAAGCCCACTTCAATAAAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACTGGCACCTA
CAACAAGCTGAAAGACAGACAAATTGGGCCATTCCCTATATTAGAGAAATACGGAGATAATGCCTTCAAGATCGATCTACCACTACACATACACCCAGTCTTCAATGTTG
CTGACCTAAAGCCATACCATGCACCAGATCGTTTCAGGCTTGCTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAAGAAATTGCATTCCTCGGCTTTATAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAAATCGAAGCCATCCATACATGGCCGACTCCTGCCTCCATTAA
AGAAATACAAGCCTTCCTCGGCCTGGCTTCGTTTTACAGGAAATTCATCAGAAATTTCAGCTCTTTAGCCGCACCACTAACTGACTGTCTAAAGAAAGGAAACTTCAAAT
GGACCCCGTTACAACAAGAAAGCTTTGAAGATATCAAAAAGAAGCTCACATCCAGCCCTATCCTTAAATTACCAGATTTCTCTTCACCTTTTGAAGTAGCAGTTGATGCA
TGCTACACAGGGATTGGAGCTGTCCTAGCTCAGCAAGGACATCCTATCGAATACTTCAGTGCAAAACTCAGCACCTCAAGACAGACCTGGAGCACATACGAACAAGAGCT
GTATGCCCTTGTCCGAGCACTAAAACAATGGGAACACTACCTGCTTTCTAAAGAATTTGTACTCCTAACTGACCATTTCTCACTAAAATACCTTCAAGCTCAAAAGAATA
TCAGTAGGATGCACGCACGCTGGATATCCTTCCTCCAAAGGTTTGACTTCGTGATCAAACACCAATCAGGCAAAGAGAACAAGGTGGCCGATGCTCTAAGCAGAAAAGGC
TCCCTACTCACAATACTGTCCTCGGAAATCATAGCATTCAAACATTTACCCGACTTATACGAAGGTGATACTGACTTCAAGGATATCTGGTACAAATGCTCCAACTTCTT
AGACGCTGATGACTACCACATTGTTGAAGGATATCTATTTAAAGGAGAACAATTATGCATCCCGCACACCTCACTACGAGAAGCCTTAATAAAAGAGGCACATTCTGGAG
GGCTAGCTGGACATTTCGGACAGAATAAGACATTGGAGATCATCTCCAAACGATACTACTGGCCGGAAATAAGAAGGGATTCCAATAATTTCGTAAAGAGATGCCCCATT
TGCCAAAGAACCAAAGGCTCCAGCAAGAATGCAAGATTATACTCGCCACTACCCATCCCGACCTCAATATGGGAAGATTTATCAATTGACTTCGTGATTGGATTACCAAA
AACACAAAGACAATTTGACTCAATAATGGTTATAGTGGACAGATTCAGCAAAATGACACATTTCGTAGCATGCAAAAAGACAAATGATGCAATCTACATAGCCAACCTCT
TCTTTAAAGAAGTAGTACGACTACATGGAGTACCTAAAAGCATAGTATCAGACAGAGATGTCAAGTTCCTAAGCCATTTTTGGCGAACATTAGGGAAGAAGTTTGACACA
ACACTGAAGTTCAGCACCACAGCCAACCCACAAACAGATGGACAAACTGAAGTAACAAACAGGACCCTCGGTAATCTGCTACGCTGCCTTAGCGGGTCAAAACCAAAACA
ATGGGATCTAGCATTGGCTCAAGCTGAATTCGCCTTCAATAATATGAAGAACAGATCAACAGGAAAATCCCCCTTCGAAGTAGTTTATACCAAACTACCACGATTAACCT
TTGATCTCACTACACTCCCCACAACCGTGGATCTCAACAACGAAGCAGAATGCATGGCAGAAAATATCAAAAAACTACACAAGGAAGTCCATGATCATCTTATACAGACA
ACAGACTCCTACAAAAAGGCAGCAGATAAAAAAAGAAGACAAGCCCACTTCAATAAAGGAGACCTAGTAATGGTACACCTGAAAAAGAGCAGATTTCCTACTGGCACCTA
CAACAAGCTGAAAGACAGACAAATTGGGCCATTCCCTATATTAGAGAAATACGGAGATAATGCCTTCAAGATCGATCTACCACTACACATACACCCAGTCTTCAATGTTG
CTGACCTAAAGCCATACCATGCACCAGATCGTTTCAGGCTTGCTGACTGA
Protein sequenceShow/hide protein sequence
MKKEIAFLGFIIKQGSISMEPKKIEAIHTWPTPASIKEIQAFLGLASFYRKFIRNFSSLAAPLTDCLKKGNFKWTPLQQESFEDIKKKLTSSPILKLPDFSSPFEVAVDA
CYTGIGAVLAQQGHPIEYFSAKLSTSRQTWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQAQKNISRMHARWISFLQRFDFVIKHQSGKENKVADALSRKG
SLLTILSSEIIAFKHLPDLYEGDTDFKDIWYKCSNFLDADDYHIVEGYLFKGEQLCIPHTSLREALIKEAHSGGLAGHFGQNKTLEIISKRYYWPEIRRDSNNFVKRCPI
CQRTKGSSKNARLYSPLPIPTSIWEDLSIDFVIGLPKTQRQFDSIMVIVDRFSKMTHFVACKKTNDAIYIANLFFKEVVRLHGVPKSIVSDRDVKFLSHFWRTLGKKFDT
TLKFSTTANPQTDGQTEVTNRTLGNLLRCLSGSKPKQWDLALAQAEFAFNNMKNRSTGKSPFEVVYTKLPRLTFDLTTLPTTVDLNNEAECMAENIKKLHKEVHDHLIQT
TDSYKKAADKKRRQAHFNKGDLVMVHLKKSRFPTGTYNKLKDRQIGPFPILEKYGDNAFKIDLPLHIHPVFNVADLKPYHAPDRFRLAD