; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G26150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G26150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr3:23455621..23458274
RNA-Seq ExpressionCSPI03G26150
SyntenyCSPI03G26150
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035681.1 reverse transcriptase [Cucumis melo var. makuwa]2.1e-26865.73Show/hide
Query:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNR
        Y+QYQNCRQGS+ VAEYIEEFHRLSAR NLSENEQHQIARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR   WETN +KKQ Y +
Subjt:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNR

Query:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK-EGEEETKLIEADDGDRVSC
        +T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NNYTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+ E EEET+LIEADDGDR+SC
Subjt:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK-EGEEETKLIEADDGDRVSC

Query:  VIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY------IGREKTP
        ++QRVLI PKEET PQ HSLFKTRCTIN K  PHPDPYKIGWVKKGGE  INEICT+PLSIGNSYKDQI       D+ ++ L  P+      + R +  
Subjt:  VIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY------IGREKTP

Query:  TSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLI
        T    W  K     P      + IR K +++      G  L            +TDKS+G   E+ +P+LKEL  EFPHLK+EPQGLPP RDIQHQIDL+
Subjt:  TSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLI

Query:  PGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGY
        P ASLPNLPHYRMSPEEY++LHDHIE+LL+KG+IKPSLSPCAVPALLTP KDGSWRMCVDSR INR+T +Y+FPIPRIGDLLDQLGKA IFSKIDL++GY
Subjt:  PGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGY

Query:  HQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWP
        HQI+IRPGDEWKTAFKTNEGLFE               ++ ++H+ +LRKLF+VLTE ELYIN K CT+  +EI FLGF+IK+G I M+PKK+EAIH+ P
Subjt:  HQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWP

Query:  TPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSE
        TPTSIKE+QAFLGLASFYRRFIRNFS +VAPLT+                                  F  PFEVAV+ACGTGIGAVLSQ+ HPI+Y SE
Subjt:  TPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSE

Query:  KLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK
        KLS+SRQSWSTY QELYALVRALKQWEH+LL  +F ++     K
Subjt:  KLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]5.7e-26656.48Show/hide
Query:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS
        MKKL+K RF+PPNYEQTLY QYQNCRQG R  AEYIEEFHRL  RTNL E E+H I+ F+GGLRFD+KEKVKLQPF+ LSEAI+ AETVEEMI  R K S
Subjt:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS

Query:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKE
         R   WE +++KK      T        A   K  + +E+S KKE   G  K  N Y RP  G C+RCG+ GH SN CPQRKTIA+A+D     ++   E
Subjt:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKE

Query:  GEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYKIGWVKKGGEATINEICT
         +EET++IEAD+GD +SC++QRVLI+PKEE   QRHSLFKTRCTI                          N KT PH  PYKIGW+KKGGE  I+EIC 
Subjt:  GEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYKIGWVKKGGEATINEICT

Query:  VPLSIGNSYKDQ-IADLGNMTL------EPY------IGREKTPTSSNGWERKSP---PPTNKKKHIRHKAEKEGAALYHITDKS--------------K
        VPLSIGNSYKDQ + D+  M +       P+      + R +  T    W  K     P   +K     K +K+G+    I+ K                
Subjt:  VPLSIGNSYKDQ-IADLGNMTL------EPY------IGREKTPTSSNGWERKSP---PPTNKKKHIRHKAEKEGAALYHITDKS--------------K

Query:  GEEDEVTDPK----LKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSW
        G ED   D +    +KEL  ++P + +EP  LPP RDI H I+L+ GAS P+LPHY MSP EY+ILHD IEELL+KG+IKPS S C VPALLTPKKDG+W
Subjt:  GEEDEVTDPK----LKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSW

Query:  RMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIV
        RMCVDSR IN+ITV+Y+FPIPR+ DLLDQLG A IFSKIDL+S YHQIRIRPGDEWKTAFKTNEGLFEW+                           FI+
Subjt:  RMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIV

Query:  VYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSL
        VYFDDILV+S  YD+H+ H+ +LFQVL   ELY+N K C F   EIAFLGFII++  + M  KKVEAI  W TPT++ ++QAFLGLASFYR+FI+N SS+
Subjt:  VYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSL

Query:  VAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH
         AP+T+CLK G F+W  KQQDSF+ +K  L +  +L+LP+F   FEVAVD CGTGIGAVLSQ+ HPI+Y SE+LS SRQSWSTY QELYALVRALKQWEH
Subjt:  VAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH

Query:  YLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF
        YLL +EFILLT +FSLKYL SQ++IS MHA+WIS+ QRFDFVIKHQ+G ENKVADAL+ KS+LL +LS E+  F H+ +LY++D +F
Subjt:  YLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF

KAA0062943.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]6.4e-24964.7Show/hide
Query:  ISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKT
        ++  ++ EEM+  RLKNSN+  TWETN +KKQ   ++T+EQP+TSV +KGK  D QE + KKE   RGK+ NNYTRPSLGKCFRCGEP HLSNNCPQRKT
Subjt:  ISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKT

Query:  IALAEDEGSDMSKEDKEGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYK
        IALAEDE + MS+ DKE +EE +LIEAD+GDR+SC++QRVLI  KEE  PQRHSLFKTRCTI                          N K DPHPDPYK
Subjt:  IALAEDEGSDMSKEDKEGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYK

Query:  IGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTLEPYIGREKTPTSSNGWERKSPPPTNKKKHIRHKAEKEGAALYHITDKSKGEEDEVT
        IGWVKK GE  INEICT+PLSI NSYKDQI       D+ ++ L+              WE+                +  G     + +KS+G   E+ 
Subjt:  IGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTLEPYIGREKTPTSSNGWERKSPPPTNKKKHIRHKAEKEGAALYHITDKSKGEEDEVT

Query:  DPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINR
        +P+LKEL  EFPHLK+EPQGLPP  DIQHQIDL+PGASLP+LPHYRMSPEEY++LHD+IE LL+KG+IKPSLSPC VPALLTPKKD SWRMCVDSR INR
Subjt:  DPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINR

Query:  ITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDI--LVYSTNYDEHILHLRKLFQVLTETELYINS
        ITV+Y FPIP++GDLLDQLGKA +FSKIDL+S YHQIRIRP DEWKT FK NEGLFEW+ +     +      S + ++H+ HLRKLFQVL E ELYIN 
Subjt:  ITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDI--LVYSTNYDEHILHLRKLFQVLTETELYINS

Query:  KNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPIL
        K CTF  +EI FLGF+IK+G I M+PKKVEAI +WP PTSIKE+QAFLGLASFY+RFIRNFSS+V PLT+ LK  NFKW   QQ SF++IKRRLTSSPIL
Subjt:  KNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPIL

Query:  QLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTN-FSLKYLLSQRTISRMHA
        QLP+FT PFEV VDACG GIG VLSQ+ HPI+Y SEKLS+SRQSWSTY QELYALVRALKQWEHYLL KEF+LLTN FSLKYL SQ++ISRMHA
Subjt:  QLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTN-FSLKYLLSQRTISRMHA

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.1e-26965.12Show/hide
Query:  PPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWE
        P +Y Q +   Y+QYQNCRQGS++VAEYIEEFHRL AR NLSENEQHQIARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR   WE
Subjt:  PPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKEGEEETKLI
        TN +KKQ Y ++T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NNYTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+E EEET+LI
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKEGEEETKLI

Query:  EADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY---
        EADDGDR+SC++QRVLI PKEET PQ HSLFKTRCTIN K  PHPDPYKIGWVKKGGE  INEICT+PLSIGNSYKDQI       D+ ++ L  P+   
Subjt:  EADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY---

Query:  ---IGREKTPTSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPT
           + R +  T    W  K     P      + IR K +++      G  L            +TDKS+G   E+ +P+LKEL  EFPHLK+EPQGLPP 
Subjt:  ---IGREKTPTSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPT

Query:  RDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATI
        RDIQHQIDL+P ASLPNLPHYRMSPEEY++LHDHIE+LL+KG+IKPSLSPCAVPALLTP KDGSWRMCVDSR INR+T +Y+FPIPRIGDLLDQLGKA I
Subjt:  RDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATI

Query:  FSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKP
        FSKIDL++GYHQI+IRPGDEWKTAFKTNEGLFE               ++ ++H+ +LRKLF+VLTE ELYIN K CT+  +EI FLGF+IK+G I M+P
Subjt:  FSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKP

Query:  KKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQ
        KK+EAI + PTPTSIKE+QAFLGLASFYRRFIRNFS +VAPLT+                                  F  PFEVAV+ACGTGIGAVLSQ
Subjt:  KKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQ

Query:  RSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK
        + HPI+Y SEKLS+SRQSWSTY QELYALVRALKQWEHYLL  +F ++     K
Subjt:  RSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]3.0e-21445.23Show/hide
Query:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS
        MK+L++ RFLPP+YEQ LY QYQNCRQG+R + EY EEF+RL++R NL+E E  Q+AR+IGGLR  I++KV L     LSEA++LA  +E  + +R    
Subjt:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS

Query:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGK-EFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKE-DK
          N+T  T    + P        P++S  ++ +  +   +A+    G   G S N Y +P  GKCFRC +PGH SN CP RK++ L   +G D +KE D 
Subjt:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGK-EFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKE-DK

Query:  EGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDK--------------------------TDPHPDPYKIGWVKKGGEATINEIC
        E EE+ + +E D+GD V+CVIQR+L+APK+E   QRH +FKTRCT+N K                          T+ HP PYKI W+KKG E  +   C
Subjt:  EGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDK--------------------------TDPHPDPYKIGWVKKGGEATINEIC

Query:  TVPLSIGNSYKDQI-ADLGNMTL--------------EPYIGREKTPTSSNGWERKSPP--PTNKKKHIRHKAEKEGAALYHITDK--------------
         +P SIG  Y D +  D+  M                  Y GR+ T T    W  +     PT ++ H     EK+ + L    D+              
Subjt:  TVPLSIGNSYKDQI-ADLGNMTL--------------EPYIGREKTPTSSNGWERKSPP--PTNKKKHIRHKAEKEGAALYHITDK--------------

Query:  -SKGEEDEV---TDPKLKELLTEFPHLKRE--PQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKK
          KG  +E    + P +++LL EF  +  +  P GLPP RDIQH IDL+PG SLPNLPHYRMSP E++IL D +E+L+RKG I+ S+SPCAVPALL PKK
Subjt:  -SKGEEDEV---TDPKLKELLTEFPHLKRE--PQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKK

Query:  DGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM--------------------------
        DGSWRMCVDSR IN+ITV+Y+FPIPR+ D+LD L  + +FSK+DL+SGYHQIR+RPGDEWKTAFKT EGL+EW+                          
Subjt:  DGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM--------------------------

Query:  -FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRN
         F+VVYFDDIL+YS +  +H+ HLR++  VL E +LYIN K C F    + FLGF++ +  + +  +KV  I  WP P++I ++++F GLA+FYRRFIRN
Subjt:  -FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRN

Query:  FSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALK
        FS+L AP+T C+K G   W + Q  SF  IK +L+++P+L LP+F   FEV  DA   GIGAVLSQ   P+++ SEKL+ +R+ W+ Y  E YA++RALK
Subjt:  FSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALK

Query:  QWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF
         WEHYL+ +EF+L + + +LK++ +Q  ++RMHA+W++F+Q+F  V+K++SG  NKVADAL+ +  LL  L  E+  F  L DLY ED +F
Subjt:  QWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase1.0e-26865.73Show/hide
Query:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNR
        Y+QYQNCRQGS+ VAEYIEEFHRLSAR NLSENEQHQIARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR   WETN +KKQ Y +
Subjt:  YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNR

Query:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK-EGEEETKLIEADDGDRVSC
        +T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NNYTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+ E EEET+LIEADDGDR+SC
Subjt:  RTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDK-EGEEETKLIEADDGDRVSC

Query:  VIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY------IGREKTP
        ++QRVLI PKEET PQ HSLFKTRCTIN K  PHPDPYKIGWVKKGGE  INEICT+PLSIGNSYKDQI       D+ ++ L  P+      + R +  
Subjt:  VIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY------IGREKTP

Query:  TSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLI
        T    W  K     P      + IR K +++      G  L            +TDKS+G   E+ +P+LKEL  EFPHLK+EPQGLPP RDIQHQIDL+
Subjt:  TSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLI

Query:  PGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGY
        P ASLPNLPHYRMSPEEY++LHDHIE+LL+KG+IKPSLSPCAVPALLTP KDGSWRMCVDSR INR+T +Y+FPIPRIGDLLDQLGKA IFSKIDL++GY
Subjt:  PGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGY

Query:  HQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWP
        HQI+IRPGDEWKTAFKTNEGLFE               ++ ++H+ +LRKLF+VLTE ELYIN K CT+  +EI FLGF+IK+G I M+PKK+EAIH+ P
Subjt:  HQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWP

Query:  TPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSE
        TPTSIKE+QAFLGLASFYRRFIRNFS +VAPLT+                                  F  PFEVAV+ACGTGIGAVLSQ+ HPI+Y SE
Subjt:  TPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSE

Query:  KLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK
        KLS+SRQSWSTY QELYALVRALKQWEH+LL  +F ++     K
Subjt:  KLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.63.1e-24964.7Show/hide
Query:  ISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKT
        ++  ++ EEM+  RLKNSN+  TWETN +KKQ   ++T+EQP+TSV +KGK  D QE + KKE   RGK+ NNYTRPSLGKCFRCGEP HLSNNCPQRKT
Subjt:  ISLAETVEEMITTRLKNSNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKT

Query:  IALAEDEGSDMSKEDKEGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYK
        IALAEDE + MS+ DKE +EE +LIEAD+GDR+SC++QRVLI  KEE  PQRHSLFKTRCTI                          N K DPHPDPYK
Subjt:  IALAEDEGSDMSKEDKEGEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYK

Query:  IGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTLEPYIGREKTPTSSNGWERKSPPPTNKKKHIRHKAEKEGAALYHITDKSKGEEDEVT
        IGWVKK GE  INEICT+PLSI NSYKDQI       D+ ++ L+              WE+                +  G     + +KS+G   E+ 
Subjt:  IGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTLEPYIGREKTPTSSNGWERKSPPPTNKKKHIRHKAEKEGAALYHITDKSKGEEDEVT

Query:  DPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINR
        +P+LKEL  EFPHLK+EPQGLPP  DIQHQIDL+PGASLP+LPHYRMSPEEY++LHD+IE LL+KG+IKPSLSPC VPALLTPKKD SWRMCVDSR INR
Subjt:  DPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINR

Query:  ITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDI--LVYSTNYDEHILHLRKLFQVLTETELYINS
        ITV+Y FPIP++GDLLDQLGKA +FSKIDL+S YHQIRIRP DEWKT FK NEGLFEW+ +     +      S + ++H+ HLRKLFQVL E ELYIN 
Subjt:  ITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDI--LVYSTNYDEHILHLRKLFQVLTETELYINS

Query:  KNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPIL
        K CTF  +EI FLGF+IK+G I M+PKKVEAI +WP PTSIKE+QAFLGLASFY+RFIRNFSS+V PLT+ LK  NFKW   QQ SF++IKRRLTSSPIL
Subjt:  KNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPIL

Query:  QLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTN-FSLKYLLSQRTISRMHA
        QLP+FT PFEV VDACG GIG VLSQ+ HPI+Y SEKLS+SRQSWSTY QELYALVRALKQWEHYLL KEF+LLTN FSLKYL SQ++ISRMHA
Subjt:  QLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTN-FSLKYLLSQRTISRMHA

A0A5B7BER3 Uncharacterized protein1.0e-21546.21Show/hide
Query:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTR-LKN
        M++LL+ RFLP +YEQ LY QYQNCRQG R V+EY +EF+ LS+R NL+E E  Q+AR++GGLR  I++++ L+    L+EA SLA  VE   + + L++
Subjt:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTR-LKN

Query:  SNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEG--AGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRK---TIALAEDEGSDM-
         N   ++  +S  +Q  NR  + +      +K    D   +SK +    A   KS N Y RP  GKCFRC +PGH SN CP R+    + + ED   D  
Subjt:  SNRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEG--AGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRK---TIALAEDEGSDM-

Query:  SKEDKEGEEE---TKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTIND--------------------------KTDPHPDPYKIGWVKKGG
        ++E+ E ++E    ++ E D+G+ VSCV+QR+L+ PK+E  PQRH++F+TRCTIN                           KT+ HP+PYKIGW+KKG 
Subjt:  SKEDKEGEEE---TKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTIND--------------------------KTDPHPDPYKIGWVKKGG

Query:  EATINEICTVPLSIGNSYKDQIA-DLGNM-TLEPYIGRE-----------KTPTSSNGWERK-----------SPPPTNKKK----------HIRHKAEK
        E  + EIC VP SIG  YKD++A D+ +M      +GR            K  T    W  K           + P T+K +               A++
Subjt:  EATINEICTVPLSIGNSYKDQIA-DLGNM-TLEPYIGRE-----------KTPTSSNGWERK-----------SPPPTNKKK----------HIRHKAEK

Query:  EGAALYHITDKSKGEEDEVTDPKLKELLTEFPHL--KREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVP
         G  +  I     G E       L+ LL EF  +     P  LPP RDIQH IDL+PGASLPNLPHYRMSP+E  IL   +E+L+ KG+I+ S+SPCAVP
Subjt:  EGAALYHITDKSKGEEDEVTDPKLKELLTEFPHL--KREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVP

Query:  ALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------
        ALLTPKKDGSWRMCVDSR IN+ITV+Y+FPIPR+ D+LD L  + IFSKIDL+SGYHQIRIRPGDEWKTAFKT EGL+EW+                   
Subjt:  ALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM-------------------

Query:  --------FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASF
                F+VVYFDDIL+YS +  EH+ H+R++   L E++LYIN K C F    + FLGFII    I +  +KV AI  WPTP ++ +I++F GLA+F
Subjt:  --------FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASF

Query:  YRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELY
        YRRFIRNFSS+VAP+T+C+K G F+W   Q+ SF  IK +L+++P+L LP+F   F+V  DA  TGIGAVLSQ   P+++ SEKL+ +RQ W+TY  EL+
Subjt:  YRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELY

Query:  ALVRALKQWEHYLLCKEFILLTNF-SLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF
        A+VRALK WEHYL+ +EF++ ++  +LK++ +Q ++SRMH +WI+FLQRF FV+KH++G +NKVADAL+ +++LL ++S E+ +F  L +LY+ED +F
Subjt:  ALVRALKQWEHYLLCKEFILLTNF-SLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF

A0A5D3DGR0 Reverse transcriptase2.8e-26656.48Show/hide
Query:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS
        MKKL+K RF+PPNYEQTLY QYQNCRQG R  AEYIEEFHRL  RTNL E E+H I+ F+GGLRFD+KEKVKLQPF+ LSEAI+ AETVEEMI  R K S
Subjt:  MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNS

Query:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKE
         R   WE +++KK      T        A   K  + +E+S KKE   G  K  N Y RP  G C+RCG+ GH SN CPQRKTIA+A+D     ++   E
Subjt:  NRNITWETNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKE-GAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKE

Query:  GEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYKIGWVKKGGEATINEICT
         +EET++IEAD+GD +SC++QRVLI+PKEE   QRHSLFKTRCTI                          N KT PH  PYKIGW+KKGGE  I+EIC 
Subjt:  GEEETKLIEADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTI--------------------------NDKTDPHPDPYKIGWVKKGGEATINEICT

Query:  VPLSIGNSYKDQ-IADLGNMTL------EPY------IGREKTPTSSNGWERKSP---PPTNKKKHIRHKAEKEGAALYHITDKS--------------K
        VPLSIGNSYKDQ + D+  M +       P+      + R +  T    W  K     P   +K     K +K+G+    I+ K                
Subjt:  VPLSIGNSYKDQ-IADLGNMTL------EPY------IGREKTPTSSNGWERKSP---PPTNKKKHIRHKAEKEGAALYHITDKS--------------K

Query:  GEEDEVTDPK----LKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSW
        G ED   D +    +KEL  ++P + +EP  LPP RDI H I+L+ GAS P+LPHY MSP EY+ILHD IEELL+KG+IKPS S C VPALLTPKKDG+W
Subjt:  GEEDEVTDPK----LKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSW

Query:  RMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIV
        RMCVDSR IN+ITV+Y+FPIPR+ DLLDQLG A IFSKIDL+S YHQIRIRPGDEWKTAFKTNEGLFEW+                           FI+
Subjt:  RMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIV

Query:  VYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSL
        VYFDDILV+S  YD+H+ H+ +LFQVL   ELY+N K C F   EIAFLGFII++  + M  KKVEAI  W TPT++ ++QAFLGLASFYR+FI+N SS+
Subjt:  VYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSL

Query:  VAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH
         AP+T+CLK G F+W  KQQDSF+ +K  L +  +L+LP+F   FEVAVD CGTGIGAVLSQ+ HPI+Y SE+LS SRQSWSTY QELYALVRALKQWEH
Subjt:  VAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH

Query:  YLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF
        YLL +EFILLT +FSLKYL SQ++IS MHA+WIS+ QRFDFVIKHQ+G ENKVADAL+ KS+LL +LS E+  F H+ +LY++D +F
Subjt:  YLLCKEFILLT-NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTNKSSLLTLLSLEVVAFTHLPDLYEEDMNF

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X15.4e-27065.12Show/hide
Query:  PPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWE
        P +Y Q +   Y+QYQNCRQGS++VAEYIEEFHRL AR NLSENEQHQIARFIGGLRFDIKEKVKL  FR LSEAISLAETVEEM+T RLKNSNR   WE
Subjt:  PPNYEQTL---YNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWE

Query:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKEGEEETKLI
        TN +KKQ Y ++T+EQP+TS+ +KGK  D QE +KKKE   RGK+ NNYTRPSLGKCFRCGEPGHLSNNC QRKTIALAEDE + MS  D+E EEET+LI
Subjt:  TNSTKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKEGEEETKLI

Query:  EADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY---
        EADDGDR+SC++QRVLI PKEET PQ HSLFKTRCTIN K  PHPDPYKIGWVKKGGE  INEICT+PLSIGNSYKDQI       D+ ++ L  P+   
Subjt:  EADDGDRVSCVIQRVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIA------DLGNMTL-EPY---

Query:  ---IGREKTPTSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPT
           + R +  T    W  K     P      + IR K +++      G  L            +TDKS+G   E+ +P+LKEL  EFPHLK+EPQGLPP 
Subjt:  ---IGREKTPTSSNGWERKS----PPPTNKKKHIRHKAEKE------GAALYH----------ITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPT

Query:  RDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATI
        RDIQHQIDL+P ASLPNLPHYRMSPEEY++LHDHIE+LL+KG+IKPSLSPCAVPALLTP KDGSWRMCVDSR INR+T +Y+FPIPRIGDLLDQLGKA I
Subjt:  RDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATI

Query:  FSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKP
        FSKIDL++GYHQI+IRPGDEWKTAFKTNEGLFE               ++ ++H+ +LRKLF+VLTE ELYIN K CT+  +EI FLGF+IK+G I M+P
Subjt:  FSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKP

Query:  KKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQ
        KK+EAI + PTPTSIKE+QAFLGLASFYRRFIRNFS +VAPLT+                                  F  PFEVAV+ACGTGIGAVLSQ
Subjt:  KKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQ

Query:  RSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK
        + HPI+Y SEKLS+SRQSWSTY QELYALVRALKQWEHYLL  +F ++     K
Subjt:  RSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.2e-6635.83Show/hide
Query:  NLPHYR--MSPEEY-RILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGS-----WRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKS
        NLP Y     P+ Y + +   I+++L +G I+ S SP   P  + PKK  +     +R+ +D R +N ITV  + PIP + ++L +LG+   F+ IDL  
Subjt:  NLPHYR--MSPEEY-RILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGS-----WRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKS

Query:  GYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFR
        G+HQI + P    KTAF T  G +E++                             +VY DDI+V+ST+ DEH+  L  +F+ L +  L +    C F +
Subjt:  GYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFR

Query:  REIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGG-NFKWTQKQQDS-FDDIKRRLTSSPILQLPNF
        +E  FLG ++    I   P+K+EAI  +P PT  KEI+AFLGL  +YR+FI NF+ +  P+T CLK       T  + DS F  +K  ++  PIL++P+F
Subjt:  REIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGG-NFKWTQKQQDS-FDDIKRRLTSSPILQLPNF

Query:  TLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFS-LKYLLSQRTISRMHAQWISFLQRFDF
        T  F +  DA    +GAVLSQ  HP+ Y S  L+    ++ST  +EL A+V A K + HYLL + F + ++   L +L   +  +    +W   L  FDF
Subjt:  TLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFS-LKYLLSQRTISRMHAQWISFLQRFDF

Query:  VIKHQSGTENKVADALTNKSSLLTLLS
         IK+  G EN VADAL+      T LS
Subjt:  VIKHQSGTENKVADALTNKSSLLTLLS

P0CT41 Transposon Tf2-12 polyprotein3.5e-6431.17Show/hide
Query:  VTDPKLKELLTEFPHLKRE--PQGLP-PTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDS
        V +P+L ++  EF  +  E   + LP P + ++ +++L        + +Y + P + + ++D I + L+ G I+ S +  A P +  PKK+G+ RM VD 
Subjt:  VTDPKLKELLTEFPHLKRE--PQGLP-PTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDS

Query:  RTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDI
        + +N+      +P+P I  LL ++  +TIF+K+DLKS YH IR+R GDE K AF+   G+FE++                            +V Y DDI
Subjt:  RTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDI

Query:  LVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTN
        L++S +  EH+ H++ + Q L    L IN   C F + ++ F+G+ I +       + ++ +  W  P + KE++ FLG  ++ R+FI   S L  PL N
Subjt:  LVYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTN

Query:  CLKGG-NFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRS-----HPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH
         LK    +KWT  Q  + ++IK+ L S P+L+  +F+    +  DA    +GAVLSQ+      +P+ Y S K+S ++ ++S   +E+ A++++LK W H
Subjt:  CLKGG-NFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRS-----HPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEH

Query:  YL--LCKEFILLT---NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALT
        YL    + F +LT   N   +        ++  A+W  FLQ F+F I ++ G+ N +ADAL+
Subjt:  YL--LCKEFILLT---NFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALT

P20825 Retrovirus-related Pol polyprotein from transposon 2976.3e-6633.11Show/hide
Query:  KLKELLTEFPHLK-REPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKD-----GSWRMCVDSR
        KLK LL +F +L+ +E + L  T  I+H ++    + + +  +      E  +  + ++E+L +G I+ S SP   P  + PKK        +R+ +D R
Subjt:  KLKELLTEFPHLK-REPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKD-----GSWRMCVDSR

Query:  TINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDIL
         +N IT+  ++PIP + ++L +LGK   F+ IDL  G+HQI +      KTAF T  G +E++                             +VY DDI+
Subjt:  TINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWM---------------------------FIVVYFDDIL

Query:  VYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNC
        ++ST+  EH+  ++ +F  L +  L +    C F ++E  FLG I+    I   P KV+AI ++P PT  KEI+AFLGL  +YR+FI N++ +  P+T+C
Subjt:  VYSTNYDEHILHLRKLFQVLTETELYINSKNCTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNC

Query:  LKGGNFKWTQKQQ--DSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCK
        LK      TQK +  ++F+ +K  +   PILQLP+F   F +  DA    +GAVLSQ  HPI + S  L+    ++S   +EL A+V A K + HYLL +
Subjt:  LKGGNFKWTQKQQ--DSFDDIKRRLTSSPILQLPNFTLPFEVAVDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCK

Query:  EFILLTNFS-LKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALT
        +F++ ++   L++L + +       +W   L  + F I +  G EN VADAL+
Subjt:  EFILLTNFS-LKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.6e-6935.02Show/hide
Query:  IQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFS
        ++H I++ PGA LP L  Y ++ +  + ++  +++LL   +I PS SPC+ P +L PKKDG++R+CVD RT+N+ T+   FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFS

Query:  KIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------------MFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNC
         +DL SGYHQI + P D +KTAF T  G +E+                          F+ VY DDIL++S + +EH  HL  + + L    L +  K C
Subjt:  KIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------------MFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNC

Query:  TFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAP--LTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQ
         F   E  FLG+ I    I     K  AI  +PTP ++K+ Q FLG+ ++YRRFI N S +  P  L  C K    +WT+KQ  + + +K  L +SP+L 
Subjt:  TFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAP--LTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQ

Query:  LPNFTLPFEVAVDACGTGIGAVLSQRSHPIK------YSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQ
          N    + +  DA   GIGAVL +  +  K      Y S+ L S+++++     EL  +++AL  + + L  K F L T + SL  L ++   +R   +
Subjt:  LPNFTLPFEVAVDACGTGIGAVLSQRSHPIK------YSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQ

Query:  WISFLQRFDFVIKHQSGTENKVADALTNKSSLLT
        W+  L  +DF +++ +G +N VADA++     +T
Subjt:  WISFLQRFDFVIKHQSGTENKVADALTNKSSLLT

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.2e-7035.25Show/hide
Query:  IQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFS
        ++H I++ PGA LP L  Y ++ +  + ++  +++LL   +I PS SPC+ P +L PKKDG++R+CVD RT+N+ T+   FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWRMCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFS

Query:  KIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------------MFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNC
         +DL SGYHQI + P D +KTAF T  G +E+                          F+ VY DDIL++S + +EH  HL  + + L    L +  K C
Subjt:  KIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEW-------------------------MFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKNC

Query:  TFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAP--LTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQ
         F   E  FLG+ I    I     K  AI  +PTP ++K+ Q FLG+ ++YRRFI N S +  P  L  C K    +WT+KQ  + D +K  L +SP+L 
Subjt:  TFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAP--LTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQ

Query:  LPNFTLPFEVAVDACGTGIGAVLSQRSHPIK------YSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQ
          N    + +  DA   GIGAVL +  +  K      Y S+ L S+++++     EL  +++AL  + + L  K F L T + SL  L ++   +R   +
Subjt:  LPNFTLPFEVAVDACGTGIGAVLSQRSHPIK------YSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLT-NFSLKYLLSQRTISRMHAQ

Query:  WISFLQRFDFVIKHQSGTENKVADALTNKSSLLT
        W+  L  +DF +++ +G +N VADA++     +T
Subjt:  WISFLQRFDFVIKHQSGTENKVADALTNKSSLLT

Arabidopsis top hitse value%identityAlignment
AT1G47350.1 F-box associated ubiquitination effector family protein7.1e-0426.32Show/hide
Query:  LYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYN
        +YN+ QN R  +R V EY EEF+ L    ++++++   ++R IG LR  ++  +       +SEA   A + E+ +        R+ +W   +T+ +   
Subjt:  LYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNSTKKQPYN

Query:  RRTEEQPATSVAEK
        ++T   P T++A +
Subjt:  RRTEEQPATSVAEK

ATMG00860.1 DNA/RNA polymerases superfamily protein1.4e-2338.35Show/hide
Query:  HLRKLFQVLTETELYINSKNCTFFRREIAFLG--FIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWT
        HL  + Q+  + + Y N K C F + +IA+LG   II    +   P K+EA+  WP P +  E++ FLGL  +YRRF++N+  +V PLT  LK  + KWT
Subjt:  HLRKLFQVLTETELYINSKNCTFFRREIAFLG--FIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWT

Query:  QKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAV
        +    +F  +K  +T+ P+L LP+  LPF   V
Subjt:  QKQQDSFDDIKRRLTSSPILQLPNFTLPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGCTCTTGAAGACACGCTTTCTGCCACCGAACTACGAACAAACATTGTATAATCAATATCAGAATTGCCGCCAAGGGAGCCGAATTGTGGCAGAATATATTGA
AGAATTCCACAGATTGAGCGCAAGAACCAATCTGAGTGAGAACGAACAGCATCAGATTGCAAGGTTCATTGGCGGATTACGATTCGATATCAAGGAAAAGGTAAAGTTAC
AGCCCTTTCGCTTCTTGTCGGAAGCTATTTCTCTTGCGGAGACAGTAGAGGAAATGATCACAACACGATTGAAGAACTCAAACAGAAATATTACATGGGAGACAAACTCC
ACCAAGAAGCAACCTTACAACAGGAGGACTGAGGAACAGCCAGCAACATCAGTGGCTGAGAAGGGTAAAGAGTTCGATACTCAAGAGGCAAGCAAAAAGAAAGAAGGAGC
AGGCAGGGGGAAGAGTCTAAACAATTACACTCGCCCGTCCTTAGGGAAGTGTTTTCGATGTGGTGAACCTGGCCACTTATCCAACAACTGCCCCCAAAGGAAAACAATAG
CACTAGCTGAAGATGAAGGCAGTGATATGAGTAAAGAAGACAAAGAAGGAGAAGAAGAAACAAAGCTGATTGAAGCAGATGATGGGGACAGGGTATCCTGTGTTATCCAA
AGGGTCCTCATCGCCCCTAAAGAAGAGACAACCCCCCAGCGACACAGTCTATTCAAGACAAGATGCACTATCAATGACAAGACAGACCCCCACCCCGATCCTTACAAGAT
TGGATGGGTGAAGAAGGGAGGGGAAGCCACAATCAATGAGATTTGTACAGTACCACTTTCCATTGGAAACAGCTACAAAGATCAGATTGCAGACCTTGGCAACATGACAC
TCGAACCCTACATAGGGAGAGAGAAAACACCTACGAGTTCCAATGGATGGGAAAGAAAGAGTCCTCCTCCCACTAACAAGAAAAAACACATAAGGCATAAGGCAGAAAAA
GAAGGGGCAGCTCTTTATCACATTACTGACAAGTCCAAGGGAGAGGAAGATGAGGTCACTGATCCAAAGCTCAAGGAGTTACTCACAGAATTCCCTCACCTAAAGAGAGA
ACCACAGGGATTGCCACCCACACGTGACATTCAACACCAAATTGACCTCATCCCAGGAGCATCACTACCCAATTTACCCCACTACAGAATGAGTCCAGAAGAATACAGAA
TCCTACACGATCACATAGAAGAGCTGCTGAGAAAGGGCTACATCAAGCCAAGCCTTAGCCCATGTGCTGTGCCTGCATTACTTACACCAAAGAAGGATGGAAGCTGGAGA
ATGTGTGTAGACAGCAGGACTATCAACCGAATTACTGTAAGGTACCAATTCCCCATCCCTCGAATTGGAGACTTGCTGGATCAACTAGGCAAGGCTACCATCTTTTCAAA
GATTGACTTGAAGAGCGGCTATCACCAAATAAGGATTAGACCGGGGGACGAGTGGAAAACAGCCTTCAAAACCAATGAAGGCTTATTCGAATGGATGTTTATAGTTGTCT
ATTTCGATGACATACTTGTATACAGCACCAACTATGATGAGCACATACTGCACTTAAGAAAACTATTCCAAGTCCTAACGGAGACAGAACTATACATCAATTCCAAAAAC
TGCACATTCTTTAGAAGGGAAATTGCCTTTCTTGGCTTTATAATCAAGCAAGGGAGCATAGACATGAAACCAAAGAAAGTAGAGGCTATCCATACTTGGCCCACACCAAC
CTCTATTAAGGAGATACAAGCCTTCCTTGGCTTGGCTTCCTTTTACAGAAGATTTATAAGAAACTTCAGCTCATTGGTAGCACCGCTCACCAACTGCCTAAAGGGAGGAA
ACTTTAAGTGGACCCAAAAGCAGCAAGATAGCTTTGACGACATTAAAAGGAGGTTGACTTCCAGCCCCATACTTCAACTACCAAACTTCACTTTACCATTTGAAGTGGCT
GTGGATGCATGCGGAACAGGAATTGGGGCTGTTCTCTCTCAACGAAGTCATCCCATCAAATATTCTAGTGAAAAGCTAAGCTCATCTAGACAGTCTTGGAGTACGTATGT
GCAGGAATTATATGCTCTCGTTCGGGCACTCAAACAGTGGGAGCACTACCTATTATGCAAAGAATTTATACTGTTAACTAACTTTTCTTTAAAATACCTCCTGTCTCAAA
GAACCATCAGTCGGATGCACGCACAATGGATCTCTTTCTTACAAAGGTTTGACTTTGTGATCAAACACCAAAGTGGCACAGAAAATAAAGTGGCAGATGCACTCACCAAT
AAGAGTTCCCTCCTTACACTCCTTTCTCTAGAAGTTGTGGCATTTACACATCTTCCTGACTTATATGAAGAGGATATGAACTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGCTCTTGAAGACACGCTTTCTGCCACCGAACTACGAACAAACATTGTATAATCAATATCAGAATTGCCGCCAAGGGAGCCGAATTGTGGCAGAATATATTGA
AGAATTCCACAGATTGAGCGCAAGAACCAATCTGAGTGAGAACGAACAGCATCAGATTGCAAGGTTCATTGGCGGATTACGATTCGATATCAAGGAAAAGGTAAAGTTAC
AGCCCTTTCGCTTCTTGTCGGAAGCTATTTCTCTTGCGGAGACAGTAGAGGAAATGATCACAACACGATTGAAGAACTCAAACAGAAATATTACATGGGAGACAAACTCC
ACCAAGAAGCAACCTTACAACAGGAGGACTGAGGAACAGCCAGCAACATCAGTGGCTGAGAAGGGTAAAGAGTTCGATACTCAAGAGGCAAGCAAAAAGAAAGAAGGAGC
AGGCAGGGGGAAGAGTCTAAACAATTACACTCGCCCGTCCTTAGGGAAGTGTTTTCGATGTGGTGAACCTGGCCACTTATCCAACAACTGCCCCCAAAGGAAAACAATAG
CACTAGCTGAAGATGAAGGCAGTGATATGAGTAAAGAAGACAAAGAAGGAGAAGAAGAAACAAAGCTGATTGAAGCAGATGATGGGGACAGGGTATCCTGTGTTATCCAA
AGGGTCCTCATCGCCCCTAAAGAAGAGACAACCCCCCAGCGACACAGTCTATTCAAGACAAGATGCACTATCAATGACAAGACAGACCCCCACCCCGATCCTTACAAGAT
TGGATGGGTGAAGAAGGGAGGGGAAGCCACAATCAATGAGATTTGTACAGTACCACTTTCCATTGGAAACAGCTACAAAGATCAGATTGCAGACCTTGGCAACATGACAC
TCGAACCCTACATAGGGAGAGAGAAAACACCTACGAGTTCCAATGGATGGGAAAGAAAGAGTCCTCCTCCCACTAACAAGAAAAAACACATAAGGCATAAGGCAGAAAAA
GAAGGGGCAGCTCTTTATCACATTACTGACAAGTCCAAGGGAGAGGAAGATGAGGTCACTGATCCAAAGCTCAAGGAGTTACTCACAGAATTCCCTCACCTAAAGAGAGA
ACCACAGGGATTGCCACCCACACGTGACATTCAACACCAAATTGACCTCATCCCAGGAGCATCACTACCCAATTTACCCCACTACAGAATGAGTCCAGAAGAATACAGAA
TCCTACACGATCACATAGAAGAGCTGCTGAGAAAGGGCTACATCAAGCCAAGCCTTAGCCCATGTGCTGTGCCTGCATTACTTACACCAAAGAAGGATGGAAGCTGGAGA
ATGTGTGTAGACAGCAGGACTATCAACCGAATTACTGTAAGGTACCAATTCCCCATCCCTCGAATTGGAGACTTGCTGGATCAACTAGGCAAGGCTACCATCTTTTCAAA
GATTGACTTGAAGAGCGGCTATCACCAAATAAGGATTAGACCGGGGGACGAGTGGAAAACAGCCTTCAAAACCAATGAAGGCTTATTCGAATGGATGTTTATAGTTGTCT
ATTTCGATGACATACTTGTATACAGCACCAACTATGATGAGCACATACTGCACTTAAGAAAACTATTCCAAGTCCTAACGGAGACAGAACTATACATCAATTCCAAAAAC
TGCACATTCTTTAGAAGGGAAATTGCCTTTCTTGGCTTTATAATCAAGCAAGGGAGCATAGACATGAAACCAAAGAAAGTAGAGGCTATCCATACTTGGCCCACACCAAC
CTCTATTAAGGAGATACAAGCCTTCCTTGGCTTGGCTTCCTTTTACAGAAGATTTATAAGAAACTTCAGCTCATTGGTAGCACCGCTCACCAACTGCCTAAAGGGAGGAA
ACTTTAAGTGGACCCAAAAGCAGCAAGATAGCTTTGACGACATTAAAAGGAGGTTGACTTCCAGCCCCATACTTCAACTACCAAACTTCACTTTACCATTTGAAGTGGCT
GTGGATGCATGCGGAACAGGAATTGGGGCTGTTCTCTCTCAACGAAGTCATCCCATCAAATATTCTAGTGAAAAGCTAAGCTCATCTAGACAGTCTTGGAGTACGTATGT
GCAGGAATTATATGCTCTCGTTCGGGCACTCAAACAGTGGGAGCACTACCTATTATGCAAAGAATTTATACTGTTAACTAACTTTTCTTTAAAATACCTCCTGTCTCAAA
GAACCATCAGTCGGATGCACGCACAATGGATCTCTTTCTTACAAAGGTTTGACTTTGTGATCAAACACCAAAGTGGCACAGAAAATAAAGTGGCAGATGCACTCACCAAT
AAGAGTTCCCTCCTTACACTCCTTTCTCTAGAAGTTGTGGCATTTACACATCTTCCTGACTTATATGAAGAGGATATGAACTTTTAA
Protein sequenceShow/hide protein sequence
MKKLLKTRFLPPNYEQTLYNQYQNCRQGSRIVAEYIEEFHRLSARTNLSENEQHQIARFIGGLRFDIKEKVKLQPFRFLSEAISLAETVEEMITTRLKNSNRNITWETNS
TKKQPYNRRTEEQPATSVAEKGKEFDTQEASKKKEGAGRGKSLNNYTRPSLGKCFRCGEPGHLSNNCPQRKTIALAEDEGSDMSKEDKEGEEETKLIEADDGDRVSCVIQ
RVLIAPKEETTPQRHSLFKTRCTINDKTDPHPDPYKIGWVKKGGEATINEICTVPLSIGNSYKDQIADLGNMTLEPYIGREKTPTSSNGWERKSPPPTNKKKHIRHKAEK
EGAALYHITDKSKGEEDEVTDPKLKELLTEFPHLKREPQGLPPTRDIQHQIDLIPGASLPNLPHYRMSPEEYRILHDHIEELLRKGYIKPSLSPCAVPALLTPKKDGSWR
MCVDSRTINRITVRYQFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWMFIVVYFDDILVYSTNYDEHILHLRKLFQVLTETELYINSKN
CTFFRREIAFLGFIIKQGSIDMKPKKVEAIHTWPTPTSIKEIQAFLGLASFYRRFIRNFSSLVAPLTNCLKGGNFKWTQKQQDSFDDIKRRLTSSPILQLPNFTLPFEVA
VDACGTGIGAVLSQRSHPIKYSSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTNFSLKYLLSQRTISRMHAQWISFLQRFDFVIKHQSGTENKVADALTN
KSSLLTLLSLEVVAFTHLPDLYEEDMNF