; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G12910 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G12910
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr1:8411815..8415195
RNA-Seq ExpressionCSPI01G12910
SyntenyCSPI01G12910
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]9.1e-27450.42Show/hide
Query:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
        M  +RG+ PAA E   ++ A E   LSPRT++  L +VE S+ ++R + + ++     E+ +L  R  E P          QN  R    RGRR  E FR
Subjt:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR

Query:  NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
             R FQ R      + +PQ      + D+ + WQ  +E+ ++SS  E+ +    NDD+   R ++  +NE ++ E  +YKMKIDLP YDGK+NIE F
Subjt:  NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF

Query:  LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
        LDW+K+TENFF YM   + KKVHLVALKL+ GAS WWDQ+ +NRQ+ GK  IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R   +YIEEFH L 
Subjt:  LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS

Query:  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
         RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI  R+K+ ++RP WE ++++      KT    S       ++   QE +  
Subjt:  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE

Query:  RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
        +K     +   +N Y RP  G  +RCGQ  H SN CPQRKTIA+A++    S  S  E ++ETE+IEAD+ + +SC++QR                   R
Subjt:  RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR

Query:  CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
        CTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H  PYKIGW++KGGE  +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Subjt:  CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR

Query:  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
        ENTYE  WM +KV+LLP+ ++  + +   ++   LF T                                    +L  ++P+I +EP  LPPLRDI H+I
Subjt:  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI

Query:  DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
        +L+SGAS P+L HY MSP EYK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+
Subjt:  DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK

Query:  SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
        S YHQIRIRPGDEWKT FKTNEGLFEW                 M+K    F                           +    L   ELY+N KK +F 
Subjt:  SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM

Query:  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
          EIAFL F+I++  + M+ KK+EAI  W  P ++ ++QAFLGLASFYRKFI+N +S+AAP+TD                                  F 
Subjt:  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS

Query:  SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
          FEV VD C TGIG VL+QQ HPIEYFSE+L+ SRQSWSTYEQELYALVRALKQWEHYLLS+EF+LLTDHFSLK
Subjt:  SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]3.7e-29159.62Show/hide
Query:  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
        P R E   +N++  G   GRRAR N++N  N    QRRRP     Q  D++ QE+ E WQ  Q+ DSS GDEQGN+WN + + R  +  +  E RR  YH
Subjt:  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH

Query:  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
        DYKMKIDLP Y+GK++IE+FLDWIK+TENFF YM  P+RKKVHLVALKL+ GAS W                               P +Y Q +   Y+
Subjt:  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN

Query:  QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
        QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+GGLR DIKEKVKL  FR LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  KT
Subjt:  QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT

Query:  NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
        ++QPSTS   KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC QRKTIA+AE E   MS   +E E+ETELIEADD +R+SC+
Subjt:  NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV

Query:  IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
        +QR   T          +  + ++    K    +N K   HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+
Subjt:  IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS

Query:  LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
        LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF T                                    +L  EFP +K+EP+GLPPLRDI
Subjt:  LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI

Query:  QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
        QH IDL+  ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSK
Subjt:  QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK

Query:  IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
        IDL++GYHQI+IRPGDEWKT FKTNEGLFE   ++       +K    LTE ELYIN KK  ++ +EI FL F+IK+G I MEPKKIEAI + P P SIK
Subjt:  IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK

Query:  EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
        E+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Subjt:  EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL

XP_011648447.2 uncharacterized protein LOC105434464 [Cucumis sativus]1.7e-25675.39Show/hide
Query:  MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
        MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDL VYDGKQNIEAFLDWIKSTENFFNYMD PE KKVHLVALKLR                         
Subjt:  MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW

Query:  EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
                                                                                               AETVEEMIAVRSK
Subjt:  EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK

Query:  NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
        NLKRRPAW+TTSTRMNNYADKTNDQPSTSTKGKGKEVENQEV VERKNEQ FKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
Subjt:  NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS

Query:  KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
        K AEDE ELIEADD ERVSCVIQR                  ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPT YKIGWVRK GE TVSEI
Subjt:  KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI

Query:  CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEK-QLFTTQLLYEFPRIKEEPEGLPPLR
        CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQ MGRKVVLLPITRKNKEGLR E  +    QLLYEFPRIKEEPEGLPPLR
Subjt:  CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEK-QLFTTQLLYEFPRIKEEPEGLPPLR

Query:  DIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
        DIQHHIDLI GASLPNLAHYRMSPQEYKTLHDHIEELLKKGHI+PSLSPCAVPALLT KK GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
Subjt:  DIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF

Query:  SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
        SKIDLKSGYHQIRIRPGDEWKTTFKT EGLFEWM
Subjt:  SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]8.4e-29684.24Show/hide
Query:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNP
        MAGRR  NPA GENR QE AEEIT LSP+T+TVRLLAVEESLGDL N FDRL+ESVELL+RR+E+PQPPPRNEINFQN+QRFGE RGRRAR   RN+NNP
Subjt:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFN
        RG QRRRPGYAI QQ DEDFQEDQE WQE QEDDSSSGDEQGNMWNFND+ RAGRNNQR E RRGEYHDYKMKIDLP+YDGK+NIEAFLDWIKSTENFFN
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFN

Query:  YMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQ
        YMD PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ +RSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSV +YIEEFH LSARTNLSENEQHQ
Subjt:  YMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQ

Query:  IARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQ
        +ARFVGGLR DIKEKV+LQPFRFLSEAISFAETVEEMIA+RSKNL RR AWET ST+      KTNDQPSTSTK KGKE++NQEVAVERK EQTFK S Q
Subjt:  IARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQ

Query:  NNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIID
        N+YSRP LGK FRCGQT HLS+NCPQRKTIAIAEEG Q+SEDS EAE+ETELIEADD ERVSCVIQR                   RCTINGRVCDVIID
Subjt:  NNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIID

Query:  NDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKV
        + SS+NFVAKKLVTVLNLKAEAHP PYKIGWVRKGGE TVSEICTVPLSI NAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKV
Subjt:  NDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKV

Query:  VLLPITRKNKEGLRGEKQLFTT
        VLLPIT+K  EGLRGEKQLF T
Subjt:  VLLPITRKNKEGLRGEKQLFTT

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]1.3e-29979.67Show/hide
Query:  MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW
        MWN NDDLRAGRNN+R EVRRGEYHDYKMKIDLP+YDGK+NIEAFLDWIKSTENFFNYMD PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ IRSW
Subjt:  MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSW

Query:  EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK
        EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSV DYIEEFH LSARTNLSENEQHQ+ARFVG                         ETVEEMIA+RSK
Subjt:  EKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSK

Query:  NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
        NL RR AWETTST+      KTNDQPSTSTKGKGKEV+NQEVAVERK EQTFK S QN+YSRP LGK FRCGQT HLSNNCPQRKTIAIAEEG Q SEDS
Subjt:  NLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS

Query:  KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI
         EAE+ETELIEADD ERVSC IQR                   RCTINGRVCDVIID+ SS+NFVAKKLV VLNLKAEAHPTPYKIGWVRKGGE TVSEI
Subjt:  KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI

Query:  CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTT---------------------
        CTVPLSI NAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKVVLLPIT+K  EGLRGEKQLF T                     
Subjt:  CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTT---------------------

Query:  ---------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWR
                       QLL+EFP IKEEP+GLPPLRDIQHHIDLI GASLPNLAHYRMSPQEYK LHDHIEELLKKGHI+PSLSPCAVPALLTPKK GSWR
Subjt:  ---------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWR

Query:  MCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM
        MCVDSRAINRITVKYRF IPRISDLLDQLGKASIFSKIDLKSGYHQIR+RPGDEWKT FKTNEGLFEWM
Subjt:  MCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM

TrEMBL top hitse value%identityAlignment
A0A5A7T256 Reverse transcriptase8.9e-25163.96Show/hide
Query:  YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYAD
        Y+QYQNCRQG + V +YIEEFH LSAR NLSENEQHQIARF+GGLR DIKEKVKL  FR LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  
Subjt:  YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYAD

Query:  KTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMS-EDSKEAEDETELIEADDEERV
        KT++QPSTS   KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC QRKTIA+AE E   MS  D +E E+ETELIEADD +R+
Subjt:  KTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMS-EDSKEAEDETELIEADDEERV

Query:  SCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD
        SC++QR   T          +  + ++    K    +N K   HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+D
Subjt:  SCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD

Query:  TQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPL
        TQ+LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF T                                    +L  EFP +K+EP+GLPPL
Subjt:  TQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPL

Query:  RDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASI
        RDIQH IDL+  ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA I
Subjt:  RDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASI

Query:  FSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPA
        FSKIDL++GYHQI+IRPGDEWKT FKTNEGLFE   ++       +K    LTE ELYIN KK  ++ +EI FL F+IK+G I MEPKKIEAIH+ P P 
Subjt:  FSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPA

Query:  SIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFV
        SIKE+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEH+LLS +F 
Subjt:  SIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFV

Query:  LL
        ++
Subjt:  LL

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.61.3e-22246.89Show/hide
Query:  KNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRR-KEYPQPPPRNEINFQNNQRFGEARGRR-ARENFRNVNNPRGFQ
        K P+ G    QE  EEI  LS RT+TVRLLAVE+SLGDL    DR+++ ++ L+RR  E P P        +   R    RG R  R N+RN  N R  Q
Subjt:  KNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRR-KEYPQPPPRNEINFQNNQRFGEARGRR-ARENFRNVNNPRGFQ

Query:  RRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDI
        RRRP     Q  D++ QE+ E WQ I                                                                          
Subjt:  RRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDI

Query:  PERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARF
           KKVHLVALKL+ GAS WWDQLE+NRQ+                                                                      
Subjt:  PERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARF

Query:  VGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYS
                                    + EEM+ VR KN  +R  WET  ++  +   KT++QPSTS   KGK ++ QE     K E   +  +QNNY+
Subjt:  VGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYS

Query:  RPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS
        RP LGK FRCG+ +HLSNNCPQRKTIA+AE E   MSE  KE ++E ELIEAD+ +R+SC++QR                   RCTI+G+VCDVIID+ S
Subjt:  RPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS

Query:  SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLL
        S+NFVA+KLV  LNLK + HP PYKIGWV+K GE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLL RPW+ D                    ++ L
Subjt:  SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLL

Query:  PITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY
         +  K++ G     +    +L  EFP +K+EP+GLPPL DIQH IDL+ GASLP+L HYRMSP+EY+ LHD+IE LLKKGHI+PSLSPC VPALLTPKK 
Subjt:  PITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY

Query:  GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------AFTTS---------KK
         SWRMCVDSRAINRITVKY F IP++ DLLDQLGKA++FSKIDL+S YHQIRIRP DEWKTTFK NEGLFEW+    G       FT+          +K
Subjt:  GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------AFTTS---------KK

Query:  NVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------
            L E ELYIN KK  F+ +EI FL F+IK+G I MEPKK+EAI +WP P SIKE+QAFLGLASFY++FIRNF+S+  PLTDY               
Subjt:  NVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------

Query:  ------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
                          F+SPFEV VDAC  GIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLL KEFVLLT+HFSLK  Q
Subjt:  ------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ

A0A5B7BER3 Uncharacterized protein1.9e-21344.08Show/hide
Query:  SSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQR
        +++ D      +F  D   GR     + R     +Y+MKIDLP ++G  +IE+FLDWI   E FF+ M+I + K+V LVA KL+ GAS WWDQ++ NR+R
Subjt:  SSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQR

Query:  CGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETV
         GKQ +R+W+KM++LL+ RFLP +YEQ LY QYQNCRQG RSV +Y +EF+ LS+R NL+E E  Q+AR+VGGLR  I++++ L+    L+EA S A  V
Subjt:  CGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETV

Query:  EEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKG-----KEVENQEVAVERKNEQTFKTSSQ---NNYSRPLLGKFFRCGQTEHLSNNCPQ
        E           R+P     S R  +Y D + +Q +   + +G     +++  ++ A   KN+ T    SQ   N Y+RP+ GK FRC Q  H SN CP 
Subjt:  EEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKG-----KEVENQEVAVERKNEQTFKTSSQ---NNYSRPLLGKFFRCGQTEHLSNNCPQ

Query:  RK---TIAIAEEGRQMSEDSKEAE--DE---TELIEADDEERVSCVIQ------------------RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNL
        R+    + + E+     E+ +EAE  DE    E+ E D+ E VSCV+Q                  R RCTIN +VCDVIID+ SS+N V+K LV  L L
Subjt:  RK---TIAIAEEGRQMSEDSKEAE--DE---TELIEADDEERVSCVIQ------------------RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNL

Query:  KAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLP--------------
        K E HP PYKIGW++KG E  V+EIC VP SI   YKD++ CD+++MD CH+LLGRPWQ+D  + HKG++NTY   W  +KVVL+P              
Subjt:  KAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLP--------------

Query:  ----ITRKNKEGLRGEKQL---------------------FTTQLLYEFPRI--KEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIE
            +T    + +   K+                          LL EF  I   E P+ LPP+RDIQHHIDL+ GASLPNL HYRMSP+E + L   +E
Subjt:  ----ITRKNKEGLRGEKQL---------------------FTTQLLYEFPRI--KEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIE

Query:  ELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM-
        +L+ KG IQ S+SPCAVPALLTPKK GSWRMCVDSRAIN+ITVKYRF IPR++D+LD L  + IFSKIDL+SGYHQIRIRPGDEWKT FKT EGL+EW+ 
Subjt:  ELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM-

Query:  ---------------------------------------HKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTW
                                                 +R      ++ +  L E++LYIN KK  F+   + FL F+I    I ++ +K+ AI  W
Subjt:  ---------------------------------------HKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTW

Query:  PIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD---------------------------------YFSSPFEVEVDACCTGIGVVLAQQGHPIEYFS
        P P ++ +I++F GLA+FYR+FIRNF+S+ AP+TD                                  F   F+V+ DA  TGIG VL+Q+G P+E+FS
Subjt:  PIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD---------------------------------YFSSPFEVEVDACCTGIGVVLAQQGHPIEYFS

Query:  EKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
        EKLN +RQ W+TYE EL+A+VRALK WEHYL+ +EFV+ +DH +LK
Subjt:  EKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK

A0A5D3DGR0 Reverse transcriptase4.4e-27450.42Show/hide
Query:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR
        M  +RG+ PAA E   ++ A E   LSPRT++  L +VE S+ ++R + + ++     E+ +L  R  E P          QN  R    RGRR  E FR
Subjt:  MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFR

Query:  NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF
             R FQ R      + +PQ      + D+ + WQ  +E+ ++SS  E+ +    NDD+   R ++  +NE ++ E  +YKMKIDLP YDGK+NIE F
Subjt:  NVNNPRGFQRR---RPGYAIPQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYHDYKMKIDLPVYDGKQNIEAF

Query:  LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS
        LDW+K+TENFF YM   + KKVHLVALKL+ GAS WWDQ+ +NRQ+ GK  IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R   +YIEEFH L 
Subjt:  LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS

Query:  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE
         RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI  R+K+ ++RP WE ++++      KT    S       ++   QE +  
Subjt:  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVE

Query:  RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR
        +K     +   +N Y RP  G  +RCGQ  H SN CPQRKTIA+A++    S  S  E ++ETE+IEAD+ + +SC++QR                   R
Subjt:  RKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR------------------AR

Query:  CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR
        CTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H  PYKIGW++KGGE  +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Subjt:  CTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR

Query:  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI
        ENTYE  WM +KV+LLP+ ++  + +   ++   LF T                                    +L  ++P+I +EP  LPPLRDI H+I
Subjt:  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDIQHHI

Query:  DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK
        +L+SGAS P+L HY MSP EYK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR+SDLLDQLG A IFSKIDL+
Subjt:  DLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLK

Query:  SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM
        S YHQIRIRPGDEWKT FKTNEGLFEW                 M+K    F                           +    L   ELY+N KK +F 
Subjt:  SGYHQIRIRPGDEWKTTFKTNEGLFEW-----------------MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM

Query:  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS
          EIAFL F+I++  + M+ KK+EAI  W  P ++ ++QAFLGLASFYRKFI+N +S+AAP+TD                                  F 
Subjt:  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY---------------------------------FS

Query:  SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
          FEV VD C TGIG VL+QQ HPIEYFSE+L+ SRQSWSTYEQELYALVRALKQWEHYLLS+EF+LLTDHFSLK
Subjt:  SPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X11.8e-29159.62Show/hide
Query:  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH
        P R E   +N++  G   GRRAR N++N  N    QRRRP     Q  D++ QE+ E WQ  Q+ DSS GDEQGN+WN + + R  +  +  E RR  YH
Subjt:  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYH

Query:  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN
        DYKMKIDLP Y+GK++IE+FLDWIK+TENFF YM  P+RKKVHLVALKL+ GAS W                               P +Y Q +   Y+
Subjt:  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL---YN

Query:  QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT
        QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+GGLR DIKEKVKL  FR LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  KT
Subjt:  QYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT

Query:  NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV
        ++QPSTS   KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC QRKTIA+AE E   MS   +E E+ETELIEADD +R+SC+
Subjt:  NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCV

Query:  IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS
        +QR   T          +  + ++    K    +N K   HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+DTQ+
Subjt:  IQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQS

Query:  LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI
        LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF T                                    +L  EFP +K+EP+GLPPLRDI
Subjt:  LHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT------------------------------------QLLYEFPRIKEEPEGLPPLRDI

Query:  QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK
        QH IDL+  ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSK
Subjt:  QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSK

Query:  IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK
        IDL++GYHQI+IRPGDEWKT FKTNEGLFE   ++       +K    LTE ELYIN KK  ++ +EI FL F+IK+G I MEPKKIEAI + P P SIK
Subjt:  IDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIK

Query:  EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL
        E+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Subjt:  EIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.0e-4230.62Show/hide
Query:  EGLRGEKQLFTTQLLYEFPRIK-EEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEY-KTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGS---
        E L  E++     LL ++  I+  E + L      +H I+  +  +LP  + Y   PQ Y + +   I+++L +G I+ S SP   P  + PKK  +   
Subjt:  EGLRGEKQLFTTQLLYEFPRIK-EEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEY-KTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGS---

Query:  --WRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG----------------------
          +R+ +D R +N ITV  R  IP + ++L +LG+ + F+ IDL  G+HQI + P    KT F T  G +E++    G                      
Subjt:  --WRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG----------------------

Query:  ----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNF
                   F+TS              L +  L +   K  F+K+E  FL  V+    I   P+KIEAI  +PIP   KEI+AFLGL  +YRKFI NF
Subjt:  ----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNF

Query:  NSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRAL
          +A P+T              +Y                     F+  F +  DA    +G VL+Q GHP+ Y S  LN    ++ST E+EL A+V A 
Subjt:  NSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRAL

Query:  KQWEHYLLSKEFVLLTDH
        K + HYLL + F + +DH
Subjt:  KQWEHYLLSKEFVLLTDH

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.0e-3827.36Show/hide
Query:  LHDHIEELLKKGHIQPSLSPCAVPALLTPKK----YG--SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTT
        +++ +++LLK G I+PS SP   P  +  KK    +G  + R+ +D R +N  T+  R+ +P I  +L  LGKA  F+ +DLKSGYHQI +   D  KT+
Subjt:  LHDHIEELLKKGHIQPSLSPCAVPALLTPKK----YG--SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTT

Query:  FKTNEGLFEW---------------------MHKQRG-----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSIS
        F  N G +E+                     + +Q G            F+ ++ +        +  L +  + ++ +K+ F K  + +L F++ +    
Subjt:  FKTNEGLFEW---------------------MHKQRG-----------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSIS

Query:  MEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY----------------------------------------------FSSPFE
         +P+K++AI  +P P  + ++++FLGLAS+YR FI++F ++A P+TD                                               F  PF+
Subjt:  MEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY----------------------------------------------FSSPFE

Query:  VEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLL-SKEFVLLTDH----FSLKQRQQGGRCPKQKRLPTHIVVFENH
        +  DA  +GIG VL+Q+G PI   S  L    Q+++T E+EL A+V AL + +++L  S+E  + TDH    F++  R       K KR  ++I   + H
Subjt:  VEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLL-SKEFVLLTDH----FSLKQRQQGGRCPKQKRLPTHIVVFENH

Query:  SIQAFTRPIRRRY
        + + F +P +  +
Subjt:  SIQAFTRPIRRRY

P20825 Retrovirus-related Pol polyprotein from transposon 2972.5e-4027.29Show/hide
Query:  HKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKG
        H  +E T++L+ +  K        +N E   GEK  FT  + +              + +  H+  + S         Y ++      + + ++E+L +G
Subjt:  HKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKG

Query:  HIQPSLSPCAVPALLTPKK-----YGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHK
         I+ S SP   P  + PKK        +R+ +D R +N IT+  R+ IP + ++L +LGK   F+ IDL  G+HQI +      KT F T  G +E++  
Subjt:  HIQPSLSPCAVPALLTPKK-----YGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHK

Query:  QRG--------------------------------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWP
          G                                 F+TS              L +  L +   K  F+K+E  FL  ++    I   P K++AI ++P
Subjt:  QRG--------------------------------AFTTSKKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWP

Query:  IPASIKEIQAFLGLASFYRKFIRNFNSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYF
        IP   KEI+AFLGL  +YRKFI N+  +A P+T              +Y                     F   F +  DA    +G VL+Q GHPI + 
Subjt:  IPASIKEIQAFLGLASFYRKFIRNFNSLAAPLT--------------DY---------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYF

Query:  SEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK
        S  LN    ++S  E+EL A+V A K + HYLL ++F++ +DH  L+
Subjt:  SEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.7e-4229.27Show/hide
Query:  IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
        ++H I++  GA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKK G++R+CVD R +N+ T+   F +PRI +LL ++G A IF+
Subjt:  IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS

Query:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
         +DL SGYHQI + P D +KT F T  G +E+     G                               F+ S +         +  L    L +  KK 
Subjt:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS

Query:  MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
         F   E  FL + I    I+    K  AI  +P P ++K+ Q FLG+ ++YR+FI N + +A P+  +                       +SP      
Subjt:  MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------

Query:  ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
            + +  DA   GIG VL +  +       + YFS+ L  +++++   E EL  +++AL  + + L  K F L TDH SL   Q
Subjt:  ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.7e-4229.27Show/hide
Query:  IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS
        ++H I++  GA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKK G++R+CVD R +N+ T+   F +PRI +LL ++G A IF+
Subjt:  IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFS

Query:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS
         +DL SGYHQI + P D +KT F T  G +E+     G                               F+ S +         +  L    L +  KK 
Subjt:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN--------VPGLTETELYINTKKS

Query:  MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------
         F   E  FL + I    I+    K  AI  +P P ++K+ Q FLG+ ++YR+FI N + +A P+  +                       +SP      
Subjt:  MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------

Query:  ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ
            + +  DA   GIG VL +  +       + YFS+ L  +++++   E EL  +++AL  + + L  K F L TDH SL   Q
Subjt:  ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQ

Arabidopsis top hitse value%identityAlignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein1.7e-0428.99Show/hide
Query:  FLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLP
        +L W  +   +F +    +  K+ +   +L+  A  WWDQ E NR    +  IR+WE++K  + A++ P
Subjt:  FLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLP

AT4G13320.1 unknown protein3.2e-1131.67Show/hide
Query:  RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEM--DVCHLLLGRPWQYDTQS
        R +C IN   C +++      N ++K LV  L LK        ++   R+  +V   E C VP+SI + YKD++ C V+ M  +   LL G PW Y  Q+
Subjt:  RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEM--DVCHLLLGRPWQYDTQS

Query:  LHKGRENTYELQWMGRKVVL
         H GR+++  + W    ++L
Subjt:  LHKGRENTYELQWMGRKVVL

ATMG00860.1 DNA/RNA polymerases superfamily protein2.7e-1037.18Show/hide
Query:  ELYINTKKSMFMKREIAFLD--FVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD
        + Y N KK  F + +IA+L    +I    +S +P K+EA+  WP P +  E++ FLGL  +YR+F++N+  +  PLT+
Subjt:  ELYINTKKSMFMKREIAFLD--FVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTCGAAGAGGTAAAAACCCTGCCGCGGGGGAAAACCGTACGCAAGAAGTAGCGGAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGC
TGTTGAGGAATCCTTGGGAGATCTCCGTAACATATTTGATAGATTGATAGAAAGCGTCGAATTGTTAAGCCGAAGGAAAGAATACCCACAACCACCACCACGGAACGAAA
TCAACTTCCAAAACAACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGAAAACTTCAGAAACGTGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGGTAC
GCCATACCACAACAGTTTGACGAAGATTTTCAAGAAGACCAAGAAGTATGGCAAGAAATCCAAGAAGATGATTCTTCAAGTGGGGATGAACAAGGAAACATGTGGAACTT
CAATGATGACTTGCGAGCAGGAAGAAATAACCAAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCGTGTATGATGGCAAACAAA
ATATAGAAGCATTCCTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGATATACCCGAACGCAAGAAAGTCCATCTAGTAGCCTTAAAGTTAAGAGCCGGT
GCATCAACTTGGTGGGATCAATTGGAAATTAACAGACAAAGATGTGGGAAACAGTCGATCCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCC
AAACTATGAACAAACACTCTACAATCAGTACCAAAACTGTCGCCAAGGTGTCCGTTCAGTAGTTGATTACATTGAAGAATTCCACTGCCTGAGTGCAAGAACGAACCTGA
GCGAAAATGAACAACACCAGATTGCAAGATTTGTGGGAGGTCTCCGACTCGACATCAAGGAAAAAGTCAAACTACAACCATTCCGTTTCTTGTCTGAAGCAATATCCTTT
GCAGAAACAGTGGAAGAAATGATTGCGGTTCGATCCAAAAACCTAAAGAGAAGACCAGCATGGGAGACAACTTCAACAAGAATGAACAATTATGCGGACAAAACAAACGA
CCAACCCTCAACCTCAACAAAAGGAAAAGGGAAGGAAGTTGAAAATCAAGAAGTAGCCGTTGAAAGAAAGAATGAACAAACATTCAAAACCAGTAGTCAGAACAACTACT
CCCGCCCTTTATTAGGAAAATTCTTCCGATGTGGCCAAACTGAACACCTCTCCAACAACTGCCCGCAAAGAAAAACCATAGCAATAGCCGAAGAAGGAAGGCAGATGAGT
GAAGATAGTAAAGAAGCAGAAGACGAAACTGAACTGATTGAAGCAGATGACGAGGAAAGGGTCTCTTGTGTCATCCAACGGGCAAGATGCACCATAAACGGAAGGGTATG
TGATGTAATCATAGACAACGACAGTAGCAAAAACTTCGTAGCAAAGAAACTAGTAACAGTCTTGAACCTAAAGGCTGAAGCACATCCAACCCCCTACAAGATAGGTTGGG
TAAGAAAAGGAGGAGAAGTCACGGTTAGCGAAATCTGCACAGTCCCTCTCTCCATTGAAAACGCCTACAAAGACCAAATTGTTTGTGACGTCATTGAGATGGACGTATGC
CATCTCCTATTAGGAAGACCTTGGCAGTATGATACCCAATCCTTACACAAAGGAAGAGAAAATACGTATGAATTACAATGGATGGGGAGAAAGGTAGTTCTACTCCCAAT
AACAAGAAAGAATAAGGAAGGATTAAGAGGTGAGAAACAACTATTCACCACCCAGCTCCTTTATGAGTTCCCACGCATAAAGGAAGAACCAGAGGGACTCCCACCTCTTC
GAGACATACAGCACCACATAGACTTGATCTCGGGAGCATCATTACCAAACTTGGCTCACTATAGGATGAGTCCCCAGGAGTACAAAACACTTCATGACCATATTGAGGAA
CTATTAAAGAAAGGGCACATCCAACCGAGCCTCAGCCCTTGTGCAGTACCAGCCCTTCTCACACCAAAGAAATATGGGAGTTGGAGAATGTGTGTTGACAGCAGAGCCAT
CAATCGTATCACGGTAAAGTATAGATTTTCCATCCCAAGGATTAGTGACCTGCTTGATCAACTCGGCAAAGCCAGCATTTTTTCGAAGATTGATTTAAAAAGTGGCTACC
ACCAAATACGTATAAGACCTGGCGATGAATGGAAAACAACTTTCAAGACAAACGAAGGCTTATTTGAATGGATGCACAAACAACGAGGAGCATTTACTACATCTAAGAAA
AATGTTCCAGGCTTGACAGAGACAGAACTCTACATCAACACTAAGAAAAGCATGTTTATGAAAAGAGAAATTGCATTCCTCGATTTTGTAATCAAACAAGGAAGCATAAG
CATGGAACCAAAGAAGATTGAAGCCATCCACACATGGCCGATTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTTGGCCTGGCTTCATTTTACAGAAAATTCATCAGGA
ACTTCAACTCTTTAGCCGCACCCCTCACCGACTACTTCTCTTCACCTTTTGAAGTAGAAGTCGACGCATGCTGCACAGGGATTGGAGTTGTCCTAGCTCAGCAAGGACAC
CCTATCGAATACTTCAGTGAAAAGCTCAACCCCTCAAGACAGTCATGGAGCACATATGAACAAGAGTTGTATGCCCTTGTGCGAGCACTAAAACAATGGGAGCACTACCT
ACTCTCCAAAGAATTCGTACTCCTAACTGATCACTTCTCACTAAAGCAAAGACAACAAGGTGGCCGATGCCCTAAGCAGAAAAGGCTTCCTACTCACATTGTTGTCTTCG
AAAATCATAGCATTCAAGCATTTACCCGACCTATACGAAGAAGATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTCGAAGAGGTAAAAACCCTGCCGCGGGGGAAAACCGTACGCAAGAAGTAGCGGAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGC
TGTTGAGGAATCCTTGGGAGATCTCCGTAACATATTTGATAGATTGATAGAAAGCGTCGAATTGTTAAGCCGAAGGAAAGAATACCCACAACCACCACCACGGAACGAAA
TCAACTTCCAAAACAACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGAAAACTTCAGAAACGTGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGGTAC
GCCATACCACAACAGTTTGACGAAGATTTTCAAGAAGACCAAGAAGTATGGCAAGAAATCCAAGAAGATGATTCTTCAAGTGGGGATGAACAAGGAAACATGTGGAACTT
CAATGATGACTTGCGAGCAGGAAGAAATAACCAAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCGTGTATGATGGCAAACAAA
ATATAGAAGCATTCCTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGATATACCCGAACGCAAGAAAGTCCATCTAGTAGCCTTAAAGTTAAGAGCCGGT
GCATCAACTTGGTGGGATCAATTGGAAATTAACAGACAAAGATGTGGGAAACAGTCGATCCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCC
AAACTATGAACAAACACTCTACAATCAGTACCAAAACTGTCGCCAAGGTGTCCGTTCAGTAGTTGATTACATTGAAGAATTCCACTGCCTGAGTGCAAGAACGAACCTGA
GCGAAAATGAACAACACCAGATTGCAAGATTTGTGGGAGGTCTCCGACTCGACATCAAGGAAAAAGTCAAACTACAACCATTCCGTTTCTTGTCTGAAGCAATATCCTTT
GCAGAAACAGTGGAAGAAATGATTGCGGTTCGATCCAAAAACCTAAAGAGAAGACCAGCATGGGAGACAACTTCAACAAGAATGAACAATTATGCGGACAAAACAAACGA
CCAACCCTCAACCTCAACAAAAGGAAAAGGGAAGGAAGTTGAAAATCAAGAAGTAGCCGTTGAAAGAAAGAATGAACAAACATTCAAAACCAGTAGTCAGAACAACTACT
CCCGCCCTTTATTAGGAAAATTCTTCCGATGTGGCCAAACTGAACACCTCTCCAACAACTGCCCGCAAAGAAAAACCATAGCAATAGCCGAAGAAGGAAGGCAGATGAGT
GAAGATAGTAAAGAAGCAGAAGACGAAACTGAACTGATTGAAGCAGATGACGAGGAAAGGGTCTCTTGTGTCATCCAACGGGCAAGATGCACCATAAACGGAAGGGTATG
TGATGTAATCATAGACAACGACAGTAGCAAAAACTTCGTAGCAAAGAAACTAGTAACAGTCTTGAACCTAAAGGCTGAAGCACATCCAACCCCCTACAAGATAGGTTGGG
TAAGAAAAGGAGGAGAAGTCACGGTTAGCGAAATCTGCACAGTCCCTCTCTCCATTGAAAACGCCTACAAAGACCAAATTGTTTGTGACGTCATTGAGATGGACGTATGC
CATCTCCTATTAGGAAGACCTTGGCAGTATGATACCCAATCCTTACACAAAGGAAGAGAAAATACGTATGAATTACAATGGATGGGGAGAAAGGTAGTTCTACTCCCAAT
AACAAGAAAGAATAAGGAAGGATTAAGAGGTGAGAAACAACTATTCACCACCCAGCTCCTTTATGAGTTCCCACGCATAAAGGAAGAACCAGAGGGACTCCCACCTCTTC
GAGACATACAGCACCACATAGACTTGATCTCGGGAGCATCATTACCAAACTTGGCTCACTATAGGATGAGTCCCCAGGAGTACAAAACACTTCATGACCATATTGAGGAA
CTATTAAAGAAAGGGCACATCCAACCGAGCCTCAGCCCTTGTGCAGTACCAGCCCTTCTCACACCAAAGAAATATGGGAGTTGGAGAATGTGTGTTGACAGCAGAGCCAT
CAATCGTATCACGGTAAAGTATAGATTTTCCATCCCAAGGATTAGTGACCTGCTTGATCAACTCGGCAAAGCCAGCATTTTTTCGAAGATTGATTTAAAAAGTGGCTACC
ACCAAATACGTATAAGACCTGGCGATGAATGGAAAACAACTTTCAAGACAAACGAAGGCTTATTTGAATGGATGCACAAACAACGAGGAGCATTTACTACATCTAAGAAA
AATGTTCCAGGCTTGACAGAGACAGAACTCTACATCAACACTAAGAAAAGCATGTTTATGAAAAGAGAAATTGCATTCCTCGATTTTGTAATCAAACAAGGAAGCATAAG
CATGGAACCAAAGAAGATTGAAGCCATCCACACATGGCCGATTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTTGGCCTGGCTTCATTTTACAGAAAATTCATCAGGA
ACTTCAACTCTTTAGCCGCACCCCTCACCGACTACTTCTCTTCACCTTTTGAAGTAGAAGTCGACGCATGCTGCACAGGGATTGGAGTTGTCCTAGCTCAGCAAGGACAC
CCTATCGAATACTTCAGTGAAAAGCTCAACCCCTCAAGACAGTCATGGAGCACATATGAACAAGAGTTGTATGCCCTTGTGCGAGCACTAAAACAATGGGAGCACTACCT
ACTCTCCAAAGAATTCGTACTCCTAACTGATCACTTCTCACTAAAGCAAAGACAACAAGGTGGCCGATGCCCTAAGCAGAAAAGGCTTCCTACTCACATTGTTGTCTTCG
AAAATCATAGCATTCAAGCATTTACCCGACCTATACGAAGAAGATATTGA
Protein sequenceShow/hide protein sequence
MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGY
AIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAG
ASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISF
AETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMS
EDSKEAEDETELIEADDEERVSCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVC
HLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEE
LLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKK
NVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGH
PIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQQGGRCPKQKRLPTHIVVFENHSIQAFTRPIRRRY