; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G22210 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G22210
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr1:17758379..17762649
RNA-Seq ExpressionCSPI01G22210
SyntenyCSPI01G22210
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]0.0e+0060.68Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++A+ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KN+ ILE LP ETPL KG+DNS  KSLATT
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        EPHES  TFN GKGEAYTS+TK +ILKDE AAN PVL YVPLS RK  ESP +ESPKGLKV DIE++KESFTT LTKI KQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD
                                               LL+EGH+I VSRKGLGYKS E I IT+KGKE+VVD NHI IEE DNTD KEGDNQ  S FD
Subjt:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD

Query:  RIRPHVACTLA--------------------------------------------------------------------------------NYDSH---K
        RIRP VA  +                                                                                 N DS+   K
Subjt:  RIRPHVACTLA--------------------------------------------------------------------------------NYDSH---K

Query:  GEKHMSGLNNYKTISLSKA----SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH
         ++ MS +  ++ I  +      SK+FPC+TKE GEIHSNVPSRMKRKTF+TLN SQGSLKVKRH+VILTNPEKE SEQG+GET CHHITIIEESET TH
Subjt:  GEKHMSGLNNYKTISLSKA----SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH

Query:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL
         ED EN PQS EDGGQ TVDELKEVNLG IEEP PT +SA LSNEE DKYMSLLT YR+IFAWSYKEMPGLDPKVAVHHL IK GYR IKQAQRRF+ EL
Subjt:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL

Query:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------
        IPQIE                              NGQLRV VDF DLNNAC KDDFPLPITEIMVDATTGHE                           
Subjt:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------

Query:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID
                               R MQK+FDDMLH+YVECYVDDLV+K+KRRQDHLKDLKVVFDRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIEID
Subjt:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID

Query:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL
        QSKI+ IQKM RPK+                                           CQNAFDSIKKYLLTP VLG P+  KPLILYIA QE+SL ALL
Subjt:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL

Query:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA
        AQEE KGKER+LYYLSRTLIGAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPIIS  LAKWAVLLQQYDIVYI QKAIK  ALA
Subjt:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA

Query:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        DFLADHPIPSDWKLC +L +DEV +TEVMEPWTMYFDGAAR SGAGA IVLIS EKHMLPY+F L E CSNNVA+Y ALII LQ+ LEI VS IE+
Subjt:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]0.0e+0059.95Show/hide
Query:  VVRSTLLV---EANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKD
        ++R  L++   +A+ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KN+ ILE LP ETPL KG+D
Subjt:  VVRSTLLV---EANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKD

Query:  NSH-KSLATTEPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE--------
        NS  KSLATTEPHES  TFN GKGEAYTS+TK +ILKDE AAN PVL YVPLS RK  ESP +ESPKGLKV DIE++KESFTT LTKI KQE        
Subjt:  NSH-KSLATTEPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE--------

Query:  -------------------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKE
                                                         LL+EGH+I VSRKGLGYKS E I IT+KGKE+VVD NHI IEE DNTD KE
Subjt:  -------------------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKE

Query:  GDNQITSTFDRIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA------------
        GDNQ  S FDRIRP VA                   ++ N + H   + ++                        G++  K +   +A            
Subjt:  GDNQITSTFDRIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA------------

Query:  ---------------------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHIT
                                         SK+FPC+TKE GEIHSNVPSRMKRKTF+TLN SQGSLKVKRH+VILTNPEKE SEQG+ ET CHHIT
Subjt:  ---------------------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHIT

Query:  IIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIK
        IIEESET TH ED EN PQS EDGGQ TVDELKEVNLG IEEP PT +SA LSNEE DKYMSLLT YR+IFAWSYKEMPGLDPKVAVHHL IK GYR IK
Subjt:  IIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIK

Query:  QAQRRFQAELIPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------
        QAQRRF+ ELIPQIE                              NGQLRV VDF DLNNAC KDDFPLPITEIMVDATTGHE                 
Subjt:  QAQRRFQAELIPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------

Query:  ---------------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGF
                                         R MQK+FDDMLH+YVECYVDDLV+K+KRRQDHLKDLKVVFDRL+KY+LRMN LKCAFGVTSGKFLGF
Subjt:  ---------------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGF

Query:  IVRHRGIEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIA
        IVRHRGIEIDQSKI+ IQKM RPK+                                           CQNAFDSIKKYLLTP VLG P+  KPLILYIA
Subjt:  IVRHRGIEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIA

Query:  TQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYIL
         QE+SL ALLAQEE KGKER+LYYLSRTLIGAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPII+  LAKWAVLLQQYDIVYI 
Subjt:  TQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYIL

Query:  QKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIE
        QKAIK  ALADFLADHPIPSDWKLC +L +DEV +TEVMEPWTMYFDGAAR SGAGA IVLIS EKHMLPY+F L E CSNNVA+Y ALII LQ+ LEI 
Subjt:  QKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIE

Query:  VSLIEI
        VS IE+
Subjt:  VSLIEI

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]0.0e+0060.31Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++A+ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KN+ ILE LP ETPL KG+DNS  KSLATT
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        EPHES  TFN GKGEAYTS+TK +ILKDE AAN PVL YVPLS RK  ESP +ESPKGLKV DIE++KESFTT LTKI KQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD
                                               LL+EGH+I VSRKGLGYKS E I IT+KGKE+VVD NHI IEE DNTD KEGDNQ  S FD
Subjt:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD

Query:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------
        RIRP VA                   ++ N + H   + ++                        G++  K +   +A                      
Subjt:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------

Query:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH
                               SK+FPC+TKE GEIHSNVPSRMKRKTF+TLN SQGSLKVKRH+VILTNPEKE SEQG+ ET CHHITIIEESET TH
Subjt:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH

Query:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL
         ED EN PQS EDGGQ TVDELKEVNLG IEEP PT +SA LSNEE DKYMSLLT YR+IFAWSYKEMPGLDPKVAVHHL IK GYR IKQAQRRF+ EL
Subjt:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL

Query:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------
        IPQIE                              NGQLRV VDF DLNNAC KDDFPLPITEIMVDATTGHE                           
Subjt:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------

Query:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID
                               R MQK+FDDMLH+YVECYVDDLV+K+KRRQDHLKDLKVVFDRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIEID
Subjt:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID

Query:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL
        QSKI+ IQKM RPK+                                           CQNAFDSIKKYLLTP VLG P+  KPLILYIA QE+SL ALL
Subjt:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL

Query:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA
        AQEE KGKER+LYYLSRTLIGAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPII+  LAKWAVLLQQYDIVYI QKAIK  ALA
Subjt:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA

Query:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        DFLADHPIPSDWKLC +L +DEV +TEVMEPWTMYFDGAAR SGAGA IVLIS EKHMLPY+F L E CSNNVA+Y ALII LQ+ LEI VS IE+
Subjt:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]0.0e+0060.31Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++A+ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KN+ ILE LP ETPL KG+DNS  KSLATT
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        EPHES  TFN GKGEAYTS+TK +ILKDE AAN PVL YVPLS RK  ESP +ESPKGLKV DIE++KESFTT LTKI KQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD
                                               LL+EGH+I VSRKGLGYKS E I IT+KGKE+VVD NHI IEE DNTD KEGDNQ  S FD
Subjt:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD

Query:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------
        RIRP VA                   ++ N + H   + ++                        G++  K +   +A                      
Subjt:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------

Query:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH
                               SK+FPC+TKE GEIHSNVPSRMKRKTF+TLN SQGSLKVKRH+VILTNPEKE SEQG+ ET CHHITIIEESET TH
Subjt:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH

Query:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL
         ED EN PQS EDGGQ TVDELKEVNLG IEEP PT +SA LSNEE DKYMSLLT YR+IFAWSYKEMPGLDPKVAVHHL IK GYR IKQAQRRF+ EL
Subjt:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL

Query:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------
        IPQIE                              NGQLRV VDF DLNNAC KDDFPLPITEIMVDATTGHE                           
Subjt:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------

Query:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID
                               R MQK+FDDMLH+YVECYVDDLV+K+KRRQDHLKDLKVVFDRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIEID
Subjt:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID

Query:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL
        QSKI+ IQKM RPK+                                           CQNAFDSIKKYLLTP VLG P+  KPLILYIA QE+SL ALL
Subjt:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL

Query:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA
        AQEE KGKER+LYYLSRTLIGAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPII+  LAKWAVLLQQYDIVYI QKAIK  ALA
Subjt:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA

Query:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        DFLADHPIPSDWKLC +L +DEV +TEVMEPWTMYFDGAAR SGAGA IVLIS EKHMLPY+F L E CSNNVA+Y ALII LQ+ LEI VS IE+
Subjt:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]0.0e+0060.4Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++A+ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KN+ ILE LP ETPL KG+DNS  KSLATT
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        EPHES  TFN GKGEAYTSNTK +ILKDE AAN PVL YVPLS RK  ESP +ESPKGLKV DIE++KESFTT LTKI KQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD
                                               LL+EGH+I VSRKGLGYKS E I IT+KGKE+VVD NHI IEE DNTD KEGDNQ  S FD
Subjt:  ---------------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQITSTFD

Query:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------
        RIRP VA                   ++ N + H   + ++                        G++  K +   +A                      
Subjt:  RIRPHVA------------------CTLANYDSHKGEKHMS------------------------GLNNYKTISLSKA----------------------

Query:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH
                               SK+FPC+TKE GEIHSNVPSRMKRKTF+TLN SQGSLKVKRH+VILTNPEKE SEQG+ ET CHHITIIEESET TH
Subjt:  -----------------------SKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTH

Query:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL
         ED EN PQS EDGGQ TVDELKEVNLG IEEP PT +SA LSNEE DKYMSLLT YR+IFAWSYKEMPGLDPKVAVHHL IK GYR IKQAQRRF+ EL
Subjt:  AEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAEL

Query:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------
        IPQIE                              NGQLRV VDF DLNNAC KDDFPLPITEIMVDATTGHE                           
Subjt:  IPQIE------------------------------NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE---------------------------

Query:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID
                               R MQK+FDDMLH+YVECYVDDLV+K+KRRQDHLKDLKVVFDRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIEID
Subjt:  -----------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEID

Query:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL
        QSKI+ IQKM RPK+                                           CQNAFDSIKKYLLTP VLG P+  KPLILYIA QE+SL ALL
Subjt:  QSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALL

Query:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA
        AQEE KGKER+LYYLSRTLIGAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPII+  LAKWAVLLQQYDIVYI QKAIK  ALA
Subjt:  AQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALA

Query:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        DFLADHPIPSDWKLC +L +DEV +TEVMEPWTMYFDGAAR SGAGA IVLIS EKHMLPY+F L E CSNNVA+Y ALII LQ+ LEI VS IE+
Subjt:  DFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

TrEMBL top hitse value%identityAlignment
A0A5A7T485 Reverse transcriptase4.7e-24552.7Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++ +ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KND   EA+ VE PL+  +DN   KSL + 
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        E H+S GTF+ GK EA TS  KSVIL DEK +NPP+L YVPLS  K  ESP VESP+GLKV DIEVLKESFTT LTKITKQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  --------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQIT--STFDRI---
                                        LL+EGH I +SRKGLGYK    I ITRKGKE+VVDSNHI ++EVD   +K+  +  T  S F R+   
Subjt:  --------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQIT--STFDRI---

Query:  -RPHVACTLANYDSHKGEKHM-----SGLNNYKTISLSKAS---------------KEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVI
         + +     A+  +H G+  +     S ++  K  S S+ S               KEFPC+ K E EI SNVPSRMKRKTF+TLN SQGSLKVKRH+VI
Subjt:  -RPHVACTLANYDSHKGEKHM-----SGLNNYKTISLSKAS---------------KEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVI

Query:  LTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEM
        LTNPEKE+SEQG+GE  CHHITI+EE E +T  ED E+ PQS EDGGQ TVD+LKEVNLG IEEP  T +SA LS+EEE+                    
Subjt:  LTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEM

Query:  PGLDPKVAVH-HLVIKLGY-RLIKQAQRRFQAELIPQIENGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------------
          L P+V V  + +++ G+ R +K         L+ + +NGQL V VDF DLNNAC KDDF LPITEIMVDATTGHE                       
Subjt:  PGLDPKVAVH-HLVIKLGY-RLIKQAQRRFQAELIPQIENGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------------

Query:  ---------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRG
                                   R MQ +FDDMLHKYVECY                             LRMN LKCAFGVTSGKFLGFIVRH+G
Subjt:  ---------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRG

Query:  IEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSL
        IEIDQSKI+ IQKMPRPK+                                           CQN FDSIKKYLL P VLG  + G+PLILYIA QE+SL
Subjt:  IEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSL

Query:  RALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKR
         ALLAQE++KGKERALYYLS+TL+GAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPIIS  LAKWAV+LQQYDIVYI QK IK 
Subjt:  RALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKR

Query:  HALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
         AL DFLADHPIPSDWKLC +L +DEV +TEV+EPWT+                            FVL E CSNNVA+Y ALII LQM LEI VS IEI
Subjt:  HALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

A0A5A7TZU9 Ribonuclease H9.5e-24648.12Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE
        ++A+ +F+VIDS TTY +LLG PWIH N +VTSTLHQCFKFY+ G+KKV+ DS PF +A+SHFA AKF  K++ + E +  E P+ KG   + + + T++
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE

Query:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------
                N  +    T+ TK    + EK A       NPPVL Y+PLS RK  ESP  E  K L V + E+LKE+FT  LTKI                
Subjt:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------

Query:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K
                                                     T+++L ++G++I  SR G+GY+S E + IT KGK +V ++ HI +EE  +++E K
Subjt:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K

Query:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE
        +  +Q +S FDR     IRP     V+ ++A   N  S      +S                         +K +S+S      KAS     K+     +
Subjt:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE

Query:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH
         EI S  PSRMKRK F+++N ++GSLKVKRH+V+ T PE  E E       C+H+TI E S+     ED E  P S EDGGQ T+DELKEVNLG  EEP 
Subjt:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH

Query:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------
        PT +S  LS+ +E++Y++LL  Y+++FAWSYKEMPGLDPKVAVH L IK  +R +KQAQRRF+ ELI QIE                             
Subjt:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------

Query:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML
         NGQLRV VDF DLNNAC KDDFPLPI EIM+DAT GHE                                                  R MQ+IFDDML
Subjt:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML

Query:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------
        HK+VECYVDDLV+KSK++ DHLKDLK+V DRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIE+D SKI+ IQKMP PK                    
Subjt:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------

Query:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL
                               +CQNAFDSIKKYLL P VL  P +GKPLILYIA QE SL ALLAQE  KGKE ALYYLSRTL GAE+NYS I KMCL
Subjt:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL

Query:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM
        ALFFAIDKLRHYMQAF  H VAKAD +KY+LSRP+IS  LAKWA++LQQYDIVYI QKA+K  ALADFLADHP+PS+WKLC +L ++EVL+ E MEPW M
Subjt:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM

Query:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        +FDGAAR SGAG  IV IS EKHMLPY+F L E CSNNVA+Y A II LQM  E  +  IEI
Subjt:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

A0A5D3BIH8 Uncharacterized protein1.8e-24447.83Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE
        ++A+ +F+VIDS TTY +LLG PWIH N +VTSTLHQCFKFY+ G+KKV+ DS PF +A+SHFA AKF  K++ + E +  E P+ KG   + + + T++
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE

Query:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------
                N  +    T+ TK    + EK A       NPPVL Y+PLS RK  ESP  E  K L V + E+LKE+FT  LTKI                
Subjt:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------

Query:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K
                                                     T+++L ++G++I  SR G+GY+S E + IT KGK +V ++ HI +EE  +++E K
Subjt:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K

Query:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE
        +  +Q +S FDR     IRP     V+ ++A   N  S      +S                         +K +S+S      KAS     K+     +
Subjt:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE

Query:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH
         EI S  PSRMKRK F+++N ++GSLKVKRH+V+ T PE  E E       C+H+TI E S+     ED E  P S EDGGQ T+DELKEVNLG  EEP 
Subjt:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH

Query:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------
        PT +S  LS+ +E++Y++LL  Y+++FAWSYKEMPGLDPKVAVH L IK  +R +KQAQRRF+ ELI QIE                             
Subjt:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------

Query:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML
         NGQLRV VDF DLNNAC KDDFPLPI EIM+DAT GHE                                                  R MQ+IFDDML
Subjt:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML

Query:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------
        HK++ECYVDDLV+KSK++ DHLKDLK+V DRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIE+D SKI+ IQKMP PK                    
Subjt:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------

Query:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL
                               +CQNAFDSIKKYLL P VL  P +GKPLILYIA QE SL ALLAQE  KGKE ALYYLSRTL GAE+NYS I KMCL
Subjt:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL

Query:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM
        ALFFAIDKLRHYMQAF  H VAKAD +KY+LSRP+IS  LAKWA++LQQYDIVYI QKA+K  ALADFLADHP+PS+WKLC +L ++EVL+ E MEPW M
Subjt:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM

Query:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        +FDGAAR SGAG  IV IS +KHMLPY+F L E CSNNV +Y A II LQM  E  +  IEI
Subjt:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

A0A5D3BV77 Reverse transcriptase2.3e-24452.5Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT
        ++ +ALF+VIDS TTY LLLG PWIHGN VVTSTLHQCFKFYQDGVKKVE DSNPF EA+SHFA AKF  KND   EA+ VE PL+  +DN   KSL + 
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSH-KSLATT

Query:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------
        E H+S GTF+ GK EA TS  KSVIL DEK +NPP+L YVPLS  K  ESP VESP+GLKV DIE+LKE FTT LTKITKQE                  
Subjt:  EPHESVGTFNFGKGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQE------------------

Query:  --------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQIT--STFDRI---
                                        LL+EGH I +SRKGLGYK    I ITRKGKE+VVDSNHI ++EVD   +K+  +  T  S F R+   
Subjt:  --------------------------------LLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDEKEGDNQIT--STFDRI---

Query:  -RPHVACTLANYDSHKGEKHM-----SGLNNYKTISLSKAS---------------KEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVI
         + +     A+  +H G+  +     S ++  K  S S+ S               KEFPC+ K E EI SNVPSRMKRKTF+TLN SQGSLKVKRH+VI
Subjt:  -RPHVACTLANYDSHKGEKHM-----SGLNNYKTISLSKAS---------------KEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVI

Query:  LTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEM
        LTNPEKE+SEQG+GE  CHHITI+EE E +T  ED E+ PQS EDGGQ TVD+LKEVNLG IEEP  T +SA LS+EEE+                    
Subjt:  LTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEM

Query:  PGLDPKVAVH-HLVIKLGY-RLIKQAQRRFQAELIPQIENGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------------
          L P+V V  + +++ G+ R +K         L+ + +NGQL V VDF DLNNAC KDDF LPITEIMVDATTGHE                       
Subjt:  PGLDPKVAVH-HLVIKLGY-RLIKQAQRRFQAELIPQIENGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE-----------------------

Query:  ---------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRG
                                   R MQ +FDDMLHKYVECY                             LRMN LKCAFGVTSGKFLGFIVRH+G
Subjt:  ---------------------------RVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRG

Query:  IEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSL
        IEIDQSKI+ IQKMPRPK+                                           CQN FDSIKKYLL P VLG  + G+PLILYIA QE+SL
Subjt:  IEIDQSKINVIQKMPRPKT-------------------------------------------CQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSL

Query:  RALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKR
         ALLAQE++KGKERALYYLS+TL+GAEVNYS I KMCLALFFAIDKLRHYMQAF  H VAKAD IKYVLSRPIIS  LAKWAV+LQQYDIVYI QK IK 
Subjt:  RALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKR

Query:  HALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
         AL DFLADHPIPSDWKLC +L +DEV +TEV+EPWT+                            FVL E CSNNVA+Y ALII LQM LEI VS IEI
Subjt:  HALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

A0A5D3D1E5 Ribonuclease H9.5e-24648.12Show/hide
Query:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE
        ++A+ +F+VIDS TTY +LLG PWIH N +VTSTLHQCFKFY+ G+KKV+ DS PF +A+SHFA AKF  K++ + E +  E P+ KG   + + + T++
Subjt:  VEANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTE

Query:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------
                N  +    T+ TK    + EK A       NPPVL Y+PLS RK  ESP  E  K L V + E+LKE+FT  LTKI                
Subjt:  PHESVGTFNFGKGEAYTSNTKSVILKDEKAA-------NPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKI----------------

Query:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K
                                                     T+++L ++G++I  SR G+GY+S E + IT KGK +V ++ HI +EE  +++E K
Subjt:  ---------------------------------------------TKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIAIEEVDNTDE-K

Query:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE
        +  +Q +S FDR     IRP     V+ ++A   N  S      +S                         +K +S+S      KAS     K+     +
Subjt:  EGDNQITSTFDR-----IRP----HVACTLA---NYDSHKGEKHMSGLNN---------------------YKTISLS------KASKEFPCKTK---EE

Query:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH
         EI S  PSRMKRK F+++N ++GSLKVKRH+V+ T PE  E E       C+H+TI E S+     ED E  P S EDGGQ T+DELKEVNLG  EEP 
Subjt:  GEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEESEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPH

Query:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------
        PT +S  LS+ +E++Y++LL  Y+++FAWSYKEMPGLDPKVAVH L IK  +R +KQAQRRF+ ELI QIE                             
Subjt:  PTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGYRLIKQAQRRFQAELIPQIE-----------------------------

Query:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML
         NGQLRV VDF DLNNAC KDDFPLPI EIM+DAT GHE                                                  R MQ+IFDDML
Subjt:  -NGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHE--------------------------------------------------RVMQKIFDDML

Query:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------
        HK+VECYVDDLV+KSK++ DHLKDLK+V DRL+KY+LRMN LKCAFGVTSGKFLGFIVRHRGIE+D SKI+ IQKMP PK                    
Subjt:  HKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPK--------------------

Query:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL
                               +CQNAFDSIKKYLL P VL  P +GKPLILYIA QE SL ALLAQE  KGKE ALYYLSRTL GAE+NYS I KMCL
Subjt:  -----------------------TCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLIGKMCL

Query:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM
        ALFFAIDKLRHYMQAF  H VAKAD +KY+LSRP+IS  LAKWA++LQQYDIVYI QKA+K  ALADFLADHP+PS+WKLC +L ++EVL+ E MEPW M
Subjt:  ALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTM

Query:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI
        +FDGAAR SGAG  IV IS EKHMLPY+F L E CSNNVA+Y A II LQM  E  +  IEI
Subjt:  YFDGAARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.2e-1525.95Show/hide
Query:  DATTGHERVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMP---RP
        +A    +R M  I   +L+K+   Y+DD+++ S    +HL+ L +VF++L K  L++   KC F      FLG ++   GI+ +  KI  IQK P   +P
Subjt:  DATTGHERVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMP---RP

Query:  KTCQ-----------------------------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYL
        K  +                                         +AF  +K  +    +L VP   K   L     + +L A+L+Q+        L Y+
Subjt:  KTCQ-----------------------------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYL

Query:  SRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA
        SRTL   E+NYS I K  LA+ +A    RHY+    F   +    + ++      +  L +W V L ++D      K  K + +AD L+
Subjt:  SRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA

P0CT34 Transposon Tf2-1 polyprotein3.7e-1325.6Show/hide
Query:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------
        I  +    +V CY+DD++I SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +  +G    Q  I+ V+Q K P+              
Subjt:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------

Query:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL
             PKT Q                       A ++IK+ L++P VL      K ++L     + ++ A+L+Q+    K   + Y S  +  A++NYS+
Subjt:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL

Query:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD
          K  LA+  ++   RHY+++ +       DH   +      S P  ++ LA+W + LQ ++     +     H +AD L+       PIP D
Subjt:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD

P0CT35 Transposon Tf2-2 polyprotein3.7e-1325.6Show/hide
Query:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------
        I  +    +V CY+DD++I SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +  +G    Q  I+ V+Q K P+              
Subjt:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------

Query:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL
             PKT Q                       A ++IK+ L++P VL      K ++L     + ++ A+L+Q+    K   + Y S  +  A++NYS+
Subjt:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL

Query:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD
          K  LA+  ++   RHY+++ +       DH   +      S P  ++ LA+W + LQ ++     +     H +AD L+       PIP D
Subjt:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD

P0CT41 Transposon Tf2-12 polyprotein3.7e-1325.6Show/hide
Query:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------
        I  +    +V CY+DD++I SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +  +G    Q  I+ V+Q K P+              
Subjt:  IFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKIN-VIQ-KMPR--------------

Query:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL
             PKT Q                       A ++IK+ L++P VL      K ++L     + ++ A+L+Q+    K   + Y S  +  A++NYS+
Subjt:  -----PKTCQ----------------------NAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSL

Query:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD
          K  LA+  ++   RHY+++ +       DH   +      S P  ++ LA+W + LQ ++     +     H +AD L+       PIP D
Subjt:  IGKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYV-----LSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLA-----DHPIPSD

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.6e-1624.67Show/hide
Query:  VMQKIFDDMLHKYV----ECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRP-------
        + Q++ DD+L +++      Y+DD+++ S+    H K+L++V   L K  L++N  K  F  T  +FLG+IV   GI+ D  K+  I +MP P       
Subjt:  VMQKIFDDMLHKYV----ECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSLKCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRP-------

Query:  -----------------------------------------------KTCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKE
                                                       +T   +F+ +K  L +  +L  P   KP  L       ++ A+L+Q++ +G++
Subjt:  -----------------------------------------------KTCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKE

Query:  RALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADH--IKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHP
        R + Y+SR+L   E NY+ I K  LA+ +++D LR Y+       V   DH  + + L     +  L +W   +++Y+   ++ K  K + +AD L+  P
Subjt:  RALYYLSRTLIGAEVNYSLIGKMCLALFFAIDKLRHYMQAFMFHPVAKADH--IKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTGCCGTCAACGATTCAACTTCCTCTAAGGGACGATCGAACTTTTGCCGTTAATTCCATCGCCCTAATCCACCTCCCTAGTAAAAACGATCGGGCTTCTTCATC
CTCAGCGATCCTACTTCAAGGGGACGATCGGGTTTTTGTCGTCAGATCCACCTTCCTAGTAAAAGCGATCGAGCTTCTTCGTTGTCGGGCTTCTTCGTCGTCGAACTTTT
TTGCCTTCAGCGATCAAACTTCTTCTAGGGAACGATCGACCTTTTGGCATCAGATCCAACGCTCTAGTAAAAGCGATCGAACCTCTTCGCTGTCAAGCTTTTTTGTCGTC
AGTGATCCAACTTCTTTTAGGGAACGATCGGGCTTTTACCATCAAATCTACCTTCCTAGTAGAAGCAATCAGGCTTGTTCGTCGTCATCGATCCAACTTCTTCTAGGGGA
TGATCGAGCTTTTGCCGTCAGATCCACCACCCTAGTAGAAGTGATCGGCCTTCTTCGTCGTCGGTTTTCTTCGTTGTCAGCGATCCAACTTCTTTTAAGGGGCGATCGGA
CTTTTGCCATCCACCTCCCTAGTAGAAACGATCAGACTTCTTCATCGTCAGCAATCCAACTTCTTCTACAGAACGATCGGGCTTTTGTCGTTAGATCTACCTTACTAGTA
GAAGCCAATGCATTGTTTTATGTCATAGACTCATGGACCACTTACACGTTGTTACTAGGTTGTCCTTGGATTCATGGAAATAGAGTAGTAACTTCAACACTGCATCAGTG
CTTCAAATTTTATCAAGACGGTGTAAAGAAGGTTGAAGTCGACTCTAATCCGTTCTTAGAGGCTAAGTCTCACTTCGCATATGCAAAGTTTTGTCCGAAGAATGATATTA
TCTTAGAAGCTTTGCCAGTAGAAACCCCTCTTATAAAAGGAAAAGATAATTCACATAAATCACTTGCAACCACAGAACCACATGAAAGTGTGGGAACCTTTAACTTTGGA
AAGGGTGAAGCATACACTAGCAACACAAAGAGTGTGATCTTGAAGGATGAAAAGGCTGCAAACCCACCAGTTTTGCACTATGTTCCTCTGTCAATGCGCAAGATAGACGA
ATCACCACTCGTTGAGTCTCCAAAAGGTTTGAAGGTTGATGACATTGAAGTCTTAAAAGAAAGCTTCACTACAAAATTAACTAAGATAACGAAGCAGGAGCTTTTACAAG
AAGGACATGCTATACACGTGTCGAGAAAAGGGCTCGGATACAAGTCGCAAGAGTCGATCCATATAACTAGAAAGGGGAAGGAAAGAGTGGTTGATAGCAATCATATAGCT
ATAGAGGAGGTTGACAATACTGACGAAAAAGAAGGCGATAATCAGATAACCTCAACATTTGATCGAATTAGACCACATGTTGCATGCACCCTAGCTAACTATGACTCTCA
TAAAGGAGAAAAGCACATGTCAGGCCTCAACAACTACAAGACCATCAGCCTTTCAAAGGCTAGTAAGGAGTTTCCTTGCAAGACAAAAGAAGAGGGAGAAATTCATAGCA
ATGTTCCTTCACGAATGAAAAGAAAAACATTTATTACTCTCAATAGAAGTCAAGGTTCGTTGAAGGTAAAAAGACATAATGTTATACTAACGAATCCTGAAAAAGAAGAG
TCAGAACAAGGGAAAGGTGAAACTCCGTGTCATCACATCACCATTATTGAGGAATCAGAGACTAAAACTCATGCAGAAGACGTAGAAAATGTCCCACAGAGTTCAGAGGA
TGGTGGCCAATATACTGTAGATGAGCTAAAAGAGGTGAACCTTGGTAAAATAGAGGAACCACACCCGACTTTGGTTAGTGCATTCCTCTCTAACGAGGAGGAGGATAAAT
ACATGAGTTTGCTCACCAGATACAGGAACATCTTTGCTTGGTCGTACAAGGAGATGCCAGGACTTGATCCAAAAGTAGCAGTCCATCATCTTGTAATTAAACTAGGGTAT
CGATTGATTAAACAAGCACAACGACGTTTTCAAGCGGAGCTTATCCCTCAAATCGAGAACGGACAACTTCGCGTTAGTGTAGACTTTTGCGACCTGAATAATGCATGCCA
TAAAGATGATTTTCCCTTACCCATCACAGAAATCATGGTTGATGCAACTACTGGACACGAGCGTGTCATGCAGAAAATCTTTGATGATATGTTGCATAAGTATGTTGAAT
GTTATGTTGATGACCTTGTAATCAAATCAAAGAGACGACAAGACCATTTGAAGGATCTAAAAGTTGTGTTCGACCGCTTGCAAAAATATAAGTTAAGGATGAACTCTCTC
AAGTGTGCGTTTGGTGTGACTTCAGGAAAGTTTCTTGGATTCATTGTAAGACATCGGGGGATCGAAATAGACCAGTCCAAGATTAATGTCATTCAAAAGATGCCAAGACC
TAAGACTTGTCAAAATGCTTTTGATAGCATAAAGAAATACTTGCTTACTCCCTCAGTGTTGGGAGTTCCAATATCTGGCAAGCCATTAATATTGTACATTGCTACACAAG
AAAAGTCCTTACGAGCATTACTGGCACAAGAGGAGAAAAAGGGAAAGGAACGTGCTCTCTACTATCTAAGCAGAACATTAATTGGAGCTGAAGTTAACTATTCCCTCATC
GGGAAGATGTGCCTCGCACTTTTCTTTGCCATTGATAAGTTAAGGCATTATATGCAGGCCTTCATGTTTCATCCAGTGGCAAAAGCAGACCATATAAAGTATGTTCTGTC
CAGGCCAATCATCTCTGAATACTTAGCCAAATGGGCGGTTCTACTCCAACAATATGACATTGTCTATATTCTCCAAAAGGCGATAAAAAGACATGCGCTAGCAGACTTTT
TAGCAGACCACCCAATTCCTTCAGATTGGAAGTTATGTGTAAACCTGTCAAACGATGAAGTTCTCTACACGGAAGTTATGGAACCTTGGACTATGTATTTCGATGGTGCA
GCGCGAATAAGTGGTGCAGGGGCAAGCATCGTCCTCATTTCTCTTGAGAAGCATATGTTGCCTTATAACTTTGTACTTGTTGAACGATGTTCAAACAATGTGGCTAAATA
TTCGGCTTTGATAATTGACCTTCAAATGACATTGGAAATCGAAGTATCACTCATAGAGATACGTAGGCAGCTTAAGAAAAACCTTAAGCCCAGTCCAAAAAAAATAAAAA
AATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTGCCGTCAACGATTCAACTTCCTCTAAGGGACGATCGAACTTTTGCCGTTAATTCCATCGCCCTAATCCACCTCCCTAGTAAAAACGATCGGGCTTCTTCATC
CTCAGCGATCCTACTTCAAGGGGACGATCGGGTTTTTGTCGTCAGATCCACCTTCCTAGTAAAAGCGATCGAGCTTCTTCGTTGTCGGGCTTCTTCGTCGTCGAACTTTT
TTGCCTTCAGCGATCAAACTTCTTCTAGGGAACGATCGACCTTTTGGCATCAGATCCAACGCTCTAGTAAAAGCGATCGAACCTCTTCGCTGTCAAGCTTTTTTGTCGTC
AGTGATCCAACTTCTTTTAGGGAACGATCGGGCTTTTACCATCAAATCTACCTTCCTAGTAGAAGCAATCAGGCTTGTTCGTCGTCATCGATCCAACTTCTTCTAGGGGA
TGATCGAGCTTTTGCCGTCAGATCCACCACCCTAGTAGAAGTGATCGGCCTTCTTCGTCGTCGGTTTTCTTCGTTGTCAGCGATCCAACTTCTTTTAAGGGGCGATCGGA
CTTTTGCCATCCACCTCCCTAGTAGAAACGATCAGACTTCTTCATCGTCAGCAATCCAACTTCTTCTACAGAACGATCGGGCTTTTGTCGTTAGATCTACCTTACTAGTA
GAAGCCAATGCATTGTTTTATGTCATAGACTCATGGACCACTTACACGTTGTTACTAGGTTGTCCTTGGATTCATGGAAATAGAGTAGTAACTTCAACACTGCATCAGTG
CTTCAAATTTTATCAAGACGGTGTAAAGAAGGTTGAAGTCGACTCTAATCCGTTCTTAGAGGCTAAGTCTCACTTCGCATATGCAAAGTTTTGTCCGAAGAATGATATTA
TCTTAGAAGCTTTGCCAGTAGAAACCCCTCTTATAAAAGGAAAAGATAATTCACATAAATCACTTGCAACCACAGAACCACATGAAAGTGTGGGAACCTTTAACTTTGGA
AAGGGTGAAGCATACACTAGCAACACAAAGAGTGTGATCTTGAAGGATGAAAAGGCTGCAAACCCACCAGTTTTGCACTATGTTCCTCTGTCAATGCGCAAGATAGACGA
ATCACCACTCGTTGAGTCTCCAAAAGGTTTGAAGGTTGATGACATTGAAGTCTTAAAAGAAAGCTTCACTACAAAATTAACTAAGATAACGAAGCAGGAGCTTTTACAAG
AAGGACATGCTATACACGTGTCGAGAAAAGGGCTCGGATACAAGTCGCAAGAGTCGATCCATATAACTAGAAAGGGGAAGGAAAGAGTGGTTGATAGCAATCATATAGCT
ATAGAGGAGGTTGACAATACTGACGAAAAAGAAGGCGATAATCAGATAACCTCAACATTTGATCGAATTAGACCACATGTTGCATGCACCCTAGCTAACTATGACTCTCA
TAAAGGAGAAAAGCACATGTCAGGCCTCAACAACTACAAGACCATCAGCCTTTCAAAGGCTAGTAAGGAGTTTCCTTGCAAGACAAAAGAAGAGGGAGAAATTCATAGCA
ATGTTCCTTCACGAATGAAAAGAAAAACATTTATTACTCTCAATAGAAGTCAAGGTTCGTTGAAGGTAAAAAGACATAATGTTATACTAACGAATCCTGAAAAAGAAGAG
TCAGAACAAGGGAAAGGTGAAACTCCGTGTCATCACATCACCATTATTGAGGAATCAGAGACTAAAACTCATGCAGAAGACGTAGAAAATGTCCCACAGAGTTCAGAGGA
TGGTGGCCAATATACTGTAGATGAGCTAAAAGAGGTGAACCTTGGTAAAATAGAGGAACCACACCCGACTTTGGTTAGTGCATTCCTCTCTAACGAGGAGGAGGATAAAT
ACATGAGTTTGCTCACCAGATACAGGAACATCTTTGCTTGGTCGTACAAGGAGATGCCAGGACTTGATCCAAAAGTAGCAGTCCATCATCTTGTAATTAAACTAGGGTAT
CGATTGATTAAACAAGCACAACGACGTTTTCAAGCGGAGCTTATCCCTCAAATCGAGAACGGACAACTTCGCGTTAGTGTAGACTTTTGCGACCTGAATAATGCATGCCA
TAAAGATGATTTTCCCTTACCCATCACAGAAATCATGGTTGATGCAACTACTGGACACGAGCGTGTCATGCAGAAAATCTTTGATGATATGTTGCATAAGTATGTTGAAT
GTTATGTTGATGACCTTGTAATCAAATCAAAGAGACGACAAGACCATTTGAAGGATCTAAAAGTTGTGTTCGACCGCTTGCAAAAATATAAGTTAAGGATGAACTCTCTC
AAGTGTGCGTTTGGTGTGACTTCAGGAAAGTTTCTTGGATTCATTGTAAGACATCGGGGGATCGAAATAGACCAGTCCAAGATTAATGTCATTCAAAAGATGCCAAGACC
TAAGACTTGTCAAAATGCTTTTGATAGCATAAAGAAATACTTGCTTACTCCCTCAGTGTTGGGAGTTCCAATATCTGGCAAGCCATTAATATTGTACATTGCTACACAAG
AAAAGTCCTTACGAGCATTACTGGCACAAGAGGAGAAAAAGGGAAAGGAACGTGCTCTCTACTATCTAAGCAGAACATTAATTGGAGCTGAAGTTAACTATTCCCTCATC
GGGAAGATGTGCCTCGCACTTTTCTTTGCCATTGATAAGTTAAGGCATTATATGCAGGCCTTCATGTTTCATCCAGTGGCAAAAGCAGACCATATAAAGTATGTTCTGTC
CAGGCCAATCATCTCTGAATACTTAGCCAAATGGGCGGTTCTACTCCAACAATATGACATTGTCTATATTCTCCAAAAGGCGATAAAAAGACATGCGCTAGCAGACTTTT
TAGCAGACCACCCAATTCCTTCAGATTGGAAGTTATGTGTAAACCTGTCAAACGATGAAGTTCTCTACACGGAAGTTATGGAACCTTGGACTATGTATTTCGATGGTGCA
GCGCGAATAAGTGGTGCAGGGGCAAGCATCGTCCTCATTTCTCTTGAGAAGCATATGTTGCCTTATAACTTTGTACTTGTTGAACGATGTTCAAACAATGTGGCTAAATA
TTCGGCTTTGATAATTGACCTTCAAATGACATTGGAAATCGAAGTATCACTCATAGAGATACGTAGGCAGCTTAAGAAAAACCTTAAGCCCAGTCCAAAAAAAATAAAAA
AATAA
Protein sequenceShow/hide protein sequence
MILPSTIQLPLRDDRTFAVNSIALIHLPSKNDRASSSSAILLQGDDRVFVVRSTFLVKAIELLRCRASSSSNFFAFSDQTSSRERSTFWHQIQRSSKSDRTSSLSSFFVV
SDPTSFRERSGFYHQIYLPSRSNQACSSSSIQLLLGDDRAFAVRSTTLVEVIGLLRRRFSSLSAIQLLLRGDRTFAIHLPSRNDQTSSSSAIQLLLQNDRAFVVRSTLLV
EANALFYVIDSWTTYTLLLGCPWIHGNRVVTSTLHQCFKFYQDGVKKVEVDSNPFLEAKSHFAYAKFCPKNDIILEALPVETPLIKGKDNSHKSLATTEPHESVGTFNFG
KGEAYTSNTKSVILKDEKAANPPVLHYVPLSMRKIDESPLVESPKGLKVDDIEVLKESFTTKLTKITKQELLQEGHAIHVSRKGLGYKSQESIHITRKGKERVVDSNHIA
IEEVDNTDEKEGDNQITSTFDRIRPHVACTLANYDSHKGEKHMSGLNNYKTISLSKASKEFPCKTKEEGEIHSNVPSRMKRKTFITLNRSQGSLKVKRHNVILTNPEKEE
SEQGKGETPCHHITIIEESETKTHAEDVENVPQSSEDGGQYTVDELKEVNLGKIEEPHPTLVSAFLSNEEEDKYMSLLTRYRNIFAWSYKEMPGLDPKVAVHHLVIKLGY
RLIKQAQRRFQAELIPQIENGQLRVSVDFCDLNNACHKDDFPLPITEIMVDATTGHERVMQKIFDDMLHKYVECYVDDLVIKSKRRQDHLKDLKVVFDRLQKYKLRMNSL
KCAFGVTSGKFLGFIVRHRGIEIDQSKINVIQKMPRPKTCQNAFDSIKKYLLTPSVLGVPISGKPLILYIATQEKSLRALLAQEEKKGKERALYYLSRTLIGAEVNYSLI
GKMCLALFFAIDKLRHYMQAFMFHPVAKADHIKYVLSRPIISEYLAKWAVLLQQYDIVYILQKAIKRHALADFLADHPIPSDWKLCVNLSNDEVLYTEVMEPWTMYFDGA
ARISGAGASIVLISLEKHMLPYNFVLVERCSNNVAKYSALIIDLQMTLEIEVSLIEIRRQLKKNLKPSPKKIKK