; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011040 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011040
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr10:6389778..6396263
RNA-Seq ExpressionCmoCh10G011040
SyntenyCmoCh10G011040
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0019538 - protein metabolic process (biological process)
GO:0005622 - intracellular (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140657 - ATP-dependent activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR005162 - Retrotransposon gag domain
IPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025848.1 pol protein [Cucumis melo var. makuwa]0.0e+0059.03Show/hide
Query:  QTMLDTQQATLLATQQVAAMATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETT
        Q ML    A  LA QQ  A   Q   A A   A  V  Q     AP+    V +Q       +  A  L+DFR+YNP+ F GS ++PT AQMW+ ++ET 
Subjt:  QTMLDTQQATLLATQQVAAMATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETT

Query:  FESMRCPDEHKVACATYVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEK
        F  M+CP++ KV CA + L+     WW   ++ +       TWE FKE F   ++    +  K QEF +L QGD TV+QYD +F  L RFAP +   E  
Subjt:  FESMRCPDEHKVACATYVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEK

Query:  QTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPE---EKKTEPTVATGRKRPVEVDTTEFQPPSQRPR--YQNRPPAPPPIGRYLAMEKPLCRN
        +TEKFV GL    + ++ A  P T+ +ALR A  L  P      K      A G+KR VE         + R    +Q         GR L  E P C  
Subjt:  QTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPE---EKKTEPTVATGRKRPVEVDTTEFQPPSQRPR--YQNRPPAPPPIGRYLAMEKPLCRN

Query:  CGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVAL
        CG+ H GRCLAGSG+C+ C   GH A  CP K   F   P        P      Q + + TT +EA  +GTVVTGTL ILGH+A  LFDSGS+HSF++ 
Subjt:  CGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVAL

Query:  PFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTT
         FV+  G  +EPL   LSV TP+G  L++K++++  +V IA + + V L V+DM DFDVILGMDWL+ N A IDC+ KEV+F PP+  +FKF+G      
Subjt:  PFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTT

Query:  PKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGF
        PK+IS MKA +L+ QG W  LA  V+ +  E  + + PVV E+ DVFP++LPG+PP REVDF I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGF
Subjt:  PKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGF

Query:  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN
        IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++I++ DIPKTAFR+RYGHYEFVVMSFGLTN
Subjt:  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN

Query:  APAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVT
        APAVFM+LMNRVFKE LD FVIVFIDDILIYSKT+ EH+EHL + L  LR N+LYAKF+KCEFWLR+V+FLGHVVS +G+ VDP KIEAVT W RP+TV+
Subjt:  APAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVT

Query:  EIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY
        EIRSFLGLAGYYRRFV+DF+RIA+PLTQLT+KG PFVW   CE SFQELKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +
Subjt:  EIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY

Query:  KKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTD
        ++NYPTHDLELAAVVF LKIWRHYLYGEK QIYTDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A IT++  +  D
Subjt:  KKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTD

Query:  FEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMY
        FE+A+IAV   EV AQ+A+LTV+PTLRQ+II +Q +DP L++    +     +GF+ S+DDGL+ +GRLCVP  S +K E+LTEAH+S F++HPG TKMY
Subjt:  FEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMY

Query:  QDLKKHFWWRSMKKDIAE
        QDL+  +WWR MK+D+A+
Subjt:  QDLKKHFWWRSMKKDIAE

KAA0025848.1 pol protein [Cucumis melo var. makuwa]8.4e-0368.75Show/hide
Query:  SPLCWDEVGERELIGPELVRVTNEAIQKIRIR
        SP+CW EVGE+ ++GPELV+ TN AIQKIR R
Subjt:  SPLCWDEVGERELIGPELVRVTNEAIQKIRIR

KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0060.07Show/hide
Query:  PANAVAPQPVDAVAIQPVNNPH--PNR---GANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNKQSI
        PA A AP P  A A  PV  PH  P++    A  L+DFR+YNP  F GS EDPT AQMW++++ET F  M+CP++ KV CA ++L      WW   ++ +
Subjt:  PANAVAPQPVDAVAIQPVNNPH--PNR---GANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNKQSI

Query:  NPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTA--
               TW+ FKE+F   ++    R  K+QEF +L QGD TV+QYD +F  L RFAP +  TE  + +KFV GL    + ++ AF P T+ +ALR A  
Subjt:  NPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTA--

Query:  KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPT
         +L+E     KT    +T G+KR     PV V    F+   +  R+Q +P       R     KPLC  CGK H+G CL G+  C+ C    H A  CP 
Subjt:  KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPT

Query:  KSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKD
        +  G  +    G            Q + + T   EA  +GTVVTGTL ++GH+AL LFDSGS+HSF++  FV  A   +EPL H LSV TP+G  +++K+
Subjt:  KSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKD

Query:  RVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKE
        +V+  Q+ IAG  I V L V+DM DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A +L+ QG W  LA  V+T+  +
Subjt:  RVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKE

Query:  KPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
          +   PVV ++ DVFPE+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  KPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIY
        NK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LD FVIVFIDDILIY
Subjt:  NKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIY

Query:  SKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK
        SKT++EH+EHLR  L  LR+N+LYAKF KCEFWL+QVSFLGHVVSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRRFV++F+RIATPLTQLT+
Subjt:  SKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK

Query:  KGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQ
        KG PFVW   CE SFQ LKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +++NYPTHDLELAAVVF LKIWRHYLYGEK Q
Subjt:  KGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQ

Query:  IYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRII
        I+TDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A ITR+  +  D E+A+IAV    V  Q+A+LTV+PTLRQRII
Subjt:  IYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRII

Query:  DSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        D+Q  DP L +      VG    F+ S+D GLL + RLCVP  S IK E+L+EAH+S FS+HPG TKMYQDLK+ +WWR+MK+++AE
Subjt:  DSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]0.0e+0094.15Show/hide
Query:  MATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVL
        MATQPANA+ATQPAN VATQPANAVAPQPVDAVAI PVNNPHP+RGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKV CATYVL
Subjt:  MATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVL

Query:  QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA
        QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKE RIKKQQEFN LTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA
Subjt:  QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA

Query:  FNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH
        FNPKTYEEALRTAKALEEPPEEKKTEPTVA GRKRPVEVDT EFQPPSQRPRYQ+RPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH
Subjt:  FNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH

Query:  VARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAG
        VARTCPTKSPG PREPLRGPVIREPTLQT PQTKAYVTT  EAGTSGTVVTGTL+ILGHFALTLFDSGSTHSFVA PF+KQAGFVIEPLMHALSVGTPAG
Subjt:  VARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAG

Query:  VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA
        VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA
Subjt:  VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA

Query:  VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLE GTGPISKAPYRMAPAELKELK QLQDLLD                    KD SMRL
Subjt:  VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF
        CI YRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF
Subjt:  CIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF

Query:  IDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIAT
        IDDILIYSKTDL+HQEHLRKALTILREN+LYA FTKCEFW+ QVSFLGH+VSKDGIFVDPNKIEAVTK KRPTTVTEIRSFLGL GYYRRFV DFARIAT
Subjt:  IDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIAT

Query:  PLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHY
        PLTQLTKKGVPFVWDDTCE SFQELKQRLVS PVLTVPE SVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY+KNYPTHDLELAAVVF LKIWRHY
Subjt:  PLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHY

Query:  LYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRP
         YGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSA ITREPR RTDFEQADI VVTKEVAAQ+AR+TVRP
Subjt:  LYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRP

Query:  TLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        TLRQRIIDSQREDPSLSKILDQL VGPVDGFTKSTDDGLLCQGRLCVPP+SGIKNEILTEAHNS FSIHPGGTKMYQDLKKHFWWRSMKKDIAE
Subjt:  TLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]0.0e+0085.55Show/hide
Query:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        GDRTVDQYDQDFMRLRRFAPSL DT+EKQTEKFVLGLN KTRR+LEAFNPKTYEEALRTAKALE+PPEEKKTEPTV T RKRPVEVDTTEFQPP QRPRY
Subjt:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

Query:  QNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGT
        Q R P+PPPI RYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPG PREPLRGPVIREPTLQT PQTKAYVTT KEAGTSGTVVTGT
Subjt:  QNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGT

Query:  LAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHK
        L+ILGHFALTLF+S STHSFVALPFVKQAGFV+EPLMHALSVGTPAGVDLVTK+RV+DGQVVIAGQTIH+DLKVVDM  FD ILGMDWLAENFATIDCHK
Subjt:  LAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHK

Query:  KEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAP
        KEVIFTPPN LTFKFKGTSTGTTPKIISMMKARRLIQQG  AFLA AVNTKGKEKPIDT+PVVNEFMDVFPEDLPGIPPS+EVDFGIDLEP TGPISKAP
Subjt:  KEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAP

Query:  YRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDI
        YRMAPAELKELK QLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLR GYHQIKIKNEDI
Subjt:  YRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDI

Query:  PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSK
        PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD FVIVFIDDILIYSKTDLEHQEHLRKALTILREN+LYAKFT+CEFWLRQVSFLGHVVSK
Subjt:  PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSK

Query:  DGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKK
         GIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVS PVLTVPE SVGY IYSDASKK
Subjt:  DGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKK

Query:  GLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVA
        GLGCVLMQHGKVVAYAS QL DY+KNYPTHDLELAAVVF LKIWRHYLYGE +QIYTDHKSLKYLFTQKELNMRQR+WLELVKDYDIDIQYHPGKANVVA
Subjt:  GLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVA

Query:  DALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGI
        DALSRKVVHSSA ITREPRVRTDFEQADIA                                                                      
Subjt:  DALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGI

Query:  KNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
                      I+P GTKMYQDLK+HFWWRSMKKDIAE
Subjt:  KNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]3.0e+0092.86Show/hide
Query:  MQTMLDTQQATLLATQQVAAMATQPANA
        MQTMLDTQQATLLATQQV  MATQPANA
Subjt:  MQTMLDTQQATLLATQQVAAMATQPANA

XP_022951914.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111454654 [Cucurbita moschata]0.0e+0060.09Show/hide
Query:  PANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNK
        PA A A  P  A AP PV  VA Q V +   +  A  L+DFR+YNP  F GS EDPT AQMW++++ET F  M+CP++ KV CA ++L      WW   +
Subjt:  PANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNK

Query:  QSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRT
        + +       TW+ FKE+F   ++    R  K+QEF +L QGD TV+QYD +F  L RFAP +  TE  + +KFV GL    + ++ AF P T+ +ALR 
Subjt:  QSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRT

Query:  A--KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVART
        A   +L+E     KT    +T G+KR     PV V    F+P  +   +Q +P       R     KPLC  CGK H+GRCL G+  C+ C   GH A  
Subjt:  A--KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVART

Query:  CPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLV
        CP +  G  +    G            Q + + T   EA  +GTVVTGTL +LGH+AL LFDSGS+HSF++  FV  A   +EPL H LSV TP+G  ++
Subjt:  CPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLV

Query:  TKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTK
        +K++V+  Q+ IAG  I V L V+DM DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A +L+ QG W  LA  V+T+
Subjt:  TKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTK

Query:  GKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY
          +  + + PVV ++ DVFPE+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY
Subjt:  GKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY

Query:  RELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDI
        RELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LD FVIVFIDDI
Subjt:  RELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDI

Query:  LIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQ
        LIYSKT+ EH+EHLR  L  LR+N+LYAKF+KCEFWL+QVSFLGHVVSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRRFV++F+RIATPLTQ
Subjt:  LIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQ

Query:  LTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGE
        LT+KG PFVW   CE SFQ LKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +++NYPTHDLELAAVVF LKIWRHYLYGE
Subjt:  LTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGE

Query:  KTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQ
        K QI+TDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A ITR+  +  D E+A+IAV    V  Q+A+LTV+PTLRQ
Subjt:  KTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQ

Query:  RIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        RIID+Q  DP L +       G    F+ S+D GLL + RLCVP  S +K E+L+EAH+S FS+HPG TKMYQDLK+ +WWR+MK+++AE
Subjt:  RIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

TrEMBL top hitse value%identityAlignment
A0A5A7T190 Reverse transcriptase0.0e+0060.07Show/hide
Query:  PANAVAPQPVDAVAIQPVNNPH--PNR---GANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNKQSI
        PA A AP P  A A  PV  PH  P++    A  L+DFR+YNP  F GS EDPT AQMW++++ET F  M+CP++ KV CA ++L      WW   ++ +
Subjt:  PANAVAPQPVDAVAIQPVNNPH--PNR---GANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNKQSI

Query:  NPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTA--
               TW+ FKE+F   ++    R  K+QEF +L QGD TV+QYD +F  L RFAP +  TE  + +KFV GL    + ++ AF P T+ +ALR A  
Subjt:  NPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTA--

Query:  KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPT
         +L+E     KT    +T G+KR     PV V    F+   +  R+Q +P       R     KPLC  CGK H+G CL G+  C+ C    H A  CP 
Subjt:  KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPT

Query:  KSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKD
        +  G  +    G            Q + + T   EA  +GTVVTGTL ++GH+AL LFDSGS+HSF++  FV  A   +EPL H LSV TP+G  +++K+
Subjt:  KSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKD

Query:  RVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKE
        +V+  Q+ IAG  I V L V+DM DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A +L+ QG W  LA  V+T+  +
Subjt:  RVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKE

Query:  KPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
          +   PVV ++ DVFPE+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL
Subjt:  KPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYREL

Query:  NKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIY
        NK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LD FVIVFIDDILIY
Subjt:  NKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIY

Query:  SKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK
        SKT++EH+EHLR  L  LR+N+LYAKF KCEFWL+QVSFLGHVVSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRRFV++F+RIATPLTQLT+
Subjt:  SKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTK

Query:  KGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQ
        KG PFVW   CE SFQ LKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +++NYPTHDLELAAVVF LKIWRHYLYGEK Q
Subjt:  KGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQ

Query:  IYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRII
        I+TDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A ITR+  +  D E+A+IAV    V  Q+A+LTV+PTLRQRII
Subjt:  IYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRII

Query:  DSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        D+Q  DP L +      VG    F+ S+D GLL + RLCVP  S IK E+L+EAH+S FS+HPG TKMYQDLK+ +WWR+MK+++AE
Subjt:  DSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

A0A5D3CQB5 Reverse transcriptase0.0e+0059.03Show/hide
Query:  QTMLDTQQATLLATQQVAAMATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETT
        Q ML    A  LA QQ  A   Q   A A   A  V  Q     AP+    V +Q       +  A  L+DFR+YNP+ F GS ++PT AQMW+ ++ET 
Subjt:  QTMLDTQQATLLATQQVAAMATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETT

Query:  FESMRCPDEHKVACATYVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEK
        F  M+CP++ KV CA + L+     WW   ++ +       TWE FKE F   ++    +  K QEF +L QGD TV+QYD +F  L RFAP +   E  
Subjt:  FESMRCPDEHKVACATYVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEK

Query:  QTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPE---EKKTEPTVATGRKRPVEVDTTEFQPPSQRPR--YQNRPPAPPPIGRYLAMEKPLCRN
        +TEKFV GL    + ++ A  P T+ +ALR A  L  P      K      A G+KR VE         + R    +Q         GR L  E P C  
Subjt:  QTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPE---EKKTEPTVATGRKRPVEVDTTEFQPPSQRPR--YQNRPPAPPPIGRYLAMEKPLCRN

Query:  CGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVAL
        CG+ H GRCLAGSG+C+ C   GH A  CP K   F   P        P      Q + + TT +EA  +GTVVTGTL ILGH+A  LFDSGS+HSF++ 
Subjt:  CGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVAL

Query:  PFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTT
         FV+  G  +EPL   LSV TP+G  L++K++++  +V IA + + V L V+DM DFDVILGMDWL+ N A IDC+ KEV+F PP+  +FKF+G      
Subjt:  PFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTT

Query:  PKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGF
        PK+IS MKA +L+ QG W  LA  V+ +  E  + + PVV E+ DVFP++LPG+PP REVDF I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGF
Subjt:  PKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGF

Query:  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN
        IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++I++ DIPKTAFR+RYGHYEFVVMSFGLTN
Subjt:  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTN

Query:  APAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVT
        APAVFM+LMNRVFKE LD FVIVFIDDILIYSKT+ EH+EHL + L  LR N+LYAKF+KCEFWLR+V+FLGHVVS +G+ VDP KIEAVT W RP+TV+
Subjt:  APAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVT

Query:  EIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY
        EIRSFLGLAGYYRRFV+DF+RIA+PLTQLT+KG PFVW   CE SFQELKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +
Subjt:  EIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY

Query:  KKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTD
        ++NYPTHDLELAAVVF LKIWRHYLYGEK QIYTDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A IT++  +  D
Subjt:  KKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTD

Query:  FEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMY
        FE+A+IAV   EV AQ+A+LTV+PTLRQ+II +Q +DP L++    +     +GF+ S+DDGL+ +GRLCVP  S +K E+LTEAH+S F++HPG TKMY
Subjt:  FEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMY

Query:  QDLKKHFWWRSMKKDIAE
        QDL+  +WWR MK+D+A+
Subjt:  QDLKKHFWWRSMKKDIAE

A0A5D3CQB5 Reverse transcriptase4.1e-0368.75Show/hide
Query:  SPLCWDEVGERELIGPELVRVTNEAIQKIRIR
        SP+CW EVGE+ ++GPELV+ TN AIQKIR R
Subjt:  SPLCWDEVGERELIGPELVRVTNEAIQKIRIR

A0A6J1EYH9 Reverse transcriptase0.0e+0094.15Show/hide
Query:  MATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVL
        MATQPANA+ATQPAN VATQPANAVAPQPVDAVAI PVNNPHP+RGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKV CATYVL
Subjt:  MATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVL

Query:  QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA
        QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKE RIKKQQEFN LTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA
Subjt:  QKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA

Query:  FNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH
        FNPKTYEEALRTAKALEEPPEEKKTEPTVA GRKRPVEVDT EFQPPSQRPRYQ+RPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH
Subjt:  FNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGH

Query:  VARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAG
        VARTCPTKSPG PREPLRGPVIREPTLQT PQTKAYVTT  EAGTSGTVVTGTL+ILGHFALTLFDSGSTHSFVA PF+KQAGFVIEPLMHALSVGTPAG
Subjt:  VARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAG

Query:  VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA
        VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA
Subjt:  VDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYA

Query:  VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLE GTGPISKAPYRMAPAELKELK QLQDLLD                    KD SMRL
Subjt:  VNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF
        CI YRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF
Subjt:  CIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVF

Query:  IDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIAT
        IDDILIYSKTDL+HQEHLRKALTILREN+LYA FTKCEFW+ QVSFLGH+VSKDGIFVDPNKIEAVTK KRPTTVTEIRSFLGL GYYRRFV DFARIAT
Subjt:  IDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIAT

Query:  PLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHY
        PLTQLTKKGVPFVWDDTCE SFQELKQRLVS PVLTVPE SVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDY+KNYPTHDLELAAVVF LKIWRHY
Subjt:  PLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHY

Query:  LYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRP
         YGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSA ITREPR RTDFEQADI VVTKEVAAQ+AR+TVRP
Subjt:  LYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRP

Query:  TLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        TLRQRIIDSQREDPSLSKILDQL VGPVDGFTKSTDDGLLCQGRLCVPP+SGIKNEILTEAHNS FSIHPGGTKMYQDLKKHFWWRSMKKDIAE
Subjt:  TLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

A0A6J1GK52 Reverse transcriptase0.0e+0085.55Show/hide
Query:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY
        GDRTVDQYDQDFMRLRRFAPSL DT+EKQTEKFVLGLN KTRR+LEAFNPKTYEEALRTAKALE+PPEEKKTEPTV T RKRPVEVDTTEFQPP QRPRY
Subjt:  GDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRY

Query:  QNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGT
        Q R P+PPPI RYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPG PREPLRGPVIREPTLQT PQTKAYVTT KEAGTSGTVVTGT
Subjt:  QNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGT

Query:  LAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHK
        L+ILGHFALTLF+S STHSFVALPFVKQAGFV+EPLMHALSVGTPAGVDLVTK+RV+DGQVVIAGQTIH+DLKVVDM  FD ILGMDWLAENFATIDCHK
Subjt:  LAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHK

Query:  KEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAP
        KEVIFTPPN LTFKFKGTSTGTTPKIISMMKARRLIQQG  AFLA AVNTKGKEKPIDT+PVVNEFMDVFPEDLPGIPPS+EVDFGIDLEP TGPISKAP
Subjt:  KEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAP

Query:  YRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDI
        YRMAPAELKELK QLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLR GYHQIKIKNEDI
Subjt:  YRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDI

Query:  PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSK
        PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLD FVIVFIDDILIYSKTDLEHQEHLRKALTILREN+LYAKFT+CEFWLRQVSFLGHVVSK
Subjt:  PKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSK

Query:  DGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKK
         GIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVS PVLTVPE SVGY IYSDASKK
Subjt:  DGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKK

Query:  GLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVA
        GLGCVLMQHGKVVAYAS QL DY+KNYPTHDLELAAVVF LKIWRHYLYGE +QIYTDHKSLKYLFTQKELNMRQR+WLELVKDYDIDIQYHPGKANVVA
Subjt:  GLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVA

Query:  DALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGI
        DALSRKVVHSSA ITREPRVRTDFEQADIA                                                                      
Subjt:  DALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGI

Query:  KNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
                      I+P GTKMYQDLK+HFWWRSMKKDIAE
Subjt:  KNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

A0A6J1GK52 Reverse transcriptase1.5e+0092.86Show/hide
Query:  MQTMLDTQQATLLATQQVAAMATQPANA
        MQTMLDTQQATLLATQQV  MATQPANA
Subjt:  MQTMLDTQQATLLATQQVAAMATQPANA

A0A6J1GK52 Reverse transcriptase0.0e+0060.09Show/hide
Query:  PANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNK
        PA A A  P  A AP PV  VA Q V +   +  A  L+DFR+YNP  F GS EDPT AQMW++++ET F  M+CP++ KV CA ++L      WW   +
Subjt:  PANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDEHKVACATYVLQKDAEVWWSDNK

Query:  QSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRT
        + +       TW+ FKE+F   ++    R  K+QEF +L QGD TV+QYD +F  L RFAP +  TE  + +KFV GL    + ++ AF P T+ +ALR 
Subjt:  QSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEAFNPKTYEEALRT

Query:  A--KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVART
        A   +L+E     KT    +T G+KR     PV V    F+P  +   +Q +P       R     KPLC  CGK H+GRCL G+  C+ C   GH A  
Subjt:  A--KALEEPPEEKKTEPTVAT-GRKR-----PVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVART

Query:  CPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLV
        CP +  G  +    G            Q + + T   EA  +GTVVTGTL +LGH+AL LFDSGS+HSF++  FV  A   +EPL H LSV TP+G  ++
Subjt:  CPTKSPGFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLV

Query:  TKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTK
        +K++V+  Q+ IAG  I V L V+DM DFDVILGMDWLA N A+IDC +KEV F PP+  +FKFKG  + + P++IS ++A +L+ QG W  LA  V+T+
Subjt:  TKDRVRDGQVVIAGQTIHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTK

Query:  GKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY
          +  + + PVV ++ DVFPE+LPG+PP REV+F I+LEPGT PIS+APYRMAPAELKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY
Subjt:  GKEKPIDTIPVVNEFMDVFPEDLPGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDY

Query:  RELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDI
        RELNK TVKN+YPLPRI+DLFDQL+GATVFSKIDLRSGYHQ++IK+ED+PKTAFR+RYGHYEF+VMSFGLTNAPAVFM+LMNRVF+E LD FVIVFIDDI
Subjt:  RELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDI

Query:  LIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQ
        LIYSKT+ EH+EHLR  L  LR+N+LYAKF+KCEFWL+QVSFLGHVVSK G+ VDP KIEAVT W RP+TV+E+RSFLGLAGYYRRFV++F+RIATPLTQ
Subjt:  LIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQ

Query:  LTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGE
        LT+KG PFVW   CE SFQ LKQ+LV+ PVLTVP+GS  + IYSDASKKGLGCVLMQ GKVVAYAS QLK +++NYPTHDLELAAVVF LKIWRHYLYGE
Subjt:  LTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGE

Query:  KTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQ
        K QI+TDHKSLKY FTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRKV HS+A ITR+  +  D E+A+IAV    V  Q+A+LTV+PTLRQ
Subjt:  KTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQ

Query:  RIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE
        RIID+Q  DP L +       G    F+ S+D GLL + RLCVP  S +K E+L+EAH+S FS+HPG TKMYQDLK+ +WWR+MK+++AE
Subjt:  RIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.7e-8933.22Show/hide
Query:  DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFL----AYAVNTKGKEKPIDTIPVVNEFMDVFPEDL
        ++D++LG   LAE  ATI    +EV          +   T   +  + ++M+    L Q    + +     Y +     E+      ++ ++ D+   + 
Subjt:  DFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFL----AYAVNTKGKEKPIDTIPVVNEFMDVFPEDL

Query:  PGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKRTVKNKYPLPRIE
          +  + +    I+ +      SK  Y  A  +  E++ Q+QD+L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN+ TV +++P+P ++
Subjt:  PGIPPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKRTVKNKYPLPRIE

Query:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKAL
        ++  +L     F+ IDL  G+HQI++  E + KTAF T++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + L    
Subjt:  DLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKAL

Query:  TILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF-VWDDTCEAS
          L +  L  +  KCEF  ++ +FLGHV++ DGI  +P KIEA+ K+  PT   EI++FLGL GYYR+F+ +FA IA P+T+  KK +     +   +++
Subjt:  TILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF-VWDDTCEAS

Query:  FQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQ
        F++LK  +   P+L VP+ +  + + +DAS   LG VL Q G  ++Y S  L +++ NY T + EL A+V+  K +RHYL G   +I +DH+ L +L+  
Subjt:  FQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQ

Query:  KELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTK
        K+ N +  RW   + ++D DI+Y  GK N VADALSR +     +++ + +   + + +D+  +T+
Subjt:  KELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTK

P0CT34 Transposon Tf2-1 polyprotein3.5e-8431.83Show/hide
Query:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV
        +  EF D+  E +   +P P + ++F ++L      +    Y + P +++ +  ++   L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV

Query:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL
         N YPLP IE L  +++G+T+F+K+DL+S YH I+++  D  K AFR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK++ 
Subjt:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL

Query:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF
        EH +H++  L  L+   L     KCEF   QV F+G+ +S+ G       I+ V +WK+P    E+R FLG   Y R+F+   +++  PL  L KK V +
Subjt:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF

Query:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK
         W  T   + + +KQ LVS PVL   + S    + +DAS   +G VL Q         V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK

Query:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR
         +I TDH++L    T +    N R  RW   ++D++ +I Y PG AN +ADALSR V  +      EP +  D E   I  V         ++++    +
Subjt:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR

Query:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN
         +++     D  L  +L+       +       DGLL   +  + +P  + +   I+ + H     IHPG   +   + + F W+ ++K I E  QN
Subjt:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN

P0CT35 Transposon Tf2-2 polyprotein3.5e-8431.83Show/hide
Query:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV
        +  EF D+  E +   +P P + ++F ++L      +    Y + P +++ +  ++   L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV

Query:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL
         N YPLP IE L  +++G+T+F+K+DL+S YH I+++  D  K AFR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK++ 
Subjt:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL

Query:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF
        EH +H++  L  L+   L     KCEF   QV F+G+ +S+ G       I+ V +WK+P    E+R FLG   Y R+F+   +++  PL  L KK V +
Subjt:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF

Query:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK
         W  T   + + +KQ LVS PVL   + S    + +DAS   +G VL Q         V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK

Query:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR
         +I TDH++L    T +    N R  RW   ++D++ +I Y PG AN +ADALSR V  +      EP +  D E   I  V         ++++    +
Subjt:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR

Query:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN
         +++     D  L  +L+       +       DGLL   +  + +P  + +   I+ + H     IHPG   +   + + F W+ ++K I E  QN
Subjt:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN

P0CT41 Transposon Tf2-12 polyprotein3.5e-8431.83Show/hide
Query:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV
        +  EF D+  E +   +P P + ++F ++L      +    Y + P +++ +  ++   L  G IR S +    PV+FV KK+G++R+ +DY+ LNK   
Subjt:  VVNEFMDVFPE-DLPGIP-PSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTV

Query:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL
         N YPLP IE L  +++G+T+F+K+DL+S YH I+++  D  K AFR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK++ 
Subjt:  KNKYPLPRIEDLFDQLRGATVFSKIDLRSGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDL

Query:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF
        EH +H++  L  L+   L     KCEF   QV F+G+ +S+ G       I+ V +WK+P    E+R FLG   Y R+F+   +++  PL  L KK V +
Subjt:  EHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPF

Query:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK
         W  T   + + +KQ LVS PVL   + S    + +DAS   +G VL Q         V Y S ++   + NY   D E+ A++ +LK WRHYL    E 
Subjt:  VWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLMQHGK-----VVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYG--EK

Query:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR
         +I TDH++L    T +    N R  RW   ++D++ +I Y PG AN +ADALSR V  +      EP +  D E   I  V         ++++    +
Subjt:  TQIYTDHKSLKYLFTQKE--LNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITREPRVRTDFEQADIAVVTKEVAAQIARLTVRPTLR

Query:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN
         +++     D  L  +L+       +       DGLL   +  + +P  + +   I+ + H     IHPG   +   + + F W+ ++K I E  QN
Subjt:  QRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGR--LCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLKKHFWWRSMKKDIAELTQN

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-8438.57Show/hide
Query:  PISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSG
        PI    Y +A     E++ Q+Q++L++G IR S SP+ +P   V KK         R+ IDYR+LN+ T+ ++YP+P ++++  +L     F+ IDL  G
Subjt:  PISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLRSG

Query:  YHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLR
        +HQI++  E I KTAF T+ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  EH   ++   T L +  L  +  KCEF  +
Subjt:  YHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLR

Query:  QVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCE----ASFQELKQRLVSTPVLTVP
        + +FLGH+V+ DGI  +P K++A+  +  PT   EIR+FLGL GYYR+F+ ++A IA P+T   KK       DT +     +F++LK  ++  P+L +P
Subjt:  QVSFLGHVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCE----ASFQELKQRLVSTPVLTVP

Query:  EGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDY
        +    + + +DAS   LG VL Q+G  +++ S  L D++ NY   + EL A+V+  K +RHYL G +  I +DH+ L++L   KE   +  RW   + +Y
Subjt:  EGSVGYAIYSDASKKGLGCVLMQHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDY

Query:  DIDIQYHPGKANVVADALSR
           I Y  GK N VADALSR
Subjt:  DIDIQYHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.1e-2440.8Show/hide
Query:  HLRKALTILRENRLYAKFTKCEFWLRQVSFLG--HVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVW
        HL   L I  +++ YA   KC F   Q+++LG  H++S +G+  DP K+EA+  W  P   TE+R FLGL GYYRRFV+++ +I  PLT+L KK     W
Subjt:  HLRKALTILRENRLYAKFTKCEFWLRQVSFLG--HVVSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVW

Query:  DDTCEASFQELKQRLVSTPVLTVPE
         +    +F+ LK  + + PVL +P+
Subjt:  DDTCEASFQELKQRLVSTPVLTVPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAACTATGCTAGACACACAACAGGCCACGTTGTTGGCTACGCAACAAGTCGCGGCAATGGCCACGCAACCAGCCAACGCGGTGGCCACGCAACCAGCCAACGCGGT
GGCCACGCAACCAGCCAACGCGGTGGCCCCGCAACCGGTGGATGCGGTAGCTATCCAACCGGTGAACAACCCGCACCCGAATAGGGGCGCGAATTGGTTAAAGGATTTCA
GGCGGTATAACCCCCGTCCTTTTGTCGGTTCGAAGGAAGATCCCACAGCGGCTCAAATGTGGATCGCAAACATGGAAACGACCTTTGAATCAATGCGATGCCCTGACGAA
CATAAGGTGGCATGTGCGACATATGTGTTACAAAAGGATGCAGAAGTGTGGTGGTCGGATAATAAACAGAGCATCAACCCGGGTGGGGGAATTACAACATGGGAGACCTT
TAAGGAAGCCTTTCTAAAATACTATTATCCAAAGGAAACCCGCATAAAGAAACAGCAAGAGTTTAACCACTTAACCCAAGGTGATCGCACGGTGGATCAGTACGATCAGG
ACTTCATGAGATTAAGAAGGTTTGCACCGTCTTTAGCGGACACTGAAGAGAAACAGACAGAAAAATTTGTGTTAGGATTGAATCCGAAAACCCGCCGCATGTTAGAGGCC
TTCAACCCAAAAACCTATGAAGAGGCACTAAGGACGGCCAAGGCCCTTGAGGAACCCCCAGAGGAAAAGAAAACAGAGCCAACAGTTGCCACAGGGAGGAAACGCCCTGT
CGAGGTCGATACCACAGAATTCCAACCACCGTCCCAAAGGCCTCGGTATCAGAATAGGCCACCTGCTCCACCTCCAATAGGCCGATACCTAGCAATGGAGAAGCCCCTGT
GCCGTAACTGTGGAAAGCAACATGTTGGGAGATGTTTGGCGGGCTCAGGTATGTGTTATATCTGTGGTCATGCAGGTCATGTGGCCAGAACTTGCCCTACAAAGAGCCCG
GGATTTCCAAGGGAACCCCTTAGAGGACCGGTCATCCGAGAGCCCACCTTACAAACTCGTCCACAGACCAAGGCATATGTAACGACCGGTAAAGAGGCGGGAACATCTGG
CACCGTGGTGACAGGTACGCTTGCTATACTAGGACACTTTGCGTTGACATTGTTTGATTCTGGTTCTACCCATTCCTTTGTTGCTTTACCATTTGTTAAACAAGCAGGAT
TCGTAATAGAACCCTTAATGCATGCGTTGTCGGTCGGTACCCCGGCAGGGGTAGACCTAGTTACGAAAGATAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACC
ATCCACGTAGACTTAAAGGTAGTGGATATGACGGATTTTGACGTCATACTAGGAATGGACTGGTTAGCAGAAAACTTTGCTACCATAGACTGCCATAAGAAGGAGGTGAT
ATTCACACCTCCTAACGGACTTACCTTTAAGTTTAAAGGAACCTCCACAGGTACCACCCCAAAAATAATATCGATGATGAAAGCGAGACGCTTGATACAACAAGGGGGGT
GGGCGTTCCTAGCTTATGCAGTAAATACAAAAGGAAAGGAAAAACCCATAGATACAATACCGGTAGTAAACGAATTTATGGATGTCTTTCCGGAGGACCTCCCGGGAATT
CCTCCATCGCGAGAGGTAGACTTTGGAATAGATCTAGAACCAGGAACGGGACCCATCTCTAAGGCCCCGTACCGCATGGCACCAGCAGAACTAAAAGAGCTTAAAATACA
ACTGCAAGACCTGCTAGATAAGGGCTTCATTCGACCTAGCGTGTCCCCTTGGGGTGCGCCAGTGTTGTTTGTTAAAAAGAAAGACGGCTCAATGCGTTTATGCATTGATT
ATAGGGAATTAAACAAGAGGACGGTAAAGAACAAATACCCGTTACCCCGAATTGAGGACCTATTTGATCAACTACGTGGGGCGACAGTGTTTTCTAAAATAGATCTTCGA
TCAGGATACCATCAAATTAAGATTAAAAATGAAGACATACCGAAAACGGCCTTTCGAACCAGATATGGCCACTACGAGTTTGTGGTGATGTCTTTTGGTCTCACCAATGC
CCCAGCGGTATTTATGGAACTAATGAACCGAGTATTTAAGGAGTGCCTAGACTTGTTTGTGATCGTGTTCATAGACGATATCCTGATATACTCGAAAACGGACCTAGAAC
ACCAAGAACACCTTCGCAAAGCCTTAACTATCCTAAGAGAGAATAGGTTGTATGCCAAGTTCACCAAGTGTGAATTTTGGCTACGACAGGTTTCGTTTCTGGGGCACGTA
GTGTCTAAAGACGGAATCTTCGTAGATCCCAATAAGATAGAAGCGGTCACGAAATGGAAACGCCCAACAACGGTCACCGAGATACGAAGTTTCTTGGGATTGGCGGGTTA
TTATCGAAGGTTTGTCCAGGACTTCGCTAGAATAGCCACGCCTCTCACCCAATTAACAAAAAAAGGTGTACCTTTTGTTTGGGATGATACTTGTGAGGCCAGCTTTCAAG
AACTAAAACAACGACTAGTATCCACCCCAGTGCTCACAGTTCCAGAAGGTTCTGTGGGATACGCGATTTACAGTGACGCATCCAAAAAGGGATTAGGTTGTGTACTGATG
CAACATGGCAAGGTTGTTGCGTATGCGTCACACCAACTAAAAGATTACAAGAAAAACTATCCTACACATGATTTGGAGCTGGCAGCAGTAGTGTTCACGTTAAAAATTTG
GCGACACTACTTGTATGGCGAAAAGACCCAGATCTACACAGACCACAAAAGCCTAAAATATTTGTTCACGCAAAAAGAATTGAACATGCGGCAACGTAGATGGTTAGAGC
TGGTAAAAGATTACGACATAGACATCCAGTACCACCCTGGGAAAGCAAACGTAGTCGCTGATGCACTGAGCCGAAAAGTAGTACACTCGTCGGCCTTCATCACCAGGGAG
CCACGAGTGCGAACTGATTTCGAACAGGCTGATATCGCGGTAGTAACCAAAGAAGTTGCCGCCCAAATAGCCCGACTGACAGTGCGCCCCACGCTCAGACAAAGAATCAT
AGACTCACAACGTGAGGATCCAAGCCTAAGTAAAATCCTAGACCAATTGGCTGTTGGCCCAGTGGACGGATTCACTAAATCAACAGATGACGGATTATTATGCCAAGGGC
GTTTATGTGTCCCACCCATGAGCGGAATAAAGAATGAAATCCTAACGGAGGCACACAACTCAACCTTCTCGATACATCCAGGTGGAACAAAGATGTATCAAGACTTGAAA
AAACACTTCTGGTGGCGAAGTATGAAGAAGGATATCGCGGAGCTTACCCAAAACATTAAAGGGCTGTACGGTAATCTGGGTGGTCGTGGATCGCCTGACAAAGTCGGCTC
ACTTCCTACCGGGTCCCCCTTGTGTTGGGACGAGGTAGGAGAACGAGAATTGATAGGACCCGAGCTGGTCCGTGTCACCAACGAAGCAATTCAGAAAATCCGAATAAGGG
CAAGGCACCCATGTGACGACGACGACGGCATCGTACCCCCAGGATCGTGGCACATGCATAGGGTAGTGGGTTGTTCTTTTTCCATTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAACTATGCTAGACACACAACAGGCCACGTTGTTGGCTACGCAACAAGTCGCGGCAATGGCCACGCAACCAGCCAACGCGGTGGCCACGCAACCAGCCAACGCGGT
GGCCACGCAACCAGCCAACGCGGTGGCCCCGCAACCGGTGGATGCGGTAGCTATCCAACCGGTGAACAACCCGCACCCGAATAGGGGCGCGAATTGGTTAAAGGATTTCA
GGCGGTATAACCCCCGTCCTTTTGTCGGTTCGAAGGAAGATCCCACAGCGGCTCAAATGTGGATCGCAAACATGGAAACGACCTTTGAATCAATGCGATGCCCTGACGAA
CATAAGGTGGCATGTGCGACATATGTGTTACAAAAGGATGCAGAAGTGTGGTGGTCGGATAATAAACAGAGCATCAACCCGGGTGGGGGAATTACAACATGGGAGACCTT
TAAGGAAGCCTTTCTAAAATACTATTATCCAAAGGAAACCCGCATAAAGAAACAGCAAGAGTTTAACCACTTAACCCAAGGTGATCGCACGGTGGATCAGTACGATCAGG
ACTTCATGAGATTAAGAAGGTTTGCACCGTCTTTAGCGGACACTGAAGAGAAACAGACAGAAAAATTTGTGTTAGGATTGAATCCGAAAACCCGCCGCATGTTAGAGGCC
TTCAACCCAAAAACCTATGAAGAGGCACTAAGGACGGCCAAGGCCCTTGAGGAACCCCCAGAGGAAAAGAAAACAGAGCCAACAGTTGCCACAGGGAGGAAACGCCCTGT
CGAGGTCGATACCACAGAATTCCAACCACCGTCCCAAAGGCCTCGGTATCAGAATAGGCCACCTGCTCCACCTCCAATAGGCCGATACCTAGCAATGGAGAAGCCCCTGT
GCCGTAACTGTGGAAAGCAACATGTTGGGAGATGTTTGGCGGGCTCAGGTATGTGTTATATCTGTGGTCATGCAGGTCATGTGGCCAGAACTTGCCCTACAAAGAGCCCG
GGATTTCCAAGGGAACCCCTTAGAGGACCGGTCATCCGAGAGCCCACCTTACAAACTCGTCCACAGACCAAGGCATATGTAACGACCGGTAAAGAGGCGGGAACATCTGG
CACCGTGGTGACAGGTACGCTTGCTATACTAGGACACTTTGCGTTGACATTGTTTGATTCTGGTTCTACCCATTCCTTTGTTGCTTTACCATTTGTTAAACAAGCAGGAT
TCGTAATAGAACCCTTAATGCATGCGTTGTCGGTCGGTACCCCGGCAGGGGTAGACCTAGTTACGAAAGATAGAGTAAGGGACGGACAAGTGGTAATAGCTGGACAAACC
ATCCACGTAGACTTAAAGGTAGTGGATATGACGGATTTTGACGTCATACTAGGAATGGACTGGTTAGCAGAAAACTTTGCTACCATAGACTGCCATAAGAAGGAGGTGAT
ATTCACACCTCCTAACGGACTTACCTTTAAGTTTAAAGGAACCTCCACAGGTACCACCCCAAAAATAATATCGATGATGAAAGCGAGACGCTTGATACAACAAGGGGGGT
GGGCGTTCCTAGCTTATGCAGTAAATACAAAAGGAAAGGAAAAACCCATAGATACAATACCGGTAGTAAACGAATTTATGGATGTCTTTCCGGAGGACCTCCCGGGAATT
CCTCCATCGCGAGAGGTAGACTTTGGAATAGATCTAGAACCAGGAACGGGACCCATCTCTAAGGCCCCGTACCGCATGGCACCAGCAGAACTAAAAGAGCTTAAAATACA
ACTGCAAGACCTGCTAGATAAGGGCTTCATTCGACCTAGCGTGTCCCCTTGGGGTGCGCCAGTGTTGTTTGTTAAAAAGAAAGACGGCTCAATGCGTTTATGCATTGATT
ATAGGGAATTAAACAAGAGGACGGTAAAGAACAAATACCCGTTACCCCGAATTGAGGACCTATTTGATCAACTACGTGGGGCGACAGTGTTTTCTAAAATAGATCTTCGA
TCAGGATACCATCAAATTAAGATTAAAAATGAAGACATACCGAAAACGGCCTTTCGAACCAGATATGGCCACTACGAGTTTGTGGTGATGTCTTTTGGTCTCACCAATGC
CCCAGCGGTATTTATGGAACTAATGAACCGAGTATTTAAGGAGTGCCTAGACTTGTTTGTGATCGTGTTCATAGACGATATCCTGATATACTCGAAAACGGACCTAGAAC
ACCAAGAACACCTTCGCAAAGCCTTAACTATCCTAAGAGAGAATAGGTTGTATGCCAAGTTCACCAAGTGTGAATTTTGGCTACGACAGGTTTCGTTTCTGGGGCACGTA
GTGTCTAAAGACGGAATCTTCGTAGATCCCAATAAGATAGAAGCGGTCACGAAATGGAAACGCCCAACAACGGTCACCGAGATACGAAGTTTCTTGGGATTGGCGGGTTA
TTATCGAAGGTTTGTCCAGGACTTCGCTAGAATAGCCACGCCTCTCACCCAATTAACAAAAAAAGGTGTACCTTTTGTTTGGGATGATACTTGTGAGGCCAGCTTTCAAG
AACTAAAACAACGACTAGTATCCACCCCAGTGCTCACAGTTCCAGAAGGTTCTGTGGGATACGCGATTTACAGTGACGCATCCAAAAAGGGATTAGGTTGTGTACTGATG
CAACATGGCAAGGTTGTTGCGTATGCGTCACACCAACTAAAAGATTACAAGAAAAACTATCCTACACATGATTTGGAGCTGGCAGCAGTAGTGTTCACGTTAAAAATTTG
GCGACACTACTTGTATGGCGAAAAGACCCAGATCTACACAGACCACAAAAGCCTAAAATATTTGTTCACGCAAAAAGAATTGAACATGCGGCAACGTAGATGGTTAGAGC
TGGTAAAAGATTACGACATAGACATCCAGTACCACCCTGGGAAAGCAAACGTAGTCGCTGATGCACTGAGCCGAAAAGTAGTACACTCGTCGGCCTTCATCACCAGGGAG
CCACGAGTGCGAACTGATTTCGAACAGGCTGATATCGCGGTAGTAACCAAAGAAGTTGCCGCCCAAATAGCCCGACTGACAGTGCGCCCCACGCTCAGACAAAGAATCAT
AGACTCACAACGTGAGGATCCAAGCCTAAGTAAAATCCTAGACCAATTGGCTGTTGGCCCAGTGGACGGATTCACTAAATCAACAGATGACGGATTATTATGCCAAGGGC
GTTTATGTGTCCCACCCATGAGCGGAATAAAGAATGAAATCCTAACGGAGGCACACAACTCAACCTTCTCGATACATCCAGGTGGAACAAAGATGTATCAAGACTTGAAA
AAACACTTCTGGTGGCGAAGTATGAAGAAGGATATCGCGGAGCTTACCCAAAACATTAAAGGGCTGTACGGTAATCTGGGTGGTCGTGGATCGCCTGACAAAGTCGGCTC
ACTTCCTACCGGGTCCCCCTTGTGTTGGGACGAGGTAGGAGAACGAGAATTGATAGGACCCGAGCTGGTCCGTGTCACCAACGAAGCAATTCAGAAAATCCGAATAAGGG
CAAGGCACCCATGTGACGACGACGACGGCATCGTACCCCCAGGATCGTGGCACATGCATAGGGTAGTGGGTTGTTCTTTTTCCATTTCGTAG
Protein sequenceShow/hide protein sequence
MQTMLDTQQATLLATQQVAAMATQPANAVATQPANAVATQPANAVAPQPVDAVAIQPVNNPHPNRGANWLKDFRRYNPRPFVGSKEDPTAAQMWIANMETTFESMRCPDE
HKVACATYVLQKDAEVWWSDNKQSINPGGGITTWETFKEAFLKYYYPKETRIKKQQEFNHLTQGDRTVDQYDQDFMRLRRFAPSLADTEEKQTEKFVLGLNPKTRRMLEA
FNPKTYEEALRTAKALEEPPEEKKTEPTVATGRKRPVEVDTTEFQPPSQRPRYQNRPPAPPPIGRYLAMEKPLCRNCGKQHVGRCLAGSGMCYICGHAGHVARTCPTKSP
GFPREPLRGPVIREPTLQTRPQTKAYVTTGKEAGTSGTVVTGTLAILGHFALTLFDSGSTHSFVALPFVKQAGFVIEPLMHALSVGTPAGVDLVTKDRVRDGQVVIAGQT
IHVDLKVVDMTDFDVILGMDWLAENFATIDCHKKEVIFTPPNGLTFKFKGTSTGTTPKIISMMKARRLIQQGGWAFLAYAVNTKGKEKPIDTIPVVNEFMDVFPEDLPGI
PPSREVDFGIDLEPGTGPISKAPYRMAPAELKELKIQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKRTVKNKYPLPRIEDLFDQLRGATVFSKIDLR
SGYHQIKIKNEDIPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDLFVIVFIDDILIYSKTDLEHQEHLRKALTILRENRLYAKFTKCEFWLRQVSFLGHV
VSKDGIFVDPNKIEAVTKWKRPTTVTEIRSFLGLAGYYRRFVQDFARIATPLTQLTKKGVPFVWDDTCEASFQELKQRLVSTPVLTVPEGSVGYAIYSDASKKGLGCVLM
QHGKVVAYASHQLKDYKKNYPTHDLELAAVVFTLKIWRHYLYGEKTQIYTDHKSLKYLFTQKELNMRQRRWLELVKDYDIDIQYHPGKANVVADALSRKVVHSSAFITRE
PRVRTDFEQADIAVVTKEVAAQIARLTVRPTLRQRIIDSQREDPSLSKILDQLAVGPVDGFTKSTDDGLLCQGRLCVPPMSGIKNEILTEAHNSTFSIHPGGTKMYQDLK
KHFWWRSMKKDIAELTQNIKGLYGNLGGRGSPDKVGSLPTGSPLCWDEVGERELIGPELVRVTNEAIQKIRIRARHPCDDDDGIVPPGSWHMHRVVGCSFSIS