; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011150 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011150
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr10:7484327..7491727
RNA-Seq ExpressionCmoCh10G011150
SyntenyCmoCh10G011150
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0019538 - protein metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031437.1 pol protein [Cucumis melo var. makuwa]0.0e+0060.42Show/hide
Query:  AQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNK
        A  QAQA A  QAQA     P    P PV+ S     +RDF++Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   +
Subjt:  AQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNK

Query:  TLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRT
         +L  +   + WE+FKE F  +++  + +  K QEF +L Q  +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR 
Subjt:  TLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRT

Query:  AKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA-------HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAG
        A  L  P+     K       +GQKR  E+     P           RHR     A                    G   R    CT CGR+HGGRC+AG
Subjt:  AKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA-------HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAG

Query:  SRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRA-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELE
        S  C+RC Q GH A  C           PR   +T  P P  A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+E
Subjt:  SRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRA-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELE

Query:  PLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKK
        PL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA K
Subjt:  PLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKK

Query:  LVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAP
        L+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG+PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAP
Subjt:  LVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAP

Query:  VLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNR
        VLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+ ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNR
Subjt:  VLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNR

Query:  VFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGY
        VFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGY
Subjt:  VFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGY

Query:  YRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLEL
        YRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++LVTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLEL
Subjt:  YRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLEL

Query:  AAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATE
        AAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+  
Subjt:  AAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATE

Query:  GVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
         V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

KAA0032535.1 pol protein [Cucumis melo var. makuwa]0.0e+0058.93Show/hide
Query:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK
        PEA P    VD N  +     QA +  +    Q  LQ A     A  Q QA  +    A  QAQA A  QAQA A   P    P PV+ S     +RDF+
Subjt:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK

Query:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD
        +Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   + +L  +   + WE+FKE F  +++  + +  K QEF +L Q 
Subjt:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD

Query:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---
         +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR A  L  P+     K       +GQKR  E+     P      
Subjt:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---

Query:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR
             RHR     A                    G   R    CT CGR+HGGRC+AGS  C+RC Q GH A  C           PR   +T  P P  
Subjt:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR

Query:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN
        A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+EPL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++
Subjt:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN

Query:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG
        M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA KL+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG
Subjt:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG

Query:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR
        +PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+
Subjt:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR

Query:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK
         ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +K
Subjt:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK

Query:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL
        LYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGYYRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++L
Subjt:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL

Query:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR
        VTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD
        RWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+   V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +
Subjt:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD

Query:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
        GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

KAA0033825.1 pol protein [Cucumis melo var. makuwa]0.0e+0058.93Show/hide
Query:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK
        PEA P    VD N  +     QA +  +    Q  LQ A     A  Q QA  +    A  QAQA A  QAQA A   P    P PV+ S     +RDF+
Subjt:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK

Query:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD
        +Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   + +L  +   + WE+FKE F  +++  + +  K QEF +L Q 
Subjt:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD

Query:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---
         +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR A  L  P+     K       +GQKR  E+     P      
Subjt:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---

Query:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR
             RHR     A                    G   R    CT CGR+HGGRC+AGS  C+RC Q GH A  C           PR   +T  P P  
Subjt:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR

Query:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN
        A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+EPL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++
Subjt:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN

Query:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG
        M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA KL+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG
Subjt:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG

Query:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR
        +PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+
Subjt:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR

Query:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK
         ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +K
Subjt:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK

Query:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL
        LYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGYYRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++L
Subjt:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL

Query:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR
        VTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD
        RWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+   V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +
Subjt:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD

Query:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
        GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

XP_022933231.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111440131 [Cucurbita moschata]0.0e+0068.57Show/hide
Query:  AQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQ
        A   A A+A           A A A     A A PPVN P P R +NW++DF+RY+PRPF GS  DP  AQMWIA +ETTFE+M CP+  KV CAT+VLQ
Subjt:  AQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQ

Query:  KDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAF
        KDAE+WW DNK  +NP GG   WE FKEAFLK YYPK  R+KKQQEF  L Q   TV++Y+++F RL+RFAPS+ DTEEK TEKFVLGL P+ RRMLEAF
Subjt:  KDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAF

Query:  NPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMA
        NPKTYEEALRTAKALE+P +EK+ E  V IG+KRP E       PP+ R R  +RP        PP      P  +  A  +    C  CG+ H GRC+A
Subjt:  NPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMA

Query:  GSRACYRCGQEGHIAVNC---TAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGF
        GS  CY CG  GH+A  C   + G        P + E T Q +P   Q +AY +TS + G S  VVTGTLSILGHFA TLFDSGSTHSF++ PF+ QAGF
Subjt:  GSRACYRCGQEGHIAVNC---TAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGF

Query:  ELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMK
         +EPL+H +SV TPAGVDLV++DRV+DGQV+I  QT+ VDL VV+MTDFDVILGMDWLAEN A+IDCH+KEV F+PP G TFKFKGT+TG TPK++SMMK
Subjt:  ELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMK

Query:  AKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPW
        A++L+QQGGWA LA AV+ +GKE+ +  +P+VNEF DVFP+DLPGIPPSR VDF I+LE GTGPISKAPYRMAPAELKELK QLQDLLD           
Subjt:  AKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPW

Query:  GAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL
                 KD SMRLCI YRELNK T+KNKYPLPRIEDLFDQLR ATVFSKIDLRSGYHQI+I  +D+PKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL
Subjt:  GAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL

Query:  MNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGL
        MNRVFKECLD+FVIVFIDDILIYS+TD++H+EHLRK LT LRE+KLYA F+KCEFW+ QVSFLGH+VSKDGI VDP K+EA+TK +RPTTVTEIRSFLGL
Subjt:  MNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGL

Query:  AGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHD
         GYYRRFV DFA+I++PLT+LTKKGVPF WDD CE SFQ LK+RLV+APVL VPESS GY IYSDAS KGLGCVLMQHGKVVAYAS QLK+YEKNYPTHD
Subjt:  AGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHD

Query:  LELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAV
        LELAAVVFALKIWRHY YGEKTQI+TDHKSLKY FTQKELNMRQRRWLELVKDYD+DIQYHPGKANVVADALSRK VHSSALITRE R + +FE+A+I V
Subjt:  LELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAV

Query:  ATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGG
         T+ V AQLAR+TV+PTLRQRII SQ+EDP+L K+L QL+  PVDGF+KS+D+GLL QGRLCVP +  ++ EIL EAHNS F++HPGG
Subjt:  ATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGG

XP_023522446.1 uncharacterized protein LOC111786377 [Cucurbita pepo subsp. pepo]0.0e+0087.27Show/hide
Query:  RGRGRGRGRGRRGRGRPRHREVGRGVAVGGPPQERSETGPVTVTPPPVLPLVELPPQAPLQAEPPPPVEAPPQPTLPLQVAPEAAPPMGGVDANTRLMCD
        RGRGRGRGRGRRGRGRPR REVGRGVAV GPPQERSETGPVTVTPPPVLP V          +PPP VE PPQPTLPLQVAPEAAPP+GGVDANTRLMCD
Subjt:  RGRGRGRGRGRRGRGRPRHREVGRGVAVGGPPQERSETGPVTVTPPPVLPLVELPPQAPLQAEPPPPVEAPPQPTLPLQVAPEAAPPMGGVDANTRLMCD

Query:  SIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIA
        SIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQ                                     RYDPRPFTGSLVDPIEAQMWIA
Subjt:  SIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIA

Query:  AVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMV
        AVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLK+QQEFAHLVQ GLTVE+YNREFN+LKRFAPSMV
Subjt:  AVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMV

Query:  DTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRN
        DTEEKMTEKFVLGL PRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRREE VIIGQK PHESGGS RPPP  RHRSNNR APRWDER PPRRTDRNPRN
Subjt:  DTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRN

Query:  QDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAF
        QDGAR RREEGCTICGRLH GRCMAGSRACYRCGQEGHIAVNCTAGNAAAQAN PRVVEQTDQPAPPRAQARAY STS+DTGRSDAVVTGTLSILGHF F
Subjt:  QDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAF

Query:  TLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPI
        TLFDS STHSFISMPFVVQAGFELEPLLHEMSVSTP GVDLVSR RVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCH+K+VKFSPP 
Subjt:  TLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPI

Query:  GPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELK
        GP FKFKGT+TG+TPKVVSMMKAK+LVQQ GWAIL+C VDVRGKE TLVNVPIVNEFPDVF DDLPGIPPSRAV+FVIEL+ GTGPISKAPYRMAPAELK
Subjt:  GPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELK

Query:  ELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRY
        ELKAQLQDLLDKGFIRPSVSP GAPVLFVKKKDGSMRLCIDYRELNK +IKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRY
Subjt:  ELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRY

Query:  GHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTK
        GHYEFVVMSFGLTN+ AVFMELMNRVFKECLDMF+IVFIDDILIYS+T +EHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG+VVSKD ISVDPTK
Subjt:  GHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTK

Query:  VEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKL
        VEAITKWERPTTV E+RSFLG    Y    ++F +I   L K+
Subjt:  VEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKL

TrEMBL top hitse value%identityAlignment
A0A5A7SPM4 Reverse transcriptase0.0e+0060.42Show/hide
Query:  AQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNK
        A  QAQA A  QAQA     P    P PV+ S     +RDF++Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   +
Subjt:  AQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNK

Query:  TLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRT
         +L  +   + WE+FKE F  +++  + +  K QEF +L Q  +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR 
Subjt:  TLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRT

Query:  AKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA-------HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAG
        A  L  P+     K       +GQKR  E+     P           RHR     A                    G   R    CT CGR+HGGRC+AG
Subjt:  AKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA-------HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAG

Query:  SRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRA-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELE
        S  C+RC Q GH A  C           PR   +T  P P  A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+E
Subjt:  SRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPRA-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELE

Query:  PLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKK
        PL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA K
Subjt:  PLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKK

Query:  LVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAP
        L+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG+PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAP
Subjt:  LVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAP

Query:  VLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNR
        VLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+ ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNR
Subjt:  VLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNR

Query:  VFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGY
        VFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +KLYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGY
Subjt:  VFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGY

Query:  YRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLEL
        YRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++LVTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLEL
Subjt:  YRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLEL

Query:  AAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATE
        AAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQRRWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+  
Subjt:  AAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATE

Query:  GVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
         V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

A0A5A7TSL0 Reverse transcriptase0.0e+0058.93Show/hide
Query:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK
        PEA P    VD N  +     QA +  +    Q  LQ A     A  Q QA  +    A  QAQA A  QAQA A   P    P PV+ S     +RDF+
Subjt:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK

Query:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD
        +Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   + +L  +   + WE+FKE F  +++  + +  K QEF +L Q 
Subjt:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD

Query:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---
         +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR A  L  P+     K       +GQKR  E+     P      
Subjt:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---

Query:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR
             RHR     A                    G   R    CT CGR+HGGRC+AGS  C+RC Q GH A  C           PR   +T  P P  
Subjt:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR

Query:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN
        A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+EPL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++
Subjt:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN

Query:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG
        M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA KL+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG
Subjt:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG

Query:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR
        +PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+
Subjt:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR

Query:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK
         ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +K
Subjt:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK

Query:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL
        LYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGYYRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++L
Subjt:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL

Query:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR
        VTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD
        RWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+   V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +
Subjt:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD

Query:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
        GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

A0A5A7UBH7 Reverse transcriptase0.0e+0058.93Show/hide
Query:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK
        PEA P    VD N  +     QA +  +    Q  LQ A     A  Q QA  +    A  QAQA A  QAQA A   P    P PV+ S     +RDF+
Subjt:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK

Query:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD
        +Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   + +L  +   + WE+FKE F  +++  + +  K QEF +L Q 
Subjt:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD

Query:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---
         +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR A  L  P+     K       +GQKR  E+     P      
Subjt:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---

Query:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR
             RHR     A                    G   R    CT CGR+HGGRC+AGS  C+RC Q GH A  C           PR   +T  P P  
Subjt:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR

Query:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN
        A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+EPL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++
Subjt:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN

Query:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG
        M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA KL+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG
Subjt:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG

Query:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR
        +PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+
Subjt:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR

Query:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK
         ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +K
Subjt:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK

Query:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL
        LYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGYYRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++L
Subjt:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL

Query:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR
        VTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD
        RWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+   V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +
Subjt:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD

Query:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
        GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

A0A5A7UBS1 Reverse transcriptase0.0e+0058.93Show/hide
Query:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK
        PEA P    VD N  +     QA +  +    Q  LQ A     A  Q QA  +    A  QAQA A  QAQA A   P    P PV+ S     +RDF+
Subjt:  PEAAPPMGGVDANTRLMCDSIQALIQTLVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCS---NWIRDFK

Query:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD
        +Y+P+ F GS+ +P +AQMW+ ++ET F  M+CPE+QKV CA F L+     WW   + +L  +   + WE+FKE F  +++  + +  K QEF +L Q 
Subjt:  RYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQD

Query:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---
         +TVE+Y+ EF+ L RFAP MV  E   TEKFV GL   ++ ++ A  P T+ +ALR A  L  P+     K       +GQKR  E+     P      
Subjt:  GLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPKTYEEALRTAKALEKPK---DEKRREERVIIGQKRPHESGGSGRPPPA---

Query:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR
             RHR     A                    G   R    CT CGR+HGGRC+AGS  C+RC Q GH A  C           PR   +T  P P  
Subjt:  ----HRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGHIAVNCTAGNAAAQANLPRVVEQTDQPAPPR

Query:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN
        A Q R +A+T ++  R+  VVTGTL ILGH+AF LFDSGS+HSFIS  FV   G E+EPL   +SVSTP+G  L+S++++K  +V I N+ L V L+V++
Subjt:  A-QARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVN

Query:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG
        M DFDVILGMDWL+ N A+IDC+ KEV F+PP   +FKF+G      PKV+S MKA KL+ QG W ILA  VDVR  E +L + P+V E+PDVFPD+LPG
Subjt:  MTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPG

Query:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR
        +PP R VDF IELEPGT PIS+APYRMAPAELKELK QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK+T+KN+YPLPRI+DLFDQL+
Subjt:  IPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLR

Query:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK
         ATVFSKIDLRSGYHQ+RI + D+PKTAFR+RYGHYEFVVMSFGLTNAPAVFM+LMNRVFKE LD FVIVFIDDILIYS+T+ EHEEHL +VL TLR +K
Subjt:  EATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHK

Query:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL
        LYAKFSKCEFWLR+V+FLGHVVS +G+SVDP K+EA+T W RP+TV+EIRSFLGLAGYYRRFV+DF++I+SPLT+LT+KG PF W  ACE SFQ LK++L
Subjt:  LYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERL

Query:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR
        VTAPVL VP+ S  +VIYSDAS KGLGCVLMQ GKVVAYASRQLK +E+NYPTHDLELAAVVFALKIWRHYLYGEK QI+TDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD
        RWLELVKDYD +I YHPGKANVVADALSRK  HS+ALIT++  + R+FERA IAV+   V AQLA+LTVQPTLRQ+II +Q +DP L +    ++    +
Subjt:  RWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVD

Query:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG
        GFS SSD+GL+++GRLCVP    ++ E+L EAH+SPF MHPG
Subjt:  GFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPG

A0A6J1EYH9 Reverse transcriptase0.0e+0068.57Show/hide
Query:  AQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQ
        A   A A+A           A A A     A A PPVN P P R +NW++DF+RY+PRPF GS  DP  AQMWIA +ETTFE+M CP+  KV CAT+VLQ
Subjt:  AQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVACATFVLQ

Query:  KDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAF
        KDAE+WW DNK  +NP GG   WE FKEAFLK YYPK  R+KKQQEF  L Q   TV++Y+++F RL+RFAPS+ DTEEK TEKFVLGL P+ RRMLEAF
Subjt:  KDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAF

Query:  NPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMA
        NPKTYEEALRTAKALE+P +EK+ E  V IG+KRP E       PP+ R R  +RP        PP      P  +  A  +    C  CG+ H GRC+A
Subjt:  NPKTYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMA

Query:  GSRACYRCGQEGHIAVNC---TAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGF
        GS  CY CG  GH+A  C   + G        P + E T Q +P   Q +AY +TS + G S  VVTGTLSILGHFA TLFDSGSTHSF++ PF+ QAGF
Subjt:  GSRACYRCGQEGHIAVNC---TAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGF

Query:  ELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMK
         +EPL+H +SV TPAGVDLV++DRV+DGQV+I  QT+ VDL VV+MTDFDVILGMDWLAEN A+IDCH+KEV F+PP G TFKFKGT+TG TPK++SMMK
Subjt:  ELEPLLHEMSVSTPAGVDLVSRDRVKDGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMK

Query:  AKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPW
        A++L+QQGGWA LA AV+ +GKE+ +  +P+VNEF DVFP+DLPGIPPSR VDF I+LE GTGPISKAPYRMAPAELKELK QLQDLLD           
Subjt:  AKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPW

Query:  GAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL
                 KD SMRLCI YRELNK T+KNKYPLPRIEDLFDQLR ATVFSKIDLRSGYHQI+I  +D+PKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL
Subjt:  GAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMEL

Query:  MNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGL
        MNRVFKECLD+FVIVFIDDILIYS+TD++H+EHLRK LT LRE+KLYA F+KCEFW+ QVSFLGH+VSKDGI VDP K+EA+TK +RPTTVTEIRSFLGL
Subjt:  MNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGL

Query:  AGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHD
         GYYRRFV DFA+I++PLT+LTKKGVPF WDD CE SFQ LK+RLV+APVL VPESS GY IYSDAS KGLGCVLMQHGKVVAYAS QLK+YEKNYPTHD
Subjt:  AGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHD

Query:  LELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAV
        LELAAVVFALKIWRHY YGEKTQI+TDHKSLKY FTQKELNMRQRRWLELVKDYD+DIQYHPGKANVVADALSRK VHSSALITRE R + +FE+A+I V
Subjt:  LELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAV

Query:  ATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGG
         T+ V AQLAR+TV+PTLRQRII SQ+EDP+L K+L QL+  PVDGF+KS+D+GLL QGRLCVP +  ++ EIL EAHNS F++HPGG
Subjt:  ATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHPGG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.3e-9039.09Show/hide
Query:  YRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRI
        Y    A  +E+++Q+QD+L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++T+ +++P+P ++++  +L     F+ IDL  G+HQI +
Subjt:  YRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRI

Query:  NEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG
        + + V KTAF T++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + L  V   L +  L  +  KCEF  ++ +FLG
Subjt:  NEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG

Query:  HVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPF-RWDDACEASFQNLKERLVTAPVLIVPESSEGYVIY
        HV++ DGI  +P K+EAI K+  PT   EI++FLGL GYYR+F+ +FA I+ P+TK  KK +     +   +++F+ LK  +   P+L VP+ ++ + + 
Subjt:  HVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPF-RWDDACEASFQNLKERLVTAPVLIVPESSEGYVIY

Query:  SDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPG
        +DAS   LG VL Q G  ++Y SR L E+E NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D DI+Y  G
Subjt:  SDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPG

Query:  KANVVADALSRKTVHSSALITREVRVQREFERANIAVATE
        K N VADALSR  +  + L + + +   E + +++   TE
Subjt:  KANVVADALSRKTVHSSALITREVRVQREFERANIAVATE

P20825 Retrovirus-related Pol polyprotein from transposon 2972.0e-8737.2Show/hide
Query:  VDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKD-----
        +D   +EET     ++N+F ++   +   +  +  +  V+     + PI    Y +A     E++ Q+Q++L++G IR S SP+ +P   V KK      
Subjt:  VDVRGKEETLVNVPIVNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKD-----

Query:  GSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDM
           R+ IDYR+LN++TI ++YP+P ++++  +L +   F+ IDL  G+HQI ++E+ + KTAF T+ GHYE++ M FGL NAPA F   MN + +  L+ 
Subjt:  GSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDM

Query:  FVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDF
          +V++DDI+I+S +  EH   ++ V T L +  L  +  KCEF  ++ +FLGH+V+ DGI  +P KV+AI  +  PT   EIR+FLGL GYYR+F+ ++
Subjt:  FVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDF

Query:  AKISSPLTKLTKKGVPFRWD--DACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFA
        A I+ P+T   KK         +  EA F+ LK  ++  P+L +P+  + +V+ +DAS   LG VL Q+G  +++ SR L ++E NY   + EL A+V+A
Subjt:  AKISSPLTKLTKKGVPFRWD--DACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFA

Query:  LKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSR
         K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y   I Y  GK N VADALSR
Subjt:  LKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.6e-8134.9Show/hide
Query:  EETLVNVPI--VNEFPDVFPDDLPGIPP---SRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC
        ++T   +P+    ++ ++  +DLP  P    +  V   IE++PG       PY +     +E+   +Q LLD  FI PS SP  +PV+ V KKDG+ RLC
Subjt:  EETLVNVPI--VNEFPDVFPDDLPGIPP---SRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFI
        +DYR LNK TI + +PLPRI++L  ++  A +F+ +DL SGYHQI +  KD  KTAF T  G YE+ VM FGL NAP+ F   M   F++    FV V++
Subjt:  IDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFI

Query:  DDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSP
        DDILI+S +  EH +HL  VL  L+   L  K  KC+F   +  FLG+ +    I+    K  AI  +  P TV + + FLG+  YYRRF+ + +KI+ P
Subjt:  DDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSP

Query:  LTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKNYPTHDLELAAVVFALK
        +          +W +  + + + LK  L  +PVL+   +   Y + +DAS  G+G VL +         VV Y S+ L+  +KNYP  +LEL  ++ AL 
Subjt:  LTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKNYPTHDLELAAVVFALK

Query:  IWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLAR
         +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  ++Y  G  NVVADA+SR     +   +R +  +              V+  +  
Subjt:  IWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLAR

Query:  LTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVP
        LT      + +   +    + QK L +L E+    +S   DE + YQ RL VP
Subjt:  LTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.0e-8834.57Show/hide
Query:  IGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPI
        +GN +     ++ N+  FD I+G D L + +A +D     +  +P I               K+  + +A   V       L  A    G +E L +  +
Subjt:  IGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPI

Query:  VNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKM
        + EFP +F   L G+    AV   I       PI    Y        E++ Q+ +LL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN +
Subjt:  VNEFPDVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKM

Query:  TIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRT
        TI + YP+P I      L  A  F+ +DL SG+HQI + E D+PKTAF T  G YEF+ + FGL NAPA+F  +++ + +E +     V+IDDI+++S  
Subjt:  TIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRT

Query:  DIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTK---
           H ++LR VL +L +  L     K  F   QV FLG++V+ DGI  DP KV AI++   PT+V E++ FLG+  YYR+F+QD+AK++ PLT LT+   
Subjt:  DIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTK---

Query:  --------KGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQ----HGKVVAYASRQLKEYEKNYPTHDLELAAVVFALK
                  VP   D+    SF +LK  L ++ +L  P  ++ + + +DAS   +G VL Q      + +AY SR L + E+NY T + E+ A++++L 
Subjt:  --------KGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQ----HGKVVAYASRQLKEYEKNYPTHDLELAAVVFALK

Query:  IWRHYLYGEKT-QIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSR
          R YLYG  T +++TDH+ L +    +  N + +RW   +++Y+ ++ Y PGK+NVVADALSR
Subjt:  IWRHYLYGEKT-QIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.1e-8134.9Show/hide
Query:  EETLVNVPI--VNEFPDVFPDDLPGIPP---SRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC
        ++T   +P+    ++ ++  +DLP  P    +  V   IE++PG       PY +     +E+   +Q LLD  FI PS SP  +PV+ V KKDG+ RLC
Subjt:  EETLVNVPI--VNEFPDVFPDDLPGIPP---SRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFI
        +DYR LNK TI + +PLPRI++L  ++  A +F+ +DL SGYHQI +  KD  KTAF T  G YE+ VM FGL NAP+ F   M   F++    FV V++
Subjt:  IDYRELNKMTIKNKYPLPRIEDLFDQLREATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFI

Query:  DDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSP
        DDILI+S +  EH +HL  VL  L+   L  K  KC+F   +  FLG+ +    I+    K  AI  +  P TV + + FLG+  YYRRF+ + +KI+ P
Subjt:  DDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFWLRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSP

Query:  LTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKNYPTHDLELAAVVFALK
        +          +W +  + +   LK+ L  +PVL+   +   Y + +DAS  G+G VL +         VV Y S+ L+  +KNYP  +LEL  ++ AL 
Subjt:  LTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDASMKGLGCVLMQHGK------VVAYASRQLKEYEKNYPTHDLELAAVVFALK

Query:  IWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLAR
         +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  ++Y  G  NVVADA+SR     +   +R +  +              V+  +  
Subjt:  IWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKTVHSSALITREVRVQREFERANIAVATEGVIAQLAR

Query:  LTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVP
        LT      + +   +    + QK L +L E+    +S   DE + YQ RL VP
Subjt:  LTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVP

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.9e-2644Show/hide
Query:  HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRW
        HL  VL    +H+ YA   KC F   Q+++LG  H++S +G+S DP K+EA+  W  P   TE+R FLGL GYYRRFV+++ KI  PLT+L KK    +W
Subjt:  HLRKVLTTLREHKLYAKFSKCEFWLRQVSFLG--HVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRW

Query:  DDACEASFQNLKERLVTAPVLIVPE
         +    +F+ LK  + T PVL +P+
Subjt:  DDACEASFQNLKERLVTAPVLIVPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAGAGGACGAGGAAGAGGAAGAGGAAGAGGAAGACGAGGAAGGGGGCGACCCCGTCATAGAGAAGTAGGACGAGGTGTGGCAGTAGGGGGCCCACCTCAAGAAAG
ATCAGAGACTGGGCCGGTAACAGTTACGCCGCCCCCAGTCTTACCGCTAGTTGAGCTACCTCCTCAAGCTCCACTACAAGCTGAGCCTCCACCACCAGTAGAAGCCCCAC
CCCAGCCTACCCTACCACTCCAAGTCGCACCTGAAGCTGCACCCCCAATGGGTGGAGTAGATGCTAATACTAGGTTAATGTGTGACTCGATCCAAGCCCTAATTCAAACC
TTAGTAACCAATCAACAAGCCAGTCTACAACAGGCCCAGGAACAAGCCCAAGCACAGGCCCAGGCACAGGCGTTGGCACTGGCACAAGTACAGGCACAGGTACAAGCCCA
GGCACAGGCACAGGCACAGGCACAGGCCACTGCCCAGGCCAATCCCCCAGTAAACCCTCCACAACCTGTTAGATGTTCAAACTGGATAAGGGATTTCAAGAGATACGACC
CTCGACCATTCACTGGTTCCCTAGTAGACCCCATAGAAGCCCAAATGTGGATTGCAGCGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAAGGTCGCC
TGTGCAACCTTCGTCCTACAGAAGGACGCAGAGATATGGTGGAGAGATAACAAGACCCTTCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTT
CCTTAAAGAATATTATCCTAAGTCAGAGCGACTTAAAAAGCAGCAGGAGTTCGCCCACCTGGTACAGGACGGACTCACAGTGGAGAGGTACAACCGAGAGTTCAATAGAC
TCAAGAGATTCGCACCGTCCATGGTGGACACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGTACCAAGAATCCGCCGCATGTTGGAGGCGTTCAACCCAAAG
ACCTATGAGGAAGCCTTGAGAACTGCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGGCGTGAAGAGCGAGTTATAATTGGGCAGAAGCGTCCTCATGAATCAGGAGG
CTCTGGCCGTCCACCACCAGCACATAGGCACCGTTCCAATAACAGACCCGCTCCTAGATGGGATGAGCGACGCCCTCCCCGACGTACTGACAGGAACCCCAGGAATCAGG
ATGGGGCCAGAAGGAGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACACGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGGTGTGGCCAAGAGGGGCAC
ATCGCCGTGAACTGTACGGCCGGAAATGCCGCAGCACAAGCAAACCTGCCCAGAGTAGTAGAGCAAACGGATCAACCAGCACCACCGCGAGCTCAAGCTAGGGCCTACGC
GTCAACCAGCAGGGACACTGGGAGGTCTGACGCCGTGGTGACAGGTACACTGTCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTTA
TTTCCATGCCTTTTGTTGTACAAGCGGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAG
GATGGCCAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACGTCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGC
TAGTATAGACTGTCACCAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGGTTACCCCCAAGGTAGTCTCGATGATGA
AAGCAAAGAAGTTGGTCCAACAAGGTGGATGGGCTATATTAGCATGTGCTGTAGACGTGAGAGGAAAGGAAGAGACCCTAGTAAACGTGCCAATAGTAAACGAGTTTCCG
GATGTATTTCCGGATGACTTACCTGGAATACCCCCTTCCCGAGCGGTTGACTTCGTCATCGAACTCGAGCCGGGAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGC
ACCAGCTGAGTTGAAAGAACTCAAGGCGCAACTGCAAGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGA
AAGATGGATCGATGCGTCTGTGCATCGATTACAGAGAGCTAAACAAGATGACCATAAAAAACAAATATCCTCTACCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAA
GCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACGGCATTTAGGACAAGGTACGGTCACTACGAGTT
TGTAGTGATGTCATTTGGCCTCACCAATGCCCCAGCGGTATTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACA
TCCTCATATACTCGAGAACCGACATAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGG
TTACGACAAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAATGGGAACGCCCAACTACGGTAACGGA
AATAAGGAGTTTCCTAGGATTGGCGGGATATTATCGAAGGTTCGTACAGGACTTCGCTAAAATATCCTCGCCTTTAACAAAGTTAACAAAAAAAGGGGTGCCATTTAGAT
GGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCGCCCCGGTACTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCC
TCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTT
GGCCGCTGTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTGTATGGCGAGAAAACTCAAATTTTTACCGACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGT
TAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCCAGTACCACCCTGGGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACG
GTCCACTCGTCGGCCCTCATTACAAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCATAGCACAGTTGGCCCGACTCAC
TGTACAACCTACACTTAGGCAGAGAATTATTGTCTCCCAACAAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCCAGTAGATGGATTCTCGAAGT
CATCAGATGAAGGACTATTGTATCAGGGACGCTTATGTGTTCCAGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCA
GGAGGTAAAGGCAAGGCGCCCATGTACGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAGAGGACGAGGAAGAGGAAGAGGAAGAGGAAGACGAGGAAGGGGGCGACCCCGTCATAGAGAAGTAGGACGAGGTGTGGCAGTAGGGGGCCCACCTCAAGAAAG
ATCAGAGACTGGGCCGGTAACAGTTACGCCGCCCCCAGTCTTACCGCTAGTTGAGCTACCTCCTCAAGCTCCACTACAAGCTGAGCCTCCACCACCAGTAGAAGCCCCAC
CCCAGCCTACCCTACCACTCCAAGTCGCACCTGAAGCTGCACCCCCAATGGGTGGAGTAGATGCTAATACTAGGTTAATGTGTGACTCGATCCAAGCCCTAATTCAAACC
TTAGTAACCAATCAACAAGCCAGTCTACAACAGGCCCAGGAACAAGCCCAAGCACAGGCCCAGGCACAGGCGTTGGCACTGGCACAAGTACAGGCACAGGTACAAGCCCA
GGCACAGGCACAGGCACAGGCACAGGCCACTGCCCAGGCCAATCCCCCAGTAAACCCTCCACAACCTGTTAGATGTTCAAACTGGATAAGGGATTTCAAGAGATACGACC
CTCGACCATTCACTGGTTCCCTAGTAGACCCCATAGAAGCCCAAATGTGGATTGCAGCGGTGGAAACCACCTTTGAGACTATGGAGTGCCCAGAGAACCAAAAGGTCGCC
TGTGCAACCTTCGTCCTACAGAAGGACGCAGAGATATGGTGGAGAGATAACAAGACCCTTCTTAACCCAGAAGGGGGACCAATGAACTGGGAACGATTTAAGGAGGCCTT
CCTTAAAGAATATTATCCTAAGTCAGAGCGACTTAAAAAGCAGCAGGAGTTCGCCCACCTGGTACAGGACGGACTCACAGTGGAGAGGTACAACCGAGAGTTCAATAGAC
TCAAGAGATTCGCACCGTCCATGGTGGACACTGAGGAAAAGATGACAGAGAAATTTGTATTGGGTTTGGTACCAAGAATCCGCCGCATGTTGGAGGCGTTCAACCCAAAG
ACCTATGAGGAAGCCTTGAGAACTGCCAAGGCTTTAGAGAAACCAAAGGATGAGAAACGGCGTGAAGAGCGAGTTATAATTGGGCAGAAGCGTCCTCATGAATCAGGAGG
CTCTGGCCGTCCACCACCAGCACATAGGCACCGTTCCAATAACAGACCCGCTCCTAGATGGGATGAGCGACGCCCTCCCCGACGTACTGACAGGAACCCCAGGAATCAGG
ATGGGGCCAGAAGGAGGAGAGAGGAAGGGTGCACTATCTGTGGAAGACTACACGGTGGGAGGTGCATGGCTGGCAGCCGAGCGTGTTATAGGTGTGGCCAAGAGGGGCAC
ATCGCCGTGAACTGTACGGCCGGAAATGCCGCAGCACAAGCAAACCTGCCCAGAGTAGTAGAGCAAACGGATCAACCAGCACCACCGCGAGCTCAAGCTAGGGCCTACGC
GTCAACCAGCAGGGACACTGGGAGGTCTGACGCCGTGGTGACAGGTACACTGTCCATTTTAGGTCATTTCGCTTTTACCTTATTTGATTCTGGTTCCACGCATTCCTTTA
TTTCCATGCCTTTTGTTGTACAAGCGGGGTTCGAATTAGAACCCTTATTGCATGAAATGTCTGTAAGCACCCCTGCGGGGGTAGACTTAGTATCTAGGGATAGAGTAAAG
GATGGCCAAGTAATCATAGGGAACCAAACTTTAAGCGTTGACCTGATGGTGGTAAACATGACAGATTTCGACGTCATACTAGGCATGGATTGGTTAGCTGAAAATCGAGC
TAGTATAGACTGTCACCAAAAGGAAGTAAAATTTTCACCACCGATAGGACCTACCTTTAAATTTAAAGGCACAAATACCGGGGTTACCCCCAAGGTAGTCTCGATGATGA
AAGCAAAGAAGTTGGTCCAACAAGGTGGATGGGCTATATTAGCATGTGCTGTAGACGTGAGAGGAAAGGAAGAGACCCTAGTAAACGTGCCAATAGTAAACGAGTTTCCG
GATGTATTTCCGGATGACTTACCTGGAATACCCCCTTCCCGAGCGGTTGACTTCGTCATCGAACTCGAGCCGGGAACTGGGCCTATTTCCAAAGCACCCTATCGCATGGC
ACCAGCTGAGTTGAAAGAACTCAAGGCGCAACTGCAAGACTTACTAGATAAAGGATTCATTCGACCTAGCGTGTCCCCCTGGGGTGCGCCAGTGTTGTTTGTTAAGAAGA
AAGATGGATCGATGCGTCTGTGCATCGATTACAGAGAGCTAAACAAGATGACCATAAAAAACAAATATCCTCTACCTAGAATAGAAGACTTGTTTGATCAACTCAGAGAA
GCAACAGTATTCTCTAAGATAGATCTTCGGTCCGGTTACCACCAAATTAGGATTAATGAAAAAGACGTACCAAAAACGGCATTTAGGACAAGGTACGGTCACTACGAGTT
TGTAGTGATGTCATTTGGCCTCACCAATGCCCCAGCGGTATTTATGGAGTTAATGAACCGGGTATTCAAAGAATGCCTAGACATGTTTGTGATTGTGTTCATTGACGACA
TCCTCATATACTCGAGAACCGACATAGAGCACGAGGAACACCTCCGAAAAGTCCTAACCACCCTAAGAGAGCACAAGTTGTACGCCAAGTTCTCCAAATGCGAATTTTGG
TTACGACAAGTCTCTTTCCTAGGACACGTGGTGTCAAAGGACGGAATATCTGTAGATCCCACCAAGGTCGAAGCGATCACAAAATGGGAACGCCCAACTACGGTAACGGA
AATAAGGAGTTTCCTAGGATTGGCGGGATATTATCGAAGGTTCGTACAGGACTTCGCTAAAATATCCTCGCCTTTAACAAAGTTAACAAAAAAAGGGGTGCCATTTAGAT
GGGATGATGCTTGTGAGGCAAGCTTCCAGAACCTAAAAGAGAGATTGGTAACCGCCCCGGTACTCATAGTACCCGAGAGCTCAGAAGGATATGTGATCTATAGTGATGCC
TCCATGAAAGGACTGGGATGTGTGTTAATGCAACACGGCAAGGTTGTCGCATACGCATCTCGTCAACTTAAAGAATATGAAAAGAACTACCCTACCCATGACCTAGAGTT
GGCCGCTGTAGTGTTCGCGCTGAAAATCTGGCGACATTACCTGTATGGCGAGAAAACTCAAATTTTTACCGACCACAAAAGTTTGAAATACTTCTTCACCCAGAAAGAGT
TAAACATGAGGCAGAGAAGGTGGTTAGAATTGGTGAAGGATTATGACGTAGATATCCAGTACCACCCTGGGAAAGCAAATGTGGTTGCAGATGCCTTGAGTAGGAAGACG
GTCCACTCGTCGGCCCTCATTACAAGGGAAGTAAGGGTACAAAGAGAGTTCGAGCGAGCCAACATAGCTGTAGCGACCGAGGGAGTCATAGCACAGTTGGCCCGACTCAC
TGTACAACCTACACTTAGGCAGAGAATTATTGTCTCCCAACAAGAGGATCCTAACCTACAGAAAGTCCTAGGACAGCTAGACGAAAGTCCAGTAGATGGATTCTCGAAGT
CATCAGATGAAGGACTATTGTATCAGGGACGCTTATGTGTTCCAGCAATAGAAGATTTAAGGAAGGAAATACTGATGGAAGCTCACAACTCACCATTTGCGATGCATCCA
GGAGGTAAAGGCAAGGCGCCCATGTACGGTTGA
Protein sequenceShow/hide protein sequence
MPRGRGRGRGRGRRGRGRPRHREVGRGVAVGGPPQERSETGPVTVTPPPVLPLVELPPQAPLQAEPPPPVEAPPQPTLPLQVAPEAAPPMGGVDANTRLMCDSIQALIQT
LVTNQQASLQQAQEQAQAQAQAQALALAQVQAQVQAQAQAQAQAQATAQANPPVNPPQPVRCSNWIRDFKRYDPRPFTGSLVDPIEAQMWIAAVETTFETMECPENQKVA
CATFVLQKDAEIWWRDNKTLLNPEGGPMNWERFKEAFLKEYYPKSERLKKQQEFAHLVQDGLTVERYNREFNRLKRFAPSMVDTEEKMTEKFVLGLVPRIRRMLEAFNPK
TYEEALRTAKALEKPKDEKRREERVIIGQKRPHESGGSGRPPPAHRHRSNNRPAPRWDERRPPRRTDRNPRNQDGARRRREEGCTICGRLHGGRCMAGSRACYRCGQEGH
IAVNCTAGNAAAQANLPRVVEQTDQPAPPRAQARAYASTSRDTGRSDAVVTGTLSILGHFAFTLFDSGSTHSFISMPFVVQAGFELEPLLHEMSVSTPAGVDLVSRDRVK
DGQVIIGNQTLSVDLMVVNMTDFDVILGMDWLAENRASIDCHQKEVKFSPPIGPTFKFKGTNTGVTPKVVSMMKAKKLVQQGGWAILACAVDVRGKEETLVNVPIVNEFP
DVFPDDLPGIPPSRAVDFVIELEPGTGPISKAPYRMAPAELKELKAQLQDLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKMTIKNKYPLPRIEDLFDQLRE
ATVFSKIDLRSGYHQIRINEKDVPKTAFRTRYGHYEFVVMSFGLTNAPAVFMELMNRVFKECLDMFVIVFIDDILIYSRTDIEHEEHLRKVLTTLREHKLYAKFSKCEFW
LRQVSFLGHVVSKDGISVDPTKVEAITKWERPTTVTEIRSFLGLAGYYRRFVQDFAKISSPLTKLTKKGVPFRWDDACEASFQNLKERLVTAPVLIVPESSEGYVIYSDA
SMKGLGCVLMQHGKVVAYASRQLKEYEKNYPTHDLELAAVVFALKIWRHYLYGEKTQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDVDIQYHPGKANVVADALSRKT
VHSSALITREVRVQREFERANIAVATEGVIAQLARLTVQPTLRQRIIVSQQEDPNLQKVLGQLDESPVDGFSKSSDEGLLYQGRLCVPAIEDLRKEILMEAHNSPFAMHP
GGKGKAPMYG