; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035756 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035756
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:29710872..29720575
RNA-Seq ExpressionLag0035756
SyntenyLag0035756
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNY13707.1 putative copia-type protein, partial [Trifolium pratense]1.3e-20745.52Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WV+ FLTAV+LINR+P   L+ ++P+ KL+ + P YS +RV GC+CFP L+     NKFS +TYPC+F+GYS  HKGYRCLDP   R+YISRHVV
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        FDE  FPF+    +        LE+++F  +   D  F  +     S ++SSQ                    ++TLE  KD   +Q             
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL
         P + NT +    +   G  A    N            E P +  +     G          D+ +T +++    P+            Q P+       
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL

Query:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNI--RRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEE
                        VE  L+ T T +T E           + +  P   P   ++  ++     KQ+         + PK  K AL  P W  AM+EE
Subjt:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNI--RRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEE

Query:  MKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLK
        + AL  N+TWELVPRP++ N+VGSKW+++ K+KE+G +DR+KARLVA+ +TQ+ G+D++ET+SPVVK TTIR +++++ S +W +RQLDVKNAFLHG +K
Subjt:  MKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLK

Query:  EQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFA
        E V+MEQPPGF++    NHVC L+KS+YGL QAPRAWFDRL+  LLH GF CS +D SLFILK   V  + LIYVDDI++ GNN   I  L+  L +EFA
Subjt:  EQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFA

Query:  LKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPT
        +KDLG LHYFLG+E      G+ L+Q KY  DLL K+ M G  +I TP     +   +D    DA  +RSIVG+LQYLT TRPDI  AVN+ CQ    PT
Subjt:  LKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPT

Query:  VKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRD
        + D KAVKRILRY++GT ++G+    ++  +LYGF DADW GC +TRR+TT +C+FLG+NCISWSSKKQPTVARSS+EAEYR+M   TAELTWI +LL+D
Subjt:  VKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRD

Query:  IGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR
        I + L   PQL+CDNISAL+MS+N VFHARTKHIE+DYHFVREKVA+G L+T+Y P+  Q+ D+FTKPL K  F   R KLGV  +   SLR
Subjt:  IGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.8e-21046.12Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        F+E CFP++  +    + ++    VS      ++    ++  H  +S   ++ N     ++           N+ ++E    A  N  +      +    
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL-----S
         P    T+++         VA +++++ T     +  TE PT  ++  S   +    T+      E+T     ++        K+   DL FP       
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL-----S

Query:  HTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQ
        H    L   + S      + D SK + V+I   P G  T N+ T   HM+TR KL  +PSL               Q  ++ A   + + PK Y+ AL  
Subjt:  HTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQ

Query:  PHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDV
        PHW +AM+EE+KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQLDV
Subjt:  PHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDV

Query:  KNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQ
        KNAFLHG LKE+V+MEQPPGFI+  L NHVCKL +S+YGL QAPRAWFDRL+N                                    + GN+ + I  
Subjt:  KNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQ

Query:  LIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVN
        LI  LS EF+LKDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV AVN
Subjt:  LIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVN

Query:  KVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAE
        K CQ    PT  D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ AE
Subjt:  KVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAE

Query:  LTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFS
        +TW+ FLLRDIGI L   PQL CDN+SAL+M++N VFHAR+KHIE+DYHFVREKVA G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV      S
Subjt:  LTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFS

Query:  LR
        LR
Subjt:  LR

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.8e-21647.36Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
          +  F       N  N  VTT   +                                        DVA+G ++          HN              
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL
                                           S+T   T    ++S                      IA+   + +A D+                
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL

Query:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQPHWKRAMEEEM
               +PD SK + V+I   P G  T N+ T   HM+TR KL  +PSL               Q  ++ A   + + PK Y+  L  PHW +AM+EE+
Subjt:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQPHWKRAMEEEM

Query:  KALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKE
        KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQLDVKNAFLHG LKE
Subjt:  KALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKE

Query:  QVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL
        +V+MEQPPGFI+  LSNHVCKL +S+YGL QAPRAWFDRL+  LLH+GF C  +D SLFIL+    +++ LIYVDDII+TGN+ + I  LI  LS EF+L
Subjt:  QVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL

Query:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTV
        KDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV AVNK CQ    PT 
Subjt:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTV

Query:  KDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRDI
         D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ AE+TW+ FLLRDI
Subjt:  KDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRDI

Query:  GIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR
        GI L   PQL CDN+SAL+M +N VFHAR+KHIE+DYHFVREKVA G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV      SLR
Subjt:  GIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR

RVX04530.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.2e-22548.01Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        F+E CFP++  +    + ++      + V    SD +    +   ++ +     A +    K                     T  Q  + +++     +
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNY--TEMDFN-LSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL---
            +NT+ +  +      V  TE +    ++ D N +  TE PT  ++  S   +    T+      E+T     ++        K+   DL FP    
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNY--TEMDFN-LSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL---

Query:  --SHTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVAL
           H    L   + S      + D SK + V+I   P    T N+ T   HM+TR KL  +PSL  ++      IR      S ++E     PK Y+  L
Subjt:  --SHTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVAL

Query:  NQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQL
          PHW + M+EE+KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQL
Subjt:  NQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQL

Query:  DVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHI
        DVKNAFLHG LKE+V+MEQPPGFI+  L NHVCKL +S+YGL QAPRAWFDRL+  LLH+GF C  +D SLFIL+    +++ LIYVDDII+TGN+ + I
Subjt:  DVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHI

Query:  QQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA
          LI  LS EF+LKDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV A
Subjt:  QQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA

Query:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT
        VNK CQ    PT  D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ 
Subjt:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT

Query:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT
        AE+TW+ FLLRDIGI L   PQL CDN+SAL+M++N VFHAR+KHIE+DYHFVREK A G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV     
Subjt:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT

Query:  FSLR
         SLR
Subjt:  FSLR

RWR75576.1 Zinc finger, CCCH-type [Cinnamomum micranthum f. kanehirae]2.7e-19744.47Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL++WVDAF TAV+LINR  ++ ++  SP+Q+L  K P Y ++RVFGC+CFPYL++   NNKF+ R+ PC+F+GYS   KGYRC  P   RIY SRHVV
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEK--HISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQ
        FDE+ FPF+    + + TN+       S++V                 S++TSS +        +I    AV   +     +  A H     +   ++P 
Subjt:  FDEYCFPFEK--HISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQ

Query:  IKNPVEENTQMNHQSDIDCGDVAETEDN------NYTEMDFNLSNTEDPTS----PSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSD
        +     + +    Q+   C    ++  +       +T     LS  + P++    P++  S   +      +     E T  +         + DK   D
Subjt:  IKNPVEENTQMNHQSDIDCGDVAETEDN------NYTEMDFNLSNTEDPTS----PSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSD

Query:  LQFPLSHTTSPLNAINLSSLP-DISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALN
        +  P+ H  +  + +N S+LP D S   H    +S    P  N     H M+TR K  +   + P       N R     ++  AE +   PK+ K AL 
Subjt:  LQFPLSHTTSPLNAINLSSLP-DISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALN

Query:  QPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLD
           W  AM+EE+ AL +N TW LVPR S+ N+VG KW++KTK + +G ++R KARLVA+ + Q+EG+D+ ET+SPVVK  TIR++L+IA + +W +RQLD
Subjt:  QPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLD

Query:  VKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQ
        VKNAFL+G+L E V+MEQPP F H    +HVC L+K++YGL QAPRAWFDR +  LL IGF CS +D SLFI       +  L+YVDDIILTGNN   + 
Subjt:  VKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQ

Query:  QLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA
         ++  L +EFA+KDLG +HYFLGI+V+    G+ L+Q KYA +LL K+ M     I TPM    Q     +A   D + Y+S+VG L YLT TRPDI  +
Subjt:  QLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA

Query:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT
        VN VCQ + NPT   ++ VKRILRYV+GTI YGI L  +SS  LY F DADW GC LTRR+TT +C +LGSNCISWSSK+QPTVARSS+EAEYRA+ S  
Subjt:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT

Query:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT
        AE+TW+ ++LRDIG+ L+  P L+CDNISAL+M++N VFHARTKHIE+DYHFVREKVALG L+T+++PS  Q+ DI TKPL++  F+ LR KLGV+  + 
Subjt:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT

Query:  FSLR
         SL+
Subjt:  FSLR

TrEMBL top hitse value%identityAlignment
A0A2N9EWB3 Integrase catalytic domain-containing protein2.1e-20845.42Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +P   W++AF+TAV+LINR+P+  L++D+PF KL+   P Y++++VFGCRCFPYL++    NKF  ++YPCIF+GYS  HKGYRCL P   R+Y+SRHVV
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDE----YCFPFEKHISNGTNQRVTTLE--VSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLD
        FDE    Y  P     S  T+  ++T    V+ F+   + ++  T  +         + +   +                    P  +++H         
Subjt:  FDE----YCFPFEKHISNGTNQRVTTLE--VSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLD

Query:  DIPQIKNPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVN----ATSSERDYAETTEENIASRPDTITAQDKNYSDLQ
         IP   N      Q +                       +  +T   + PS  +S    E N    A S   D        +AS P      D + S L 
Subjt:  DIPQIKNPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVN----ATSSERDYAETTEENIASRPDTITAQDKNYSDLQ

Query:  FPLSHTTSPLNAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPH
           S T  P      S LP  + Y+ + +   P      +++T+ H MVTR K                      Q H  +     T PK+ K AL   H
Subjt:  FPLSHTTSPLNAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPH

Query:  WKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKN
        W++AM +E+ AL +N+TW LVPR +D NIVGS+W+FKTK K +G ++R+KARLVA+ Y Q+EGLD+ ET+SPV+K TTIRL+LS+A +  W LRQLDVKN
Subjt:  WKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKN

Query:  AFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLI
        AFLHG+LKE VYMEQPPGF       HVC L K+IYGL QAPRAWFDR ++ LL IGF CS +D SLF+ +     ++ L+YVDDII+T ++ SH+  LI
Subjt:  AFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLI

Query:  HILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQP-TDAKYYRSIVGSLQYLTLTRPDIVQAVNK
          LS EFA+KDLG L+YFLG++V     G+ LSQ KYA+++LAK++M+    I TP+A     L  +  P  +A  YRSIVG+LQYLTLTRPD+  AVN 
Subjt:  HILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQP-TDAKYYRSIVGSLQYLTLTRPDIVQAVNK

Query:  VCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAEL
        VCQ +  P+   ++AVKRILRY+QGT+DYGI L  HSS  LYGF DADW GC  TRR+TT +CI+LG+NCISW+SKKQ TV+RSS+EAEYRAM SA AEL
Subjt:  VCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAEL

Query:  TWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSL
        TW+ +LLRD+G+  +++P L+CDN SAL+M++N VFHARTKHIE+DYHFVREKVA G L T+Y+PS+ Q+ D+FTK ++K VF   R KLGV      SL
Subjt:  TWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSL

Query:  RNDRLAL
        R    A+
Subjt:  RNDRLAL

A0A2N9I9N7 CCHC-type domain-containing protein4.1e-21246.15Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +P   W++AF+TAV+LINR+P+  +++ +PF KL+   P Y++++VFGCRCFPYL++    NKF  ++YPCIF+GYS  HKGYRCL P   R+Y+SRHVV
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPF----EKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTK--------DATHNQI
        FDE  FP+        S  TN + +T        +  S  N       +   ATSS +A    +   I        + + + P            T N I
Subjt:  FDEYCFPF----EKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTK--------DATHNQI

Query:  EELNLDDIPQIKNPVEENTQMNHQSDI----DCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDK
            L+  P+       +T M     +     C  +A       T   F    T  P +P+            +  E  + +T +  I   P + T    
Subjt:  EELNLDDIPQIKNPVEENTQMNHQSDI----DCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDK

Query:  NYSDLQFPLSHTTSPLNAINLSSLPDISK--YLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQH--HSYVAEHLNTGPK
            L  P   + +P+  ++ S +P  S+    ++++P++P    T  + T+ H M+TR                    R+ ++H  H  +       PK
Subjt:  NYSDLQFPLSHTTSPLNAINLSSLPDISK--YLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQH--HSYVAEHLNTGPK

Query:  NYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSAS
        + K AL  PHW +AM +E+ AL +N TW LVPR SD NIVGS+W+FKTK K +G ++R+KARLVA+ Y Q+EGLD+ ET+SPVVK TTIRL+LS+AT+  
Subjt:  NYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSAS

Query:  WPLRQLDVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTG
        WPLRQLDVKNAFLHG+LKE VYMEQPPGF  SS   HVC+L K+IYGL QAPRAWFDR ++ LLH+GF CS +D SLF+ +    +++ L+YVDDII+T 
Subjt:  WPLRQLDVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTG

Query:  NNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTR
        N++S +  LI  LS EF++KDLG LHYFLGI+V     GI LSQ KYAR++LAK++M+    I TP+A   +         DA  YRSIVG+LQYLTLTR
Subjt:  NNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTR

Query:  PDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYR
        PD+  AVN VCQ +  P    ++AVKRILRY+QGT+ YGI +  HSS  LYGF DADW GC  TRR+TT +CI+LG+NCISW+SKKQ TV+RSS+EAEYR
Subjt:  PDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYR

Query:  AMTSATAELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLG
        AM SA AELTW+ +LL D+GI L   P L+CDN SAL+M++N VFHARTKHIE+D+HFVREKVA G L T+Y+PS+ Q+ D+FTK ++K VF   R KLG
Subjt:  AMTSATAELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLG

Query:  VRSTTTFSLR
        V      SLR
Subjt:  VRSTTTFSLR

A0A438C9J9 Retrovirus-related Pol polyprotein from transposon RE11.3e-21046.12Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        F+E CFP++  +    + ++    VS      ++    ++  H  +S   ++ N     ++           N+ ++E    A  N  +      +    
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL-----S
         P    T+++         VA +++++ T     +  TE PT  ++  S   +    T+      E+T     ++        K+   DL FP       
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL-----S

Query:  HTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQ
        H    L   + S      + D SK + V+I   P G  T N+ T   HM+TR KL  +PSL               Q  ++ A   + + PK Y+ AL  
Subjt:  HTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQ

Query:  PHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDV
        PHW +AM+EE+KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQLDV
Subjt:  PHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDV

Query:  KNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQ
        KNAFLHG LKE+V+MEQPPGFI+  L NHVCKL +S+YGL QAPRAWFDRL+N                                    + GN+ + I  
Subjt:  KNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQ

Query:  LIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVN
        LI  LS EF+LKDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV AVN
Subjt:  LIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVN

Query:  KVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAE
        K CQ    PT  D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ AE
Subjt:  KVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAE

Query:  LTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFS
        +TW+ FLLRDIGI L   PQL CDN+SAL+M++N VFHAR+KHIE+DYHFVREKVA G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV      S
Subjt:  LTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFS

Query:  LR
        LR
Subjt:  LR

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-21647.36Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
          +  F       N  N  VTT   +                                        DVA+G ++          HN              
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL
                                           S+T   T    ++S                      IA+   + +A D+                
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL

Query:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQPHWKRAMEEEM
               +PD SK + V+I   P G  T N+ T   HM+TR KL  +PSL               Q  ++ A   + + PK Y+  L  PHW +AM+EE+
Subjt:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLN-TGPKNYKVALNQPHWKRAMEEEM

Query:  KALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKE
        KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQLDVKNAFLHG LKE
Subjt:  KALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKE

Query:  QVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL
        +V+MEQPPGFI+  LSNHVCKL +S+YGL QAPRAWFDRL+  LLH+GF C  +D SLFIL+    +++ LIYVDDII+TGN+ + I  LI  LS EF+L
Subjt:  QVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL

Query:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTV
        KDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV AVNK CQ    PT 
Subjt:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTV

Query:  KDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRDI
         D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ AE+TW+ FLLRDI
Subjt:  KDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLRDI

Query:  GIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR
        GI L   PQL CDN+SAL+M +N VFHAR+KHIE+DYHFVREKVA G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV      SLR
Subjt:  GIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTTFSLR

A0A438J6E1 Retrovirus-related Pol polyprotein from transposon RE12.5e-22548.01Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +PL  WVDAFLTAV+LINR+P+  L ++SPF  L+ + P Y ++R+FGC+CFPYL++    NKFS +TYPC+F+GYS  HKGYRCL P+  R+YISRHV+
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        F+E CFP++  +    + ++      + V    SD +    +   ++ +     A +    K                     T  Q  + +++     +
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNY--TEMDFN-LSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL---
            +NT+ +  +      V  TE +    ++ D N +  TE PT  ++  S   +    T+      E+T     ++        K+   DL FP    
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNY--TEMDFN-LSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKN-YSDLQFPL---

Query:  --SHTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVAL
           H    L   + S      + D SK + V+I   P    T N+ T   HM+TR KL  +PSL  ++      IR      S ++E     PK Y+  L
Subjt:  --SHTTSPLNAINLS-----SLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVAL

Query:  NQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQL
          PHW + M+EE+KAL +N+TW+LVPRP   NIVGSKW+FKTK KE+G +DRYKARLVA+ ++QI GLD+ ET+SPV+K TTIR+I S+A +  W +RQL
Subjt:  NQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQL

Query:  DVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHI
        DVKNAFLHG LKE+V+MEQPPGFI+  L NHVCKL +S+YGL QAPRAWFDRL+  LLH+GF C  +D SLFIL+    +++ LIYVDDII+TGN+ + I
Subjt:  DVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHI

Query:  QQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA
          LI  LS EF+LKDLGSLHYFLG+EVK    G+ +SQ KY RDLL  + M   + INTPMA  +     D QP D   YR +VGSLQYLT TRPDIV A
Subjt:  QQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQA

Query:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT
        VNK CQ    PT  D +AVKRILRY++GT+++GI  +K SS  L GFCDADW GC  TRR+T+ +CIFLG+NCISWSSK+QPTV+RSS+EAEYR++ S+ 
Subjt:  VNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSAT

Query:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT
        AE+TW+ FLLRDIGI L   PQL CDN+SAL+M++N VFHAR+KHIE+DYHFVREK A G+LIT+++PS  Q+ DIFTK L K  F+  R KLGV     
Subjt:  AELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRSTTT

Query:  FSLR
         SLR
Subjt:  FSLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-10530.14Show/hide
Query:  YWVDAFLTAVFLINRMPTKAL--SLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVVFD
        +W +A LTA +LINR+P++AL  S  +P++  ++K+P   ++RVFG   + ++KN     KF  +++  IFVGY  E  G++  D  N +  ++R VV D
Subjt:  YWVDAFLTAVFLINRMPTKAL--SLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVVFD

Query:  EYCFPFEKHISNGTNQRVTTLEVSDFVGIQESD-RNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIKN
        E         +N  N R    E       +ES+ +NF               ++++++Q ++                         E    D+I  +K 
Subjt:  EYCFPFEKHISNGTNQRVTTLEVSDFVGIQESD-RNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIKN

Query:  PVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPLN
                         D  E+E+ N+      +  TE P           NE    S E D  +  +++  S    +    K   D     S  +   N
Subjt:  PVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPLN

Query:  AINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMKA
            S   +  K + ++ P    G    N  + +  + T+ +++ N        +E N++ +   +   +   +       +   ++  W+ A+  E+ A
Subjt:  AINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMKA

Query:  LAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKEQV
           N TW +  RP + NIV S+W+F  KY E G   RYKARLVA+ +TQ   +DYEET++PV + ++ R ILS+    +  + Q+DVK AFL+G LKE++
Subjt:  LAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKEQV

Query:  YMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYV--LMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL
        YM  P G   S  S++VCKL K+IYGL QA R WF+     L    F  S+ D  ++IL    +   +  L+YVDD+++   + + +      L ++F +
Subjt:  YMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYV--LMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL

Query:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTL-TRPDIVQAVNKVCQQLLNPT
         DL  + +F+GI ++     I LSQ  Y + +L+K NM   + ++TP+ +       ++        RS++G L Y+ L TRPD+  AVN + +      
Subjt:  KDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYYRSIVGSLQYLTL-TRPDIVQAVNKVCQQLLNPT

Query:  VKDYKAVKRILRYVQGTIDYGITLYKHSS--TNLYGFCDADWGGCQLTRRNTTRFCI-FLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFL
         + ++ +KR+LRY++GTID  +   K+ +    + G+ D+DW G ++ R++TT +       N I W++K+Q +VA SS+EAEY A+  A  E  W+ FL
Subjt:  VKDYKAVKRILRYVQGTIDYGITLYKHSS--TNLYGFCDADWGGCQLTRRNTTRFCI-FLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFL

Query:  LRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV
        L  I I L N  ++Y DN   + ++ N   H R KHI++ YHF RE+V   ++  +YIP++ QL DIFTKPL  A F  LRDKLG+
Subjt:  LRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-9229.53Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +P  +W +A  TA +LINR P+  L+ + P +   +K+ +YS+++VFGCR F ++       K   ++ PCIF+GY  E  GYR  DP   ++  SR VV
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        F E     E   +   +++V    + +FV I                                     +   N T+ E T D    Q E+          
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL
                         G+V E  +    ++D  +   E PT        +G E +      +          S    + + D+    L+  LSH     
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL

Query:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMK
                                  P  N+                                                             +AM+EEM+
Subjt:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMK

Query:  ALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKEQ
        +L +N T++LV  P     +  KW+FK K   +  + RYKARLV + + Q +G+D++E +SPVVK T+IR ILS+A S    + QLDVK AFLHG+L+E+
Subjt:  ALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKEQ

Query:  VYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILK-DKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL
        +YMEQP GF  +   + VCKL KS+YGL QAPR W+ +  + +    +  + SDP ++  +  +   +I L+YVDD+++ G +   I +L   LSK F +
Subjt:  VYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILK-DKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFAL

Query:  KDLGSLHYFLGIEV--KSTHKGITLSQGKYARDLLAKSNMSGASTINTPMA----TSTQELPTDAQPTD--AKY-YRSIVGSLQY-LTLTRPDIVQAVNK
        KDLG     LG+++  + T + + LSQ KY   +L + NM  A  ++TP+A     S +  PT  +     AK  Y S VGSL Y +  TRPDI  AV  
Subjt:  KDLGSLHYFLGIEV--KSTHKGITLSQGKYARDLLAKSNMSGASTINTPMA----TSTQELPTDAQPTD--AKY-YRSIVGSLQY-LTLTRPDIVQAVNK

Query:  VCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAEL
        V + L NP  + ++AVK ILRY++GT       +  S   L G+ DAD  G    R+++T +        ISW SK Q  VA S++EAEY A T    E+
Subjt:  VCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAEL

Query:  TWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRS
         W+   L+++G+       +YCD+ SA+ +S N ++HARTKHI++ YH++RE V    L    I + +   D+ TK + +  F+  ++ +G+ S
Subjt:  TWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGVRS

P92519 Uncharacterized mitochondrial protein AtMg008101.7e-6152Show/hide
Query:  MIRLIYVDDIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYY
        M  L+YVDDI+LTG++ + +  LI  LS  F++KDLG +HYFLGI++K+   G+ LSQ KYA  +L  + M     ++TP+        + A+  D   +
Subjt:  MIRLIYVDDIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYY

Query:  RSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKK
        RSIVG+LQYLTLTRPDI  AVN VCQ++  PT+ D+  +KR+LRYV+GTI +G+ ++K+S  N+  FCD+DW GC  TRR+TT FC FLG N ISWS+K+
Subjt:  RSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKK

Query:  QPTVARSSSEAEYRAMTSATAELTW
        QPTV+RSS+E EYRA+    AELTW
Subjt:  QPTVARSSSEAEYRAMTSATAELTW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-16238.67Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +P  YW  AF  AV+LINR+PT  L L+SPFQKL+   P Y  +RVFGC C+P+L+   N +K   ++  C+F+GYSL    Y CL    +R+YISRHV 
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDAT---HNQIEELNLDDIP
        FDE CFPF  +++                 +QE  R         ES    S +     +   +        +     P+  +    ++Q+   NLD   
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDAT---HNQIEELNLDDIP

Query:  QIKNPVE-ENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHT
            P   E T              +T+   ++  + + +N   PT+ S    ++     A SS    + TT  +                      S +
Subjt:  QIKNPVE-ENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHT

Query:  TSPLNAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAME
        TSP       + P I   +H   PL+           + H M TR K  +              I+   ++   V+    + P+    AL    W+ AM 
Subjt:  TSPLNAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAME

Query:  EEMKALAENQTWELV-PRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHG
         E+ A   N TW+LV P PS   IVG +WIF  KY  +G ++RYKARLVA+ Y Q  GLDY ET+SPV+K T+IR++L +A   SWP+RQLDV NAFL G
Subjt:  EEMKALAENQTWELV-PRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHG

Query:  NLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSK
         L + VYM QPPGFI     N+VCKL+K++YGL QAPRAW+  L N+LL IGF  S SD SLF+L+    ++  L+YVDDI++TGN+ + +   +  LS+
Subjt:  NLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSK

Query:  EFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQL
         F++KD   LHYFLGIE K    G+ LSQ +Y  DLLA++NM  A  + TPMA S +  L +  + TD   YR IVGSLQYL  TRPDI  AVN++ Q +
Subjt:  EFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQL

Query:  LNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGF
          PT +  +A+KRILRY+ GT ++GI L K ++ +L+ + DADW G +    +T  + ++LG + ISWSSKKQ  V RSS+EAEYR++ + ++E+ WI  
Subjt:  LNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGF

Query:  LLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV
        LL ++GI L   P +YCDN+ A Y+  N VFH+R KHI +DYHF+R +V  G L   ++ +  QL D  TKPL++  F+    K+GV
Subjt:  LLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-16038.69Show/hide
Query:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV
        +P  YW  AF  AV+LINR+PT  L L SPFQKL+ + P Y  ++VFGC C+P+L+   N +K   ++  C F+GYSL    Y CL     R+Y SRHV 
Subjt:  MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVV

Query:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK
        FDE CFPF     + TN  V+T +       Q SD       H      T+      +L      G            P+   T  Q+   NL       
Subjt:  FDEYCFPFEKHISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIK

Query:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL
            E T  +H          +T+++N    +  + N  +P SPS    ++ + +  +     +  T   +I S P++             P S +TS  
Subjt:  NPVEENTQMNHQSDIDCGDVAETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPL

Query:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAE-HLNTGPKNYKVALNQPHWKRAMEEEM
               LP +       +P  P          + H M TR K               + IR+  Q +SY      N+ P+    A+    W++AM  E+
Subjt:  NAINLSSLPDISKYLHVEIPLSPTGTPTTNEATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAE-HLNTGPKNYKVALNQPHWKRAMEEEM

Query:  KALAENQTWELV-PRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLK
         A   N TW+LV P P    IVG +WIF  K+  +G ++RYKARLVA+ Y Q  GLDY ET+SPV+K T+IR++L +A   SWP+RQLDV NAFL G L 
Subjt:  KALAENQTWELV-PRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLK

Query:  EQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFA
        ++VYM QPPGF+     ++VC+L+K+IYGL QAPRAW+  L  +LL +GF  S SD SLF+L+    ++  L+YVDDI++TGN+   ++  +  LS+ F+
Subjt:  EQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFA

Query:  LKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNP
        +K+   LHYFLGIE K   +G+ LSQ +Y  DLLA++NM  A  + TPMATS +  L +  +  D   YR IVGSLQYL  TRPD+  AVN++ Q +  P
Subjt:  LKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQ-ELPTDAQPTDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNP

Query:  TVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLR
        T   + A+KR+LRY+ GT D+GI L K ++ +L+ + DADW G      +T  + ++LG + ISWSSKKQ  V RSS+EAEYR++ + ++EL WI  LL 
Subjt:  TVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARSSSEAEYRAMTSATAELTWIGFLLR

Query:  DIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV
        ++GI L + P +YCDN+ A Y+  N VFH+R KHI +DYHF+R +V  G L   ++ +  QL D  TKPL++  F+    K+GV
Subjt:  DIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.0e-10643.07Show/hide
Query:  PKNYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATS
        P  Y  A     W  AM++E+ A+    TWE+   P +   +G KW++K KY  +G ++RYKARLVA+ YTQ EG+D+ ET+SPV K T+++LIL+I+  
Subjt:  PKNYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATS

Query:  ASWPLRQLDVKNAFLHGNLKEQVYMEQPPGFI----HSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVD
         ++ L QLD+ NAFL+G+L E++YM+ PPG+      S   N VC L+KSIYGL QA R WF + +  L+  GF  S+SD + F+     + +  L+YVD
Subjt:  ASWPLRQLDVKNAFLHGNLKEQVYMEQPPGFI----HSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTCSNSDPSLFILKDKYVLMIRLIYVD

Query:  DIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATS-TQELPTDAQPTDAKYYRSIVGSL
        DII+  NN + + +L   L   F L+DLG L YFLG+E+  +  GI + Q KYA DLL ++ + G    + PM  S T    +     DAK YR ++G L
Subjt:  DIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATS-TQELPTDAQPTDAKYYRSIVGSL

Query:  QYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARS
         YL +TR DI  AVNK+ Q    P +   +AV +IL Y++GT+  G+     +   L  F DA +  C+ TRR+T  +C+FLG++ ISW SKKQ  V++S
Subjt:  QYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTVARS

Query:  SSEAEYRAMTSATAELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREK
        S+EAEYRA++ AT E+ W+    R++ +PL     L+CDN +A++++ N VFH RTKHIE D H VRE+
Subjt:  SSEAEYRAMTSATAELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREK

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.4e-1041.03Show/hide
Query:  YLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFC
        YLT+TRPD+  AVN++ Q          +AV ++L YV+GT+  G+     S   L  F D+DW  C  TRR+ T FC
Subjt:  YLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFC

ATMG00810.1 DNA/RNA polymerases superfamily protein1.2e-6252Show/hide
Query:  MIRLIYVDDIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYY
        M  L+YVDDI+LTG++ + +  LI  LS  F++KDLG +HYFLGI++K+   G+ LSQ KYA  +L  + M     ++TP+        + A+  D   +
Subjt:  MIRLIYVDDIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQPTDAKYY

Query:  RSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKK
        RSIVG+LQYLTLTRPDI  AVN VCQ++  PT+ D+  +KR+LRYV+GTI +G+ ++K+S  N+  FCD+DW GC  TRR+TT FC FLG N ISWS+K+
Subjt:  RSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKK

Query:  QPTVARSSSEAEYRAMTSATAELTW
        QPTV+RSS+E EYRA+    AELTW
Subjt:  QPTVARSSSEAEYRAMTSATAELTW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.8e-2444.6Show/hide
Query:  MVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVD
        M+TR K  +N  L+P+ S  I                +   PK+   AL  P W +AM+EE+ AL+ N+TW LVP P + NI+G KW+FKTK   +G +D
Subjt:  MVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVD

Query:  RYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIA
        R KARLVA+ + Q EG+ + ETYSPVV+  TIR IL++A
Subjt:  RYKARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTCAAATATTGGGTAGATGCATTTCTCACCGCTGTTTTTCTTATAAATCGAATGCCCACAAAGGCATTGAGCTTAGACTCACCATTTCAAAAACTTTATAGCAA
ACAACCAACCTATTCTAATATTAGAGTGTTTGGGTGTAGATGTTTTCCTTACCTTAAGAATCTCCCCAACAACAACAAATTCTCCAAAAGAACCTACCCTTGCATATTTG
TCGGCTATAGCTTAGAACACAAAGGATATCGTTGTCTTGATCCAACAAACAATCGCATTTATATCTCTAGACATGTCGTTTTTGATGAATATTGTTTTCCTTTTGAAAAA
CACATCTCTAACGGCACCAACCAGCGTGTCACTACTCTTGAAGTTTCTGATTTTGTAGGTATTCAAGAGAGTGATCGAAACTTCACAAGGAGGCAGCATATTGAAGAGTC
TGATGCAACCAGTAGTCAAAATGCTCAAGAAATGTTGCAGCAAAAATATATTGGTGGTGATGTAGCTGTAGGTGGAAATGAAACCACGCTTGAGCCCACTAAAGATGCTA
CACACAATCAAATAGAGGAGTTAAATCTTGACGATATCCCTCAAATAAAAAATCCAGTGGAGGAAAACACACAAATGAACCACCAAAGCGACATTGATTGTGGTGATGTA
GCTGAAACTGAAGACAACAATTACACAGAAATGGACTTCAACTTGTCAAATACTGAAGATCCCACTTCTCCAAGTGTTGAAATGTCATCCAAAGGAAATGAAGTAAACGC
CACTAGTTCTGAGAGAGACTATGCTGAGACCACAGAAGAAAATATTGCAAGCAGACCAGACACTATCACCGCTCAAGACAAAAATTATTCAGATTTGCAATTCCCTCTCT
CACACACTACTTCTCCATTGAATGCAATTAATCTCTCATCTCTTCCAGACATTAGTAAATACTTACATGTTGAAATACCTTTGTCTCCTACAGGAACACCAACCACAAAT
GAAGCAACAAGCAAACATCACATGGTCACTAGGCATAAATTAGCACTAAATCCCTCTCTTGACCCAAGACTCTCTCAAGAAATCAACAACATAAGGAGAGGAAAACAACA
CCACTCCTATGTCGCTGAACATCTTAACACCGGGCCCAAAAATTATAAAGTAGCCTTAAATCAACCACACTGGAAAAGAGCAATGGAAGAAGAAATGAAAGCTCTGGCAG
AAAATCAAACATGGGAATTGGTACCTAGACCATCTGATTGCAATATTGTTGGGTCTAAGTGGATATTCAAGACCAAGTACAAAGAAAATGGCATCGTGGATCGCTATAAA
GCTCGTTTAGTGGCCCAACGATACACTCAAATAGAAGGGCTGGATTATGAGGAGACATACAGCCCTGTAGTAAAACGTACTACAATCAGGTTAATTCTTTCCATTGCCAC
TAGTGCAAGCTGGCCATTGAGGCAACTAGATGTCAAAAATGCTTTTCTACATGGGAATTTGAAGGAACAAGTTTATATGGAACAACCACCAGGTTTTATCCACTCATCTC
TATCTAACCATGTGTGCAAACTTCAAAAATCTATATATGGTTTGATACAAGCTCCACGAGCCTGGTTTGATAGACTTGCTAATCATCTTTTACATATTGGTTTCACATGC
AGTAATTCTGACCCTTCTTTATTTATATTAAAAGACAAGTATGTTTTAATGATTAGGCTAATATACGTTGATGATATCATACTTACAGGAAACAATGCCTCTCACATCCA
ACAACTCATACACATCTTAAGTAAGGAGTTTGCTCTAAAGGACTTGGGCTCTTTACACTACTTCTTGGGGATAGAAGTAAAATCAACACACAAGGGCATCACTTTATCTC
AAGGGAAATATGCAAGAGATCTCCTTGCAAAATCTAATATGTCAGGAGCTTCAACTATTAACACACCCATGGCCACTTCAACTCAAGAATTACCAACTGATGCTCAACCC
ACAGATGCCAAATATTACAGAAGTATTGTTGGCTCTTTACAATACCTTACCTTAACTCGACCCGACATCGTTCAAGCTGTGAATAAAGTCTGCCAACAACTTTTGAACCC
CACCGTCAAGGACTATAAAGCTGTGAAGAGGATACTCCGCTATGTCCAGGGAACAATTGACTATGGTATAACCCTTTATAAACACAGTTCTACTAATTTATATGGTTTCT
GTGATGCAGATTGGGGTGGATGTCAACTCACAAGGCGCAACACCACTAGATTTTGTATTTTCCTTGGATCAAATTGCATTTCTTGGTCATCAAAGAAGCAACCCACAGTT
GCTCGCTCCAGCTCTGAAGCAGAATATCGAGCCATGACCAGTGCCACAGCTGAATTAACATGGATCGGTTTCCTTCTTCGTGACATTGGCATCCCTCTTTACAACACTCC
ACAACTTTATTGTGACAACATAAGTGCCCTCTACATGTCCATAAATCTAGTCTTCCATGCTAGAACGAAGCACATTGAAATGGACTACCATTTCGTTCGAGAAAAGGTAG
CCCTTGGAATGCTTATCACCAAATACATTCCCTCCAAACAACAACTAGTCGATATATTCACAAAGCCCCTCACAAAGGCTGTCTTTAAGGGACTACGAGACAAACTTGGC
GTTCGTTCTACCACTACCTTCAGTTTGAGAAATGATAGGCTTGCTTTGTCGCGCCTCACGTCAGATGTTGATGTCGATGTCGATGTCGATGTTGAGTTCCAACTGTCGAT
GTTGATGGGGAAAGACGTGGCCTGCAAGACAGTAAACCTGCACACTGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAACGGAAGGAGGAGAGCAAGAG
AGCAAGGGAAGAGAGTAGAGAATAAAGTTCGAGATCCCTTCTTCAGTGGCGAAGAGGGGTTTAAATACTTGCTCATGCTCCTAGGATTTTTAGGAATTCAGAGGCGTTTC
GGGATGAACCAGGCAGAATCGGGGCGGCTAGAGGCAGCAGGGACTGAACGGAGACGAACGGGCTCTGTAACGCCCCTTGCCGCAAAACTCACCCCCCTCAGCCGTTCTCG
GGCCGCCGCCCTCTGCAGTTGCCGCTGCCTGTCGTGTAGTCACGCCGTCGCCGGTTCGTGCAGCGCCGCCGTCGCCGTACTCGTCCGCATATCTCTCTCACTCTGCGCGT
CTCTCTCTCTCTCTCTCGGTAGATCTCCCTCTCTCCCGTCGGCCTCTCTCTCCCTCCGTCGCAGCTCGAGTGTCGCCGCCACTACCCTTGCCGCCGTCGCTCAGATCGTG
CCGCCGCCAGCCCCGGTCTCTCTGCTTCTCCGCGTGTTCTTCGTCGTGGGTGTTAGATCCGTGGCTCTCTCTGTCTGTCTTTTCGTTTTTAGCCAAAGACATCTGTGGAT
CTCGCGTGTCAAGCGATTCGGAGCCCTGTCGTTCCCTTTTCAGTCGATTCCGCCTCTGTCCAGCAACGTCTCGACTGTTGTTGGTGTCGTTTGGCGTTTTCGGCTCGACC
TCGGCTTGAGCCGAGGCCGAACCATCATGCTCGGCCTCGACCCAAGGCCGATGGCCCGACCCTTTGGTTCGGTCTTCCTTTGGAAATTTCAACTACAATCGATTTTAAAG
GCTCACAAGTTGTTTGGATTTATCGATGGCTGCATTGATTGCCCTCTACAAACGATTCCACCATCCAACATATCATTGATGGAGACCTCAACCGAAGCACCTCCTGCTCC
AATATCTTCTCAGATTAATCCACTATATGAGGATTTGGTTGCCAAGGGTCAAGCTCTAATGACACTGATCAATGCCACTTTGTCGCTGGAAGCCTTAACCTATGTTGTTG
GTTGTGCATCATGCAAACAAGCTTGGGAGGTCATTGAAAAGCATTACTCTTCGAGCTCAAGTACCAACATTTTCAATCTGAAGTTAGATCTTCAATCAATAACCAAGAAG
CCAAAAGCAACCGCCCGTCGCCGAAGAATTTTTCGACTATCGTCGCCACTACCAACCTCCGTCGACCGACCGCCACCGTCGACCTGTAACTACCGGCTGCCTTTGACCTC
CACTAATCGGCCGTCGTCGCCAACCTTAGACTATCGGCCACGTCGACCTCCACCGACCGACCTCTACCGTCGACCTCTGACTACCGACCTTCGCCAGCTAACTGCCACCG
TCGACCTAGAACAACTGTCAACCTCCGACTACCGGACGCTGTCGACCTTCACCAATCGCCGTCGACCCCCGCTGACTAGCCTCTGCCATCTGACCGCTACCGTCGACCTC
AGAATATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTCAAATATTGGGTAGATGCATTTCTCACCGCTGTTTTTCTTATAAATCGAATGCCCACAAAGGCATTGAGCTTAGACTCACCATTTCAAAAACTTTATAGCAA
ACAACCAACCTATTCTAATATTAGAGTGTTTGGGTGTAGATGTTTTCCTTACCTTAAGAATCTCCCCAACAACAACAAATTCTCCAAAAGAACCTACCCTTGCATATTTG
TCGGCTATAGCTTAGAACACAAAGGATATCGTTGTCTTGATCCAACAAACAATCGCATTTATATCTCTAGACATGTCGTTTTTGATGAATATTGTTTTCCTTTTGAAAAA
CACATCTCTAACGGCACCAACCAGCGTGTCACTACTCTTGAAGTTTCTGATTTTGTAGGTATTCAAGAGAGTGATCGAAACTTCACAAGGAGGCAGCATATTGAAGAGTC
TGATGCAACCAGTAGTCAAAATGCTCAAGAAATGTTGCAGCAAAAATATATTGGTGGTGATGTAGCTGTAGGTGGAAATGAAACCACGCTTGAGCCCACTAAAGATGCTA
CACACAATCAAATAGAGGAGTTAAATCTTGACGATATCCCTCAAATAAAAAATCCAGTGGAGGAAAACACACAAATGAACCACCAAAGCGACATTGATTGTGGTGATGTA
GCTGAAACTGAAGACAACAATTACACAGAAATGGACTTCAACTTGTCAAATACTGAAGATCCCACTTCTCCAAGTGTTGAAATGTCATCCAAAGGAAATGAAGTAAACGC
CACTAGTTCTGAGAGAGACTATGCTGAGACCACAGAAGAAAATATTGCAAGCAGACCAGACACTATCACCGCTCAAGACAAAAATTATTCAGATTTGCAATTCCCTCTCT
CACACACTACTTCTCCATTGAATGCAATTAATCTCTCATCTCTTCCAGACATTAGTAAATACTTACATGTTGAAATACCTTTGTCTCCTACAGGAACACCAACCACAAAT
GAAGCAACAAGCAAACATCACATGGTCACTAGGCATAAATTAGCACTAAATCCCTCTCTTGACCCAAGACTCTCTCAAGAAATCAACAACATAAGGAGAGGAAAACAACA
CCACTCCTATGTCGCTGAACATCTTAACACCGGGCCCAAAAATTATAAAGTAGCCTTAAATCAACCACACTGGAAAAGAGCAATGGAAGAAGAAATGAAAGCTCTGGCAG
AAAATCAAACATGGGAATTGGTACCTAGACCATCTGATTGCAATATTGTTGGGTCTAAGTGGATATTCAAGACCAAGTACAAAGAAAATGGCATCGTGGATCGCTATAAA
GCTCGTTTAGTGGCCCAACGATACACTCAAATAGAAGGGCTGGATTATGAGGAGACATACAGCCCTGTAGTAAAACGTACTACAATCAGGTTAATTCTTTCCATTGCCAC
TAGTGCAAGCTGGCCATTGAGGCAACTAGATGTCAAAAATGCTTTTCTACATGGGAATTTGAAGGAACAAGTTTATATGGAACAACCACCAGGTTTTATCCACTCATCTC
TATCTAACCATGTGTGCAAACTTCAAAAATCTATATATGGTTTGATACAAGCTCCACGAGCCTGGTTTGATAGACTTGCTAATCATCTTTTACATATTGGTTTCACATGC
AGTAATTCTGACCCTTCTTTATTTATATTAAAAGACAAGTATGTTTTAATGATTAGGCTAATATACGTTGATGATATCATACTTACAGGAAACAATGCCTCTCACATCCA
ACAACTCATACACATCTTAAGTAAGGAGTTTGCTCTAAAGGACTTGGGCTCTTTACACTACTTCTTGGGGATAGAAGTAAAATCAACACACAAGGGCATCACTTTATCTC
AAGGGAAATATGCAAGAGATCTCCTTGCAAAATCTAATATGTCAGGAGCTTCAACTATTAACACACCCATGGCCACTTCAACTCAAGAATTACCAACTGATGCTCAACCC
ACAGATGCCAAATATTACAGAAGTATTGTTGGCTCTTTACAATACCTTACCTTAACTCGACCCGACATCGTTCAAGCTGTGAATAAAGTCTGCCAACAACTTTTGAACCC
CACCGTCAAGGACTATAAAGCTGTGAAGAGGATACTCCGCTATGTCCAGGGAACAATTGACTATGGTATAACCCTTTATAAACACAGTTCTACTAATTTATATGGTTTCT
GTGATGCAGATTGGGGTGGATGTCAACTCACAAGGCGCAACACCACTAGATTTTGTATTTTCCTTGGATCAAATTGCATTTCTTGGTCATCAAAGAAGCAACCCACAGTT
GCTCGCTCCAGCTCTGAAGCAGAATATCGAGCCATGACCAGTGCCACAGCTGAATTAACATGGATCGGTTTCCTTCTTCGTGACATTGGCATCCCTCTTTACAACACTCC
ACAACTTTATTGTGACAACATAAGTGCCCTCTACATGTCCATAAATCTAGTCTTCCATGCTAGAACGAAGCACATTGAAATGGACTACCATTTCGTTCGAGAAAAGGTAG
CCCTTGGAATGCTTATCACCAAATACATTCCCTCCAAACAACAACTAGTCGATATATTCACAAAGCCCCTCACAAAGGCTGTCTTTAAGGGACTACGAGACAAACTTGGC
GTTCGTTCTACCACTACCTTCAGTTTGAGAAATGATAGGCTTGCTTTGTCGCGCCTCACGTCAGATGTTGATGTCGATGTCGATGTCGATGTTGAGTTCCAACTGTCGAT
GTTGATGGGGAAAGACGTGGCCTGCAAGACAGTAAACCTGCACACTGGTGTGGTGCTTGCCACACCGCCTCCGATGCTTAAGTCAGAAAACGGAAGGAGGAGAGCAAGAG
AGCAAGGGAAGAGAGTAGAGAATAAAGTTCGAGATCCCTTCTTCAGTGGCGAAGAGGGGTTTAAATACTTGCTCATGCTCCTAGGATTTTTAGGAATTCAGAGGCGTTTC
GGGATGAACCAGGCAGAATCGGGGCGGCTAGAGGCAGCAGGGACTGAACGGAGACGAACGGGCTCTGTAACGCCCCTTGCCGCAAAACTCACCCCCCTCAGCCGTTCTCG
GGCCGCCGCCCTCTGCAGTTGCCGCTGCCTGTCGTGTAGTCACGCCGTCGCCGGTTCGTGCAGCGCCGCCGTCGCCGTACTCGTCCGCATATCTCTCTCACTCTGCGCGT
CTCTCTCTCTCTCTCTCGGTAGATCTCCCTCTCTCCCGTCGGCCTCTCTCTCCCTCCGTCGCAGCTCGAGTGTCGCCGCCACTACCCTTGCCGCCGTCGCTCAGATCGTG
CCGCCGCCAGCCCCGGTCTCTCTGCTTCTCCGCGTGTTCTTCGTCGTGGGTGTTAGATCCGTGGCTCTCTCTGTCTGTCTTTTCGTTTTTAGCCAAAGACATCTGTGGAT
CTCGCGTGTCAAGCGATTCGGAGCCCTGTCGTTCCCTTTTCAGTCGATTCCGCCTCTGTCCAGCAACGTCTCGACTGTTGTTGGTGTCGTTTGGCGTTTTCGGCTCGACC
TCGGCTTGAGCCGAGGCCGAACCATCATGCTCGGCCTCGACCCAAGGCCGATGGCCCGACCCTTTGGTTCGGTCTTCCTTTGGAAATTTCAACTACAATCGATTTTAAAG
GCTCACAAGTTGTTTGGATTTATCGATGGCTGCATTGATTGCCCTCTACAAACGATTCCACCATCCAACATATCATTGATGGAGACCTCAACCGAAGCACCTCCTGCTCC
AATATCTTCTCAGATTAATCCACTATATGAGGATTTGGTTGCCAAGGGTCAAGCTCTAATGACACTGATCAATGCCACTTTGTCGCTGGAAGCCTTAACCTATGTTGTTG
GTTGTGCATCATGCAAACAAGCTTGGGAGGTCATTGAAAAGCATTACTCTTCGAGCTCAAGTACCAACATTTTCAATCTGAAGTTAGATCTTCAATCAATAACCAAGAAG
CCAAAAGCAACCGCCCGTCGCCGAAGAATTTTTCGACTATCGTCGCCACTACCAACCTCCGTCGACCGACCGCCACCGTCGACCTGTAACTACCGGCTGCCTTTGACCTC
CACTAATCGGCCGTCGTCGCCAACCTTAGACTATCGGCCACGTCGACCTCCACCGACCGACCTCTACCGTCGACCTCTGACTACCGACCTTCGCCAGCTAACTGCCACCG
TCGACCTAGAACAACTGTCAACCTCCGACTACCGGACGCTGTCGACCTTCACCAATCGCCGTCGACCCCCGCTGACTAGCCTCTGCCATCTGACCGCTACCGTCGACCTC
AGAATATAG
Protein sequenceShow/hide protein sequence
MPLKYWVDAFLTAVFLINRMPTKALSLDSPFQKLYSKQPTYSNIRVFGCRCFPYLKNLPNNNKFSKRTYPCIFVGYSLEHKGYRCLDPTNNRIYISRHVVFDEYCFPFEK
HISNGTNQRVTTLEVSDFVGIQESDRNFTRRQHIEESDATSSQNAQEMLQQKYIGGDVAVGGNETTLEPTKDATHNQIEELNLDDIPQIKNPVEENTQMNHQSDIDCGDV
AETEDNNYTEMDFNLSNTEDPTSPSVEMSSKGNEVNATSSERDYAETTEENIASRPDTITAQDKNYSDLQFPLSHTTSPLNAINLSSLPDISKYLHVEIPLSPTGTPTTN
EATSKHHMVTRHKLALNPSLDPRLSQEINNIRRGKQHHSYVAEHLNTGPKNYKVALNQPHWKRAMEEEMKALAENQTWELVPRPSDCNIVGSKWIFKTKYKENGIVDRYK
ARLVAQRYTQIEGLDYEETYSPVVKRTTIRLILSIATSASWPLRQLDVKNAFLHGNLKEQVYMEQPPGFIHSSLSNHVCKLQKSIYGLIQAPRAWFDRLANHLLHIGFTC
SNSDPSLFILKDKYVLMIRLIYVDDIILTGNNASHIQQLIHILSKEFALKDLGSLHYFLGIEVKSTHKGITLSQGKYARDLLAKSNMSGASTINTPMATSTQELPTDAQP
TDAKYYRSIVGSLQYLTLTRPDIVQAVNKVCQQLLNPTVKDYKAVKRILRYVQGTIDYGITLYKHSSTNLYGFCDADWGGCQLTRRNTTRFCIFLGSNCISWSSKKQPTV
ARSSSEAEYRAMTSATAELTWIGFLLRDIGIPLYNTPQLYCDNISALYMSINLVFHARTKHIEMDYHFVREKVALGMLITKYIPSKQQLVDIFTKPLTKAVFKGLRDKLG
VRSTTTFSLRNDRLALSRLTSDVDVDVDVDVEFQLSMLMGKDVACKTVNLHTGVVLATPPPMLKSENGRRRAREQGKRVENKVRDPFFSGEEGFKYLLMLLGFLGIQRRF
GMNQAESGRLEAAGTERRRTGSVTPLAAKLTPLSRSRAAALCSCRCLSCSHAVAGSCSAAVAVLVRISLSLCASLSLSLGRSPSLPSASLSLRRSSSVAATTLAAVAQIV
PPPAPVSLLLRVFFVVGVRSVALSVCLFVFSQRHLWISRVKRFGALSFPFQSIPPLSSNVSTVVGVVWRFRLDLGLSRGRTIMLGLDPRPMARPFGSVFLWKFQLQSILK
AHKLFGFIDGCIDCPLQTIPPSNISLMETSTEAPPAPISSQINPLYEDLVAKGQALMTLINATLSLEALTYVVGCASCKQAWEVIEKHYSSSSSTNIFNLKLDLQSITKK
PKATARRRRIFRLSSPLPTSVDRPPPSTCNYRLPLTSTNRPSSPTLDYRPRRPPPTDLYRRPLTTDLRQLTATVDLEQLSTSDYRTLSTFTNRRRPPLTSLCHLTATVDL
RI