CuGenDBv2

Sgr000783 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000783
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Integrase catalytic domain-containing protein
Genome location: tig00000536:37511..42463
RNA-Seq Expression: Sgr000783
Synteny: Sgr000783
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0003824 - catalytic activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains:
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR043502 - DNA/RNA polymerase superfamily
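
The genome location above uses a `contig:start..end` notation. As a minimal sketch, the span can be split into its parts and the gene length derived from it. The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start and end coordinates (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00000536:37511..42463")
print(loc.contig, loc.start, loc.end, loc.length)
```

If the coordinates turn out to be 0-based or half-open, only the `length` property needs to change.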


Homology
GenBank top hits (e value, % identity, alignment)
PNY13707.1 putative copia-type protein, partial [Trifolium pratense] | e value: 2.4e-192 | % identity: 60.11
Query:  LSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT
        L+  + KQ  EPK+ KSALK P W  AM+EE+ AL  N TW+LVP+P++ N++GSKW+++ K KEDGSI+R+KARLVA+G+TQV G+D++ETFSP+VK T
Subjt:  LSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT

Query:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITM
        TIR V+A++++  W +RQLDVKNAFLHG +KE VFM QPPGF +P  P HVC L +SLYGLKQAPRAWF+RLS FL+  GF CS +D S FI KT+ +T 
Subjt:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITM

Query:  IMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYR
        ++LIYVDDI+V GN++  +  L+Q+L  EFA+KDLGP HYFLG+E     DG+ L+Q+KY  D+L+KTKM+      TP      + ++D+ +V+ATE+R
Subjt:  IMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYR

Query:  SIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQ
        SIVG+LQYLT TRPDIT+AVN+ CQ    P + DLKAVKRILRYL GT N G+ +  N+   LY F DADW GCP  RRSTTGYC+FLGANCISW SKKQ
Subjt:  SIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQ

Query:  PTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPL
        PT+ARSS+EAEYR+MA TTAELTWI+++L+DI + + + PQLFCDN+SAL MS+NPVFHARTKHIE+DYHFVRE+VA+G L+TRY P+  Q+AD+ TKPL
Subjt:  PTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPL

Query:  SKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL
        +K+SF K RSKLGV  +  +  +   K+ N   L
Subjt:  SKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL
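
In the alignment blocks on this page, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the counts for one block can be tallied from that line. The function name `midline_counts` is hypothetical. The `% identity` values in the hit tables are reported over the whole alignment, so a single block will not necessarily reproduce them.

```python
def midline_counts(midline: str, columns: int) -> dict:
    """Tally identical and positive columns from a BLAST-style match line.

    `columns` is the width of the aligned block (the length of the query
    segment); the match line is padded to that width because trailing
    spaces are often lost when a page is copied as text.
    """
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    conservative = padded.count("+")
    return {
        "columns": columns,
        "identical": identical,
        # BLAST "positives" include identical columns as well as '+' columns.
        "positive": identical + conservative,
    }


# Last block of the first GenBank hit, copied from the alignment above.
query = "SKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL"
match_line = "+K+SF K RSKLGV  +  +  +   K+ N   L"
counts = midline_counts(match_line, len(query))
print(counts, round(100 * counts["identical"] / counts["columns"], 2))
```

Summing these counts over every block of a hit gives an alignment-wide estimate that can be compared with the reported value.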

PNY16899.1 copia-like polyprotein, partial [Trifolium pratense] | e value: 1.6e-180 | % identity: 54.39
Query:  SIFAGSSMQTTNGDK--QGQQHNLKEKAPVE----PHHMVTRGKSAKDPQLY-------STIHK--KGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEE
        S F   S Q  + D   Q +  NL     +E    P  + TR    K P  Y        TI      H +LI + +  EPK++K+ALK  +W+ AM+EE
Subjt:  SIFAGSSMQTTNGDK--QGQQHNLKEKAPVE----PHHMVTRGKSAKDPQLY-------STIHK--KGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEE

Query:  MKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLK
        + AL  N+TW LVP+P   N++GSKW+F+TKL EDGSI+R+KARLVA+GYTQ+ GLD+ ETFSP+VK  TIR++L++AV   W L+QLDVKNAFLHG L 
Subjt:  MKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLK

Query:  EEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFA
        E V+M QPPGF+HP L  HVC+L++SLYGLKQAPRAWFE+LS  L   GF CS +DPS FI + +    ++L+YVDDII+TGN+   +  LI++L  +FA
Subjt:  EEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFA

Query:  LKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPR
        LKDLG  HYFLGIE+ +   GI +SQ KYA D+L +  M++AS+  TP++   +   +DN+LV+ATEYR + GSLQYLT TRPD+T+AVN VCQ  Q P 
Subjt:  LKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPR

Query:  IKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKD
         KDL+AVKRILRY+ GT  HG+ +   SSL L AF DADW GCP  RRSTTG+CI+LG +CISW SKKQPT++RSS+EAEYRA+A T +ELTW+ ++L D
Subjt:  IKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKD

Query:  IGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGVRCTS-SSLRKNERKQVN
        IGI +   P +FCDN SA+ MS NPVFHARTKHI IDYHF+RE+V  G L  RY+P+  Q+AD+ TKPL K+SF     KLGV   S  SL+   + Q  
Subjt:  IGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGVRCTS-SSLRKNERKQVN

Query:  PTS
         TS
Subjt:  PTS

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 3.0e-195 | % identity: 61.34
Query:  HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARL
        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW  AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARL
Subjt:  HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARL

Query:  VAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFL
        VA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  L  HVCKLNRSLYGLKQAPRAWF+RLSQ L
Subjt:  VAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFL

Query:  VQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQY
        +  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +  
Subjt:  VQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQY

Query:  ETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPD
         TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  F DADW GC +
Subjt:  ETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPD

Query:  IRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERV
         RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M +N VFHAR+KHIE+DYHFVRE+V
Subjt:  IRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERV

Query:  ALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        A G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  ALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RVX04530.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.0e-194 | % identity: 50.86
Query:  DNEGQPNLASNQKTTLNASTESHIPCHLDISRSANHIDISTKLVSNQEPTQITC-IEDH---------TPSQLDNSRCVDQLNISGHQNQNNSQIIQTLT
        ++ G P + + + +T+ AS   H                   +V+ + PT +     DH         T SQ++++       +S   + +   ++    
Subjt:  DNEGQPNLASNQKTTLNASTESHIPCHLDISRSANHIDISTKLVSNQEPTQITC-IEDH---------TPSQLDNSRCVDQLNISGHQNQNNSQIIQTLT

Query:  PQE-------ALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALK
        PQ+         SL + S     D  +   S     +      QH   +       HM+TR K   DP L S +     ++  A + ++ EPK++++ LK
Subjt:  PQE-------ALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALK

Query:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLD
        IPHW   M+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLD
Subjt:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLD

Query:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME
        VKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + 
Subjt:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME

Query:  ELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAV
        +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AV
Subjt:  ELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAV

Query:  NKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTA
        NK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  F DADW GC + RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + A
Subjt:  NKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTA

Query:  ELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        E+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+KHIE+DYHFVRE+ A G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  ELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RVX04589.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 5.2e-195 | % identity: 49.07
Query:  WIQGAEDKPENHNNSDLEVAKQCRNAHWFEESEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQKTTLNAS---------TESHIPCHLDISRSA
        W++ A    EN  N     +K+C +     E  ++ ++   ++      +       ++ G P   + + +T+ AS         TE+     L  S   
Subjt:  WIQGAEDKPENHNNSDLEVAKQCRNAHWFEESEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQKTTLNAS---------TESHIPCHLDISRSA

Query:  NHIDISTKLVSNQEPTQITCIEDHTPSQLDNSRCVDQ---LNISGHQNQNNSQIIQTLTPQEALS------LPDISNDMFVDFSIFAGSSMQTTNGDKQG
        +H   +T  +S     QI     + P+++ N   + +   +++S  Q    ++ +      ++ S      +PD S  + VD S              QG
Subjt:  NHIDISTKLVSNQEPTQITCIEDHTPSQLDNSRCVDQ---LNISGHQNQNNSQIIQTLTPQEALS------LPDISNDMFVDFSIFAGSSMQTTNGDKQG

Query:  QQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTK
        Q  + K        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW  AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTK
Subjt:  QQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTK

Query:  LKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLK
        LKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLK
Subjt:  LKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLK

Query:  QAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYAR
        QAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY R
Subjt:  QAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYAR

Query:  DILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLK
        D+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+
Subjt:  DILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLK

Query:  LYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHART
        L  F DADW GC + RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+
Subjt:  LYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHART

Query:  KHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        KHIE+DYHFVRE+VA G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  KHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

TrEMBL top hits (e value, % identity, alignment)
A0A2K3PEI5 Putative copia-type protein (Fragment) | e value: 1.2e-192 | % identity: 60.11
Query:  LSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT
        L+  + KQ  EPK+ KSALK P W  AM+EE+ AL  N TW+LVP+P++ N++GSKW+++ K KEDGSI+R+KARLVA+G+TQV G+D++ETFSP+VK T
Subjt:  LSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT

Query:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITM
        TIR V+A++++  W +RQLDVKNAFLHG +KE VFM QPPGF +P  P HVC L +SLYGLKQAPRAWF+RLS FL+  GF CS +D S FI KT+ +T 
Subjt:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITM

Query:  IMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYR
        ++LIYVDDI+V GN++  +  L+Q+L  EFA+KDLGP HYFLG+E     DG+ L+Q+KY  D+L+KTKM+      TP      + ++D+ +V+ATE+R
Subjt:  IMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYR

Query:  SIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQ
        SIVG+LQYLT TRPDIT+AVN+ CQ    P + DLKAVKRILRYL GT N G+ +  N+   LY F DADW GCP  RRSTTGYC+FLGANCISW SKKQ
Subjt:  SIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQ

Query:  PTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPL
        PT+ARSS+EAEYR+MA TTAELTWI+++L+DI + + + PQLFCDN+SAL MS+NPVFHARTKHIE+DYHFVRE+VA+G L+TRY P+  Q+AD+ TKPL
Subjt:  PTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPL

Query:  SKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL
        +K+SF K RSKLGV  +  +  +   K+ N   L
Subjt:  SKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL

A0A2N9I9N7 CCHC-type domain-containing protein | e value: 9.2e-190 | % identity: 59.89
Query:  PVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERY
        P   H M+TR K+ +            H+ L  T    EPKS KSAL+ PHW  AM +E+ AL +NHTW LVP+ S  NI+GS+W+FKTKLK DGSIER+
Subjt:  PVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERY

Query:  KARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERL
        KARLVA+GY Q+EGLD+ ETFSP+VK TTIR+VL++A T GW LRQLDVKNAFLHG+LKE V+M QPPGF   S P HVC+L++++YGLKQAPRAWF+R 
Subjt:  KARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERL

Query:  SQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIE
        S FL+  GF+CS +D S F+F+++   +++L+YVDDIIVT N    +  LI KLS EF++KDLGP HYFLGI+V H SDGI LSQ+KYAR+IL K  M +
Subjt:  SQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIE

Query:  ASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWR
             TP++           LV+AT YRSIVG+LQYLTLTRPD+T+AVN VCQ +  P     +AVKRILRYL GT  +GI    +SSL LY F DADW 
Subjt:  ASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWR

Query:  GCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFV
        GCPD RRSTTGYCI+LGANCISW SKKQ T++RSS+EAEYRAMA   AELTW++++L D+GI +  PP LFCDN SAL M++NPVFHARTKHIE+D+HFV
Subjt:  GCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFV

Query:  RERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV-RCTSSSLRKNERK
        RE+VA G L TRYVPSQ Q+AD+ TK +SK+ F + R KLGV     SSLR  +++
Subjt:  RERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV-RCTSSSLRKNERK

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.5e-195 | % identity: 61.34
Query:  HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARL
        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW  AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARL
Subjt:  HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARL

Query:  VAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFL
        VA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  L  HVCKLNRSLYGLKQAPRAWF+RLSQ L
Subjt:  VAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFL

Query:  VQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQY
        +  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +  
Subjt:  VQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQY

Query:  ETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPD
         TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  F DADW GC +
Subjt:  ETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPD

Query:  IRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERV
         RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M +N VFHAR+KHIE+DYHFVRE+V
Subjt:  IRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERV

Query:  ALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        A G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  ALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

A0A438J6E1 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 9.5e-195 | % identity: 50.86
Query:  DNEGQPNLASNQKTTLNASTESHIPCHLDISRSANHIDISTKLVSNQEPTQITC-IEDH---------TPSQLDNSRCVDQLNISGHQNQNNSQIIQTLT
        ++ G P + + + +T+ AS   H                   +V+ + PT +     DH         T SQ++++       +S   + +   ++    
Subjt:  DNEGQPNLASNQKTTLNASTESHIPCHLDISRSANHIDISTKLVSNQEPTQITC-IEDH---------TPSQLDNSRCVDQLNISGHQNQNNSQIIQTLT

Query:  PQE-------ALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALK
        PQ+         SL + S     D  +   S     +      QH   +       HM+TR K   DP L S +     ++  A + ++ EPK++++ LK
Subjt:  PQE-------ALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALK

Query:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLD
        IPHW   M+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLD
Subjt:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLD

Query:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME
        VKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + 
Subjt:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME

Query:  ELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAV
        +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AV
Subjt:  ELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAV

Query:  NKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTA
        NK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  F DADW GC + RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + A
Subjt:  NKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTA

Query:  ELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        E+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+KHIE+DYHFVRE+ A G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  ELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

A0A438J6K3 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.5e-195 | % identity: 49.07
Query:  WIQGAEDKPENHNNSDLEVAKQCRNAHWFEESEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQKTTLNAS---------TESHIPCHLDISRSA
        W++ A    EN  N     +K+C +     E  ++ ++   ++      +       ++ G P   + + +T+ AS         TE+     L  S   
Subjt:  WIQGAEDKPENHNNSDLEVAKQCRNAHWFEESEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQKTTLNAS---------TESHIPCHLDISRSA

Query:  NHIDISTKLVSNQEPTQITCIEDHTPSQLDNSRCVDQ---LNISGHQNQNNSQIIQTLTPQEALS------LPDISNDMFVDFSIFAGSSMQTTNGDKQG
        +H   +T  +S     QI     + P+++ N   + +   +++S  Q    ++ +      ++ S      +PD S  + VD S              QG
Subjt:  NHIDISTKLVSNQEPTQITCIEDHTPSQLDNSRCVDQ---LNISGHQNQNNSQIIQTLTPQEALS------LPDISNDMFVDFSIFAGSSMQTTNGDKQG

Query:  QQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTK
        Q  + K        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW  AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTK
Subjt:  QQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTK

Query:  LKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLK
        LKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSP++K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLK
Subjt:  LKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLK

Query:  QAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYAR
        QAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG  HYFLG+EV +  +G+ +SQ KY R
Subjt:  QAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYAR

Query:  DILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLK
        D+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+
Subjt:  DILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLK

Query:  LYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHART
        L  F DADW GC + RRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+
Subjt:  LYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHART

Query:  KHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        KHIE+DYHFVRE+VA G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  KHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 2.0e-96 | % identity: 38.96
Query:  WKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKN
        W+ A+  E+ A   N+TW +  +P + NI+ S+W+F  K  E G+  RYKARLVA+G+TQ   +DYEETF+P+ + ++ R +L++ +    K+ Q+DVK 
Subjt:  WKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKN

Query:  AFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEI--TMIMLIYVDDIIVTGNSEKHMEE
        AFL+G LKEE++M  P G    S   +VCKLN+++YGLKQA R WFE   Q L +  F  S  D   +I     I   + +L+YVDD+++       M  
Subjt:  AFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEI--TMIMLIYVDDIIVTGNSEKHMEE

Query:  LIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPM-SNYTSNCANDNELVNATEYRSIVGSLQYLTL-TRPDITYA
          + L  +F + DL    +F+GI +    D I LSQ  Y + IL K  M   +   TP+ S       N +E  N T  RS++G L Y+ L TRPD+T A
Subjt:  LIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPM-SNYTSNCANDNELVNATEYRSIVGSLQYLTL-TRPDITYA

Query:  VNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSL--KLYAFYDADWRGCPDIRRSTTGYCI-FLGANCISWKSKKQPTIARSSSEAEYRAMA
        VN + +       +  + +KR+LRYL GT +  + F KN +   K+  + D+DW G    R+STTGY       N I W +K+Q ++A SS+EAEY A+ 
Subjt:  VNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSL--KLYAFYDADWRGCPDIRRSTTGYCI-FLGANCISWKSKKQPTIARSSSEAEYRAMA

Query:  ITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
            E  W+ F+L  I I +  P +++ DN   + ++ NP  H R KHI+I YHF RE+V   ++   Y+P+++QLADI TKPL    F +LR KLG+
Subjt:  ITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.3e-95 | % identity: 38.1
Query:  IATKQEMEPKSFKSALKIP---HWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT
        +    + EP+S K  L  P       AM+EEM++L +N T+ LV  P     +  KW+FK K   D  + RYKARLV +G+ Q +G+D++E FSP+VK T
Subjt:  IATKQEMEPKSFKSALKIP---HWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPT

Query:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEIT-
        +IR +L++A +   ++ QLDVK AFLHG L+EE++M QP GF+       VCKLN+SLYGLKQAPR W+ +   F+    +  ++SDP  +  + +E   
Subjt:  TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEIT-

Query:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEV--HHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTS-------NCAND
        +I+L+YVDD+++ G  +  + +L   LS  F +KDLGP    LG+++    TS  + LSQEKY   +L +  M  A    TP++ +             +
Subjt:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEV--HHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTS-------NCAND

Query:  NELVNATEYRSIVGSLQY-LTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLG
           +    Y S VGSL Y +  TRPDI +AV  V + L+ P  +  +AVK ILRYL GTT   + F  +  + L  + DAD  G  D R+S+TGY     
Subjt:  NELVNATEYRSIVGSLQY-LTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLG

Query:  ANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQ
           ISW+SK Q  +A S++EAEY A   T  E+ W+   L+++G+H  +   ++CD+ SA+ +S N ++HARTKHI++ YH++RE V    L    + + 
Subjt:  ANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQ

Query:  DQLADILTKPLSKESFKKLRSKLGV
        +  AD+LTK + +  F+  +  +G+
Subjt:  DQLADILTKPLSKESFKKLRSKLGV

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 3.4e-64 | % identity: 51.54
Query:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEY
        M +L+YVDDI++TG+S   +  LI +LS  F++KDLGP HYFLGI++     G+ LSQ KYA  IL    M++     TP+    ++  +  +  + +++
Subjt:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEY

Query:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKK
        RSIVG+LQYLTLTRPDI+YAVN VCQ++  P + D   +KR+LRY+ GT  HG+  +KNS L + AF D+DW GC   RRSTTG+C FLG N ISW +K+
Subjt:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKK

Query:  QPTIARSSSEAEYRAMAITTAELTWIS
        QPT++RSS+E EYRA+A+T AELTW S
Subjt:  QPTIARSSSEAEYRAMAITTAELTWIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.9e-148 | % identity: 45
Query:  ISRSANHIDISTKLVSNQEPT-------QITCIEDHTPSQLDNSRCVDQLNISGHQNQNNSQIIQTL-TP-QEALSLPDISNDMFVDFSIFAGSSMQTTN
        +S S      S+   S+ EPT       Q T     T +Q  +S+   Q N +   N++ SQ+ Q+L TP Q + S P  +       +     S+    
Subjt:  ISRSANHIDISTKLVSNQEPT-------QITCIEDHTPSQLDNSRCVDQLNISGHQNQNNSQIIQTL-TP-QEALSLPDISNDMFVDFSIFAGSSMQTTN

Query:  GDKQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGS
             Q  N   +AP+  H M TR K+   K    YS          ++   E EP++   ALK   W+ AM  E+ A + NHTWDLV P PSH  I+G 
Subjt:  GDKQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGS

Query:  KWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLN
        +WIF  K   DGS+ RYKARLVA+GY Q  GLDY ETFSP++K T+IRIVL VAV   W +RQLDV NAFL G L ++V+M+QPPGF     P +VCKL 
Subjt:  KWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLN

Query:  RSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIML
        ++LYGLKQAPRAW+  L  +L+  GF  S SD S F+ +  +  + ML+YVDDI++TGN    +   +  LS  F++KD    HYFLGIE      G+ L
Subjt:  RSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIML

Query:  SQEKYARDILIKTKMIEASQYETPMS-NYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGIS
        SQ +Y  D+L +T MI A    TPM+ +   +  +  +L + TEYR IVGSLQYL  TRPDI+YAVN++ Q +  P  + L+A+KRILRYL GT NHGI 
Subjt:  SQEKYARDILIKTKMIEASQYETPMS-NYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGIS

Query:  FYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSI
          K ++L L+A+ DADW G  D   ST GY ++LG + ISW SKKQ  + RSS+EAEYR++A T++E+ WI  +L ++GI + +PP ++CDN+ A  +  
Subjt:  FYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSI

Query:  NPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        NPVFH+R KHI IDYHF+R +V  G L   +V + DQLAD LTKPLS+ +F+   SK+GV
Subjt:  NPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.8e-143 | % identity: 47.99
Query:  KAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDG
        +APV  H M TR K    K  Q YS        SL A     EP++   A+K   W+ AM  E+ A + NHTWDLV P P    I+G +WIF  K   DG
Subjt:  KAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDG

Query:  SIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRA
        S+ RYKARLVA+GY Q  GLDY ETFSP++K T+IRIVL VAV   W +RQLDV NAFL G L +EV+M+QPPGF     P +VC+L +++YGLKQAPRA
Subjt:  SIERYKARLVAQGYTQVEGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRA

Query:  WFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIK
        W+  L  +L+  GF  S SD S F+ +     + ML+YVDDI++TGN    ++  +  LS  F++K+    HYFLGIE      G+ LSQ +Y  D+L +
Subjt:  WFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIK

Query:  TKMIEASQYETPMSNYTS-NCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAF
        T M+ A    TPM+        +  +L + TEYR IVGSLQYL  TRPD++YAVN++ Q +  P      A+KR+LRYL GT +HGI   K ++L L+A+
Subjt:  TKMIEASQYETPMSNYTS-NCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAF

Query:  YDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIE
         DADW G  D   ST GY ++LG + ISW SKKQ  + RSS+EAEYR++A T++EL WI  +L ++GI +  PP ++CDN+ A  +  NPVFH+R KHI 
Subjt:  YDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIE

Query:  IDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        +DYHF+R +V  G L   +V + DQLAD LTKPLS+ +F+    K+GV
Subjt:  IDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 1.2e-120 | % identity: 43.15
Query:  YSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEE
        Y  +    H  L+   +  EP ++  A +   W  AM++E+ A+   HTW++   P +   IG KW++K K   DG+IERYKARLVA+GYTQ EG+D+ E
Subjt:  YSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEE

Query:  TFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGF---QHPSLPQH-VCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSD
        TFSP+ K T+++++LA++    + L QLD+ NAFL+G L EE++M  PPG+   Q  SLP + VC L +S+YGLKQA R WF + S  L+  GF  SHSD
Subjt:  TFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGF---QHPSLPQH-VCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSD

Query:  PSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMS-NYTSN
         ++F+  T  + + +L+YVDDII+  N++  ++EL  +L   F L+DLGP  YFLG+E+  ++ GI + Q KYA D+L +T ++       PM  + T +
Subjt:  PSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMS-NYTSN

Query:  CANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCI
          +  + V+A  YR ++G L YL +TR DI++AVNK+ Q  + PR+   +AV +IL Y+ GT   G+ +   + ++L  F DA ++ C D RRST GYC+
Subjt:  CANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCI

Query:  FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRER
        FLG + ISWKSKKQ  +++SS+EAEYRA++  T E+ W++   +++ + + +P  LFCDN +A+ ++ N VFH RTKHIE D H VRER
Subjt:  FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRER

ATMG00240.1 Gag-Pol-related retrotransposon family protein    2.4e-17    46.59
Query:  YLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCI-----FLGA
        YLT+TRPD+T+AVN++ Q     R   ++AV ++L Y+ GT   G+ +   S L+L AF D+DW  CPD RRS TG+C      FLGA
Subjt:  YLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCI-----FLGA

ATMG00810.1 DNA/RNA polymerases superfamily protein    2.4e-65    51.54
Query:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEY
        M +L+YVDDI++TG+S   +  LI +LS  F++KDLGP HYFLGI++     G+ LSQ KYA  IL    M++     TP+    ++  +  +  + +++
Subjt:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEY

Query:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKK
        RSIVG+LQYLTLTRPDI+YAVN VCQ++  P + D   +KR+LRY+ GT  HG+  +KNS L + AF D+DW GC   RRSTTG+C FLG N ISW +K+
Subjt:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKK

Query:  QPTIARSSSEAEYRAMAITTAELTWIS
        QPT++RSS+E EYRA+A+T AELTW S
Subjt:  QPTIARSSSEAEYRAMAITTAELTWIS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    2.9e-26    48.85
Query:  MVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVA
        M+TR K+       + ++ K  L+ I T  + EPKS   ALK P W  AM+EE+ AL  N TW LVP P + NI+G KW+FKTKL  DG+++R KARLVA
Subjt:  MVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVA

Query:  QGYTQVEGLDYEETFSPIVKPTTIRIVLAVA
        +G+ Q EG+ + ET+SP+V+  TIR +L VA
Subjt:  QGYTQVEGLDYEETFSPIVKPTTIRIVLAVA
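
The %identity values listed for each hit can be related to the alignment rows shown above. In BLAST-style output the middle row repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The Python sketch below counts letters in the middle row and divides by the number of aligned columns. The function name `percent_identity` is hypothetical, and the sketch assumes the `Query:  ` prefix and the matching leading spaces of the middle row have already been stripped from each row.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity for an alignment given as parallel lists of rows.

    query_rows   -- aligned query strings (may contain '-' gap characters)
    midline_rows -- the rows printed between Query and Subjt
    """
    aligned_columns = 0
    identical_columns = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing blanks are often lost when a page is saved as text,
        # so pad the middle row back to the width of the query row.
        midline = midline.ljust(len(query))[:len(query)]
        aligned_columns += len(query)
        # A letter in the middle row marks an identical residue;
        # '+' and ' ' mark non-identical columns.
        identical_columns += sum(1 for ch in midline if ch.isalpha())
    if aligned_columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical_columns / aligned_columns


# Last alignment block of the ATMG00820.1 hit, prefixes removed.
query = ["QGYTQVEGLDYEETFSPIVKPTTIRIVLAVA"]
midline = ["+G+ Q EG+ + ET+SP+V+  TIR +L VA"]
print(round(percent_identity(query, midline), 2))
```

A single block gives only a local figure; the value in the table is computed over all blocks of a hit, so every Query and middle row of that hit would need to be passed in together.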


Sequences
CDS sequence
ATGCTTTCATCTCCACAAAATCAGTCCCACAATTTGGCAACTCCAGATCCACAAATCAAGGCTGAATTAAATAATGTTTGTGAAGAATGGATACAAGGAGCAGAAGACAA
ACCAGAAAATCACAATAATAGTGATCTGGAAGTTGCAAAACAGTGTAGAAATGCACACTGGTTTGAAGAAAGTGAACATGAAGCAACCATCCCAGTGGCGAATAATCAAT
TGCAGGAAGAAACACCATCTCAACCTACTCAAGACAATGAAGGACAGCCAAATTTAGCATCTAACCAAAAGACTACATTGAATGCAAGCACTGAAAGTCACATTCCGTGC
CATCTAGACATTAGCAGATCTGCAAATCACATTGACATCAGTACAAAGTTGGTGTCTAACCAAGAGCCTACACAAATTACATGCATTGAAGACCACACTCCAAGCCAACT
AGACAATAGTAGATGTGTGGACCAGCTTAACATCAGTGGGCATCAAAACCAGAATAATTCTCAAATCATCCAAACTTTAACCCCACAGGAAGCTCTGTCCTTACCTGATA
TAAGCAATGACATGTTTGTTGATTTTTCCATTTTTGCAGGAAGCTCAATGCAGACCACAAATGGGGATAAACAAGGACAACAACATAACCTTAAGGAAAAAGCTCCTGTT
GAGCCTCATCACATGGTCACTAGGGGAAAATCGGCTAAGGATCCTCAACTATACTCCACGATACACAAAAAAGGACATTTATCCCTGATTGCCACTAAACAAGAAATGGA
GCCAAAGTCTTTCAAATCCGCCCTCAAGATTCCCCATTGGAAGGCGGCTATGGAAGAGGAAATGAAGGCACTTTTGGAAAACCACACATGGGACCTTGTACCCAAACCAT
CTCACACAAACATCATTGGGTCAAAATGGATATTTAAGACCAAACTAAAAGAAGATGGATCCATCGAACGGTACAAGGCTCGCCTAGTTGCTCAGGGATACACACAAGTT
GAGGGACTTGACTATGAAGAGACATTTAGTCCTATAGTGAAACCAACCACTATACGAATTGTCCTAGCTGTGGCAGTAACGTGCGGATGGAAATTAAGGCAACTAGATGT
TAAAAATGCCTTCCTCCATGGTTACCTTAAGGAAGAAGTCTTCATGGCTCAACCACCTGGTTTCCAACATCCAAGCCTTCCTCAACATGTGTGCAAATTAAACCGCTCTC
TTTATGGTCTTAAACAAGCCCCTAGAGCTTGGTTCGAAAGACTATCACAGTTCCTCGTGCAAACAGGATTCTTCTGCTCTCACTCTGATCCCTCATTTTTCATTTTTAAA
ACCAATGAGATCACTATGATCATGCTAATCTATGTAGATGATATTATTGTTACAGGTAACAGTGAAAAGCATATGGAGGAACTCATTCAAAAACTCAGCATAGAGTTTGC
CCTCAAAGACCTTGGACCCTTCCACTACTTCCTAGGCATTGAAGTCCACCACACATCAGATGGCATCATGCTATCACAAGAAAAATATGCTAGAGACATCCTCATCAAAA
CAAAAATGATAGAAGCCTCTCAATATGAAACTCCAATGAGCAATTACACCTCAAATTGTGCAAATGACAATGAACTTGTCAATGCAACTGAATACAGGAGCATTGTGGGA
TCACTCCAATATCTGACCCTTACTCGACCTGATATCACTTATGCAGTCAACAAAGTGTGTCAACAACTCCAAACACCAAGGATAAAGGATCTAAAGGCGGTCAAAAGGAT
ACTACGATACCTAAATGGGACAACAAATCATGGTATATCTTTTTATAAAAATAGCTCACTTAAACTCTATGCTTTCTATGATGCAGATTGGAGAGGCTGTCCAGACATTC
GAAGAAGTACCACAGGATATTGCATATTCCTTGGAGCAAATTGCATATCATGGAAGTCAAAGAAACAACCAACAATTGCAAGGTCAAGTTCAGAAGCAGAATACCGAGCC
ATGGCAATCACCACAGCTGAGCTCACTTGGATAAGCTTTGTCCTCAAAGATATTGGCATACATATGATGCAACCTCCACAACTGTTTTGCGACAACATGAGTGCACTACG
AATGTCCATCAATCCCGTATTCCATGCTAGGACAAAACACATTGAGATCGATTACCATTTTGTAAGAGAAAGAGTAGCTCTTGGTCTCCTCATCACTCGGTATGTTCCCT
CTCAAGACCAACTAGCTGACATTCTCACCAAGCCTCTCTCAAAAGAATCATTTAAGAAGCTCAGAAGCAAACTTGGCGTCCGCTGCACTTCCTCAAGTTTGAGGAAGAAT
GAAAGAAAACAAGTCAATCCAACAAGTTTATGA
mRNA sequence
ATGCTTTCATCTCCACAAAATCAGTCCCACAATTTGGCAACTCCAGATCCACAAATCAAGGCTGAATTAAATAATGTTTGTGAAGAATGGATACAAGGAGCAGAAGACAA
ACCAGAAAATCACAATAATAGTGATCTGGAAGTTGCAAAACAGTGTAGAAATGCACACTGGTTTGAAGAAAGTGAACATGAAGCAACCATCCCAGTGGCGAATAATCAAT
TGCAGGAAGAAACACCATCTCAACCTACTCAAGACAATGAAGGACAGCCAAATTTAGCATCTAACCAAAAGACTACATTGAATGCAAGCACTGAAAGTCACATTCCGTGC
CATCTAGACATTAGCAGATCTGCAAATCACATTGACATCAGTACAAAGTTGGTGTCTAACCAAGAGCCTACACAAATTACATGCATTGAAGACCACACTCCAAGCCAACT
AGACAATAGTAGATGTGTGGACCAGCTTAACATCAGTGGGCATCAAAACCAGAATAATTCTCAAATCATCCAAACTTTAACCCCACAGGAAGCTCTGTCCTTACCTGATA
TAAGCAATGACATGTTTGTTGATTTTTCCATTTTTGCAGGAAGCTCAATGCAGACCACAAATGGGGATAAACAAGGACAACAACATAACCTTAAGGAAAAAGCTCCTGTT
GAGCCTCATCACATGGTCACTAGGGGAAAATCGGCTAAGGATCCTCAACTATACTCCACGATACACAAAAAAGGACATTTATCCCTGATTGCCACTAAACAAGAAATGGA
GCCAAAGTCTTTCAAATCCGCCCTCAAGATTCCCCATTGGAAGGCGGCTATGGAAGAGGAAATGAAGGCACTTTTGGAAAACCACACATGGGACCTTGTACCCAAACCAT
CTCACACAAACATCATTGGGTCAAAATGGATATTTAAGACCAAACTAAAAGAAGATGGATCCATCGAACGGTACAAGGCTCGCCTAGTTGCTCAGGGATACACACAAGTT
GAGGGACTTGACTATGAAGAGACATTTAGTCCTATAGTGAAACCAACCACTATACGAATTGTCCTAGCTGTGGCAGTAACGTGCGGATGGAAATTAAGGCAACTAGATGT
TAAAAATGCCTTCCTCCATGGTTACCTTAAGGAAGAAGTCTTCATGGCTCAACCACCTGGTTTCCAACATCCAAGCCTTCCTCAACATGTGTGCAAATTAAACCGCTCTC
TTTATGGTCTTAAACAAGCCCCTAGAGCTTGGTTCGAAAGACTATCACAGTTCCTCGTGCAAACAGGATTCTTCTGCTCTCACTCTGATCCCTCATTTTTCATTTTTAAA
ACCAATGAGATCACTATGATCATGCTAATCTATGTAGATGATATTATTGTTACAGGTAACAGTGAAAAGCATATGGAGGAACTCATTCAAAAACTCAGCATAGAGTTTGC
CCTCAAAGACCTTGGACCCTTCCACTACTTCCTAGGCATTGAAGTCCACCACACATCAGATGGCATCATGCTATCACAAGAAAAATATGCTAGAGACATCCTCATCAAAA
CAAAAATGATAGAAGCCTCTCAATATGAAACTCCAATGAGCAATTACACCTCAAATTGTGCAAATGACAATGAACTTGTCAATGCAACTGAATACAGGAGCATTGTGGGA
TCACTCCAATATCTGACCCTTACTCGACCTGATATCACTTATGCAGTCAACAAAGTGTGTCAACAACTCCAAACACCAAGGATAAAGGATCTAAAGGCGGTCAAAAGGAT
ACTACGATACCTAAATGGGACAACAAATCATGGTATATCTTTTTATAAAAATAGCTCACTTAAACTCTATGCTTTCTATGATGCAGATTGGAGAGGCTGTCCAGACATTC
GAAGAAGTACCACAGGATATTGCATATTCCTTGGAGCAAATTGCATATCATGGAAGTCAAAGAAACAACCAACAATTGCAAGGTCAAGTTCAGAAGCAGAATACCGAGCC
ATGGCAATCACCACAGCTGAGCTCACTTGGATAAGCTTTGTCCTCAAAGATATTGGCATACATATGATGCAACCTCCACAACTGTTTTGCGACAACATGAGTGCACTACG
AATGTCCATCAATCCCGTATTCCATGCTAGGACAAAACACATTGAGATCGATTACCATTTTGTAAGAGAAAGAGTAGCTCTTGGTCTCCTCATCACTCGGTATGTTCCCT
CTCAAGACCAACTAGCTGACATTCTCACCAAGCCTCTCTCAAAAGAATCATTTAAGAAGCTCAGAAGCAAACTTGGCGTCCGCTGCACTTCCTCAAGTTTGAGGAAGAAT
GAAAGAAAACAAGTCAATCCAACAAGTTTATGA
Protein sequence
MLSSPQNQSHNLATPDPQIKAELNNVCEEWIQGAEDKPENHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQKTTLNASTESHIPC
HLDISRSANHIDISTKLVSNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNSQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPV
EPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQV
EGLDYEETFSPIVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFK
TNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPFHYFLGIEVHHTSDGIMLSQEKYARDILIKTKMIEASQYETPMSNYTSNCANDNELVNATEYRSIVG
SLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFYDADWRGCPDIRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRA
MAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGVRCTSSSLRKN
ERKQVNPTSL
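
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The Python sketch below is a minimal illustration, not part of the database record: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear code (NCBI translation table 1) applies to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # join wrapped lines, drop whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGCTTTCATCTCCACAAAATCAG"))  # MLSSPQNQ
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence, after joining its wrapped lines, is a quick consistency check on the record.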