; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015509 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015509
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationtig00004455:184929..188543
RNA-Seq ExpressionSgr015509
SyntenySgr015509
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]1.7e-30145.44Show/hide
Query:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIHEPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL
        +VCQIC  P HTAL C +RF+H +Q +   +A+ A+ +  P D + + D+ A+ H+  DPG L+  S Y G +K+ +G+G  LDI+H G+  +     NL
Subjt:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIHEPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL

Query:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLYALEEHQQLAAYTASTK-APFSIWHKRMGHLNDSALKHLCNLHL
         L NVLVVP +KKNLLS  +L  D   +  F++   VIK+ + G+ + KG K+ G+YAL   ++ A ++   K A   +WH+R+GH     ++ L    L
Subjt:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLYALEEHQQLAAYTASTK-APFSIWHKRMGHLNDSALKHLCNLHL

Query:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI
        +  +S  K E  C +CQ++K  RLPF L NE    P++ IHCDLWG AP+ S Q FKYYV F+D++SR+TW +PLK KSDFFQ F  F   VENQF +KI
Subjt:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI

Query:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN
        K+FQ D GGEF    F   L+ +GI  Q +CP TP+QNG+ ERKHR++VE GL+L+F S TP +YWV+AF TA++LINR PS+ L+ ++P+  L+ K P+
Subjt:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN

Query:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE
        YS +RVFG KCFP+LR++S +K   R+ PC+F+GYS +HKGYRCL P S R+YISRHV FDE  FPF K   ++     + +  EF+  +W  G      
Subjt:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE

Query:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT
                                        +   E+T  +PT                        TP  +  ++ ++ +++      + P +    T
Subjt:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT

Query:  PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPV---EPHHMVTRGKSAKDPQLYSTI-----HKKGHLSLIATKQEMEPKSFKSALK
        P  + S P+ + D       F   + +  N D     ++++++  V   EP    T        Q +        +        AT   +EPKS K+ALK
Subjt:  PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPV---EPHHMVTRGKSAKDPQLYSTI-----HKKGHLSLIATKQEMEPKSFKSALK

Query:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLD
         P WK AMEEE+ AL++N TW+LVP+ +  NI+G KW+FKTK K DGS+ER KARLVA+G+ QV G+D+ ETFSPVVKP TIR+VL +A+   W++RQLD
Subjt:  IPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLD

Query:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME
        VKNAFLHG+L E VFM QPPGFQ+   P +VCKLN++LYGL+QAPRAWF+R S FL+  GF CS +D S F+ +++  T+++L+YVDDII+TG++   + 
Subjt:  VKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHME

Query:  ELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELV-NATEYRSIVGSLQYLTLTRPDITYA
        + I  L  EF++KDLGPLHYFLG+ V     GI+L Q +YAR++L +  M       TPM+  + +  ND+ L  +A  YRSIVG LQYLT TRPDI Y+
Subjt:  ELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELV-NATEYRSIVGSLQYLTLTRPDITYA

Query:  VNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITT
        VN +CQ +  P     + VKR+LRY+ GT ++GI   ++  L+L  F DADW GC  TRRSTTGYC +LG NCISW +KKQPT+ARSS++AEYRA+A   
Subjt:  VNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITT

Query:  AELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        AE+TW+SFVL+DIG+++ +PP LF DN+SAL M+INPVFHARTKHIEIDYHFVRE+V+ G L+T++V S +Q+AD+ TK L + +   LR KLG+
Subjt:  AELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.5e-28945.92Show/hide
Query:  IPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDC
        IP+ALAA+++H E  D N Y DSGAT HI NDP                                                                   
Subjt:  IPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDC

Query:  SIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQL
           F++  FVIK+   Q L +G K+ GLYALEE+  Q    T S+KA   +WH+RMGH    ++K L +   +++SSW K   +CV+CQL K  +LPF L
Subjt:  SIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQL

Query:  RNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQ
        RN+ISS PL+KIHCDLWGPAP  S Q + +            WL    K                       KIFQSDGGGEFQS+ F++HL + GI  Q
Subjt:  RNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQ

Query:  LSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTY
        +SCP TP+QNGVAERKHRH+VE GL+++F +K P   WVDAFLTAV+LINR+PS  L M++P+  LF + P Y S+R+FGC+CFPYLR+Y  +KFS +TY
Subjt:  LSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTY

Query:  PCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWM-------QGAEEKPENHNNSDLEVAKQCRNAHWFEE
        PCVF+GYS +HKGYRCL P + R+YISRHVIF+EN FP+D            +E++ F   +            +   EN  N     +K+C +     E
Subjt:  PCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWM-------QGAEEKPENHNNSDLEVAKQCRNAHWFEE

Query:  SEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT---------PQEALSLPDI
          ++ ++   ++      +       ++ G P   + +  T      DH       +     L  S H N +      TL+         P +  + PDI
Subjt:  SEHEATIP--VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT---------PQEALSLPDI

Query:  SNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKS
        S  + VD S F     +  + D   Q  +    A                 P   H      HM+TR K   DP L S +     ++  AT+ ++ EPK+
Subjt:  SNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKS

Query:  FKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGW
        +++ALKIPHW  AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSPV+K TTIR++ ++AVT GW
Subjt:  FKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGW

Query:  KLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGN
        K+RQLDVKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLKQAPRAWF+RLS                FFI+                   GN
Subjt:  KLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGN

Query:  SEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRP
            + +LI  LS EF+LKDLG LHYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRP
Subjt:  SEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRP

Query:  DITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRA
        DI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  FCDADW GC +TRRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR+
Subjt:  DITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRA

Query:  MAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        +A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+KHIE+DYHFVRE+VA G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  MAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0052.9Show/hide
Query:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN
        ++CQIC K NH+AL C +RFD+ +QL++IP+ALAA+++H E  D N Y DSGAT HI NDPGK++    Y+G++ ++VGNGEAL I+H+G  +LKT   +
Subjt:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN

Query:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL
        L L+ +LVVP +KKNLLSI +L  DN CSI F++  FVIK+   Q L +G K+ GLYALEE+  Q    T S+KA   +WH+RMGH    ++K L +   
Subjt:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL

Query:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI
        +++SSW K   +CV+CQL K  +LPF LRN+ISS PL+KIHCDLWGPAP  S Q +KYY  FIDD +RYTWLYPL++KSDFF+ F KFQ LVENQ +R+I
Subjt:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI

Query:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN
        KIFQSDGGGEFQS+ F++HL + GI  Q+SCP TP+QNGVAERKHRH+VE GL+++F +K P   WVDAFLTAV+LINR+PS  L M++P+  LF + P 
Subjt:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN

Query:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE
        Y S+R+FGC+CFPYLR+Y  +KFS +TYPCVF+GYS +HKGYRCL P + R+YISRHVI                    S++++ FL             
Subjt:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE

Query:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT
                +    +N H              N  +  ETP+                         DH       +  + +L++          +I T  
Subjt:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT

Query:  PQEAL--SLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWK
        P  +    +PD S  + VD S              QGQ  + K        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW 
Subjt:  PQEAL--SLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWK

Query:  AAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAF
         AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSPV+K TTIR++ ++AVT GWK+RQLDVKNAF
Subjt:  AAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAF

Query:  LHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQK
        LHG+LKEEVFM QPPGF +  L  HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  
Subjt:  LHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQK

Query:  LSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQ
        LS EF+LKDLG LHYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ
Subjt:  LSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQ

Query:  QLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWI
          Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  FCDADW GC +TRRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW+
Subjt:  QLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWI

Query:  SFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        +F+L+DIGI + +PPQL CDN+SAL M +N VFHAR+KHIE+DYHFVRE+VA G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  SFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RVX04530.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]0.0e+0049.51Show/hide
Query:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN
        ++CQIC K NH+AL C +RFD+++Q ++IP+ALAA+++H E  D N Y DSGATTHI ND                                        
Subjt:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN

Query:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL
                                     + F++  FVIK+   Q L +G K+ GLYALEE+  Q    T S+KA   +WH+RMGH    ++K L +   
Subjt:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL

Query:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI
        +++SSW K   +CV+CQL K  +LPF LRN+ISS PL+KIHCDLWGPAP  S Q +KYY  FIDD +RYTWLYPL++KSDFF+ F KFQ LVENQ +R+I
Subjt:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI

Query:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN
        KIFQSDGGGEFQS+ F++HL + GI  Q+SCP TP+QNGVAERKHRH+VE GL+++F +K P   WVDAFLTAV+LINR+PS  L M++P+  LF + P 
Subjt:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN

Query:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE
        Y S+R+FGC+CFPYLR+Y  +KFS +TYPCVF+GYS +HKGYRCL P + R+YISRHVIF+EN FP+D            +E++ F   +     E    
Subjt:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE

Query:  NHNNSDLE--VAKQCRNAHWFEESEHEATIPVANNQLQEE-------TPSQPTQ----DNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGH
        +H     +   A Q  N+   ++  H        NQ   E         +Q T+    ++ G P + + +  T      DH       +     L  S H
Subjt:  NHNNSDLE--VAKQCRNAHWFEESEHEATIPVANNQLQEE-------TPSQPTQ----DNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGH

Query:  QNQNNPQIIQTLT---------PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRG
         N +      TL+         P +  + PDIS  + VD S F     +  + D   Q  +    A                 P   H      HM+TR 
Subjt:  QNQNNPQIIQTLT---------PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRG

Query:  KSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYT
        K   DP L S +     ++  A + ++ EPK++++ LKIPHW   M+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++
Subjt:  KSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYT

Query:  QVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFF
        Q+ GLD+ ETFSPV+K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF 
Subjt:  QVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFF

Query:  CSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSN
        C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG LHYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+ 
Subjt:  CSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSN

Query:  YTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTT
         +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  FCDADW GC +TRRST+
Subjt:  YTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTT

Query:  GYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLI
        GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+KHIE+DYHFVRE+ A G+LI
Subjt:  GYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLI

Query:  TRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        TR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  TRYVPSQDQLADILTKPLSKESFKKLRSKLGV

RWR75576.1 Zinc finger, CCCH-type [Cinnamomum micranthum f. kanehirae]0.0e+0048.19Show/hide
Query:  VCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIHEPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNLN
        +CQIC KPNH+A+ C  RF+ T+Q  D  +ALAA ++ + +D + + D+GAT H+  D GKL+  S Y+G++K+ VGNG ALDITH+G+  +K G   L 
Subjt:  VCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIHEPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNLN

Query:  LQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTASTKAPF-SIWHKRMGHLNDSALKHLCNLHLL
        L NVLVVP +KKNLLS+S++  +      F+ D FV+K+   GQ +  G +  GLY+L+     A ++   +A    +WH+R+GH     L  L   +++
Subjt:  LQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTASTKAPF-SIWHKRMGHLNDSALKHLCNLHLL

Query:  DISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKIK
          ++   +  +C +CQ++K  RLPF      S+ P++ IHCDLWGPAP+LSCQHF+YYV F+D+ +R+TW +PLKKKSDF+Q F  F K V+ QF RKI+
Subjt:  DISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKIK

Query:  IFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNY
        IFQSDGGGEF    F   L   GI H+ SCPHTPQQNG+AERKHR+V E GL+L+F    P ++WVDAF TAV+LINR  S+ +N  +PY +L GK P+Y
Subjt:  IFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNY

Query:  SSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPEN
         S+RVFGCKCFPYLR+Y+N+KF+ R+ PCVF+GYS   KGYRC  P + RIY SRHV+FDE+TFPF    +    T K+  +  F   EW+ GA E   +
Subjt:  SSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPEN

Query:  HNNSDLEVAKQCRNAHWFEES--EHEATIPVANNQLQEETPSQPTQDNEG-QPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQT
         + + +       +A     S   H  ++  +N     +      Q + G  P+  S Q          H P    N+      + +   +++ P I  +
Subjt:  HNNSDLEVAKQCRNAHWFEES--EHEATIPVANNQLQEETPSQPTQDNEG-QPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQT

Query:  LTPQEALSLPDIS---------------NDMFVDFSIFAGSSM---QTTNGDKQGQQHNLKEKAPVEPHH-MVTRGKS--AKDPQLYSTIHKKGHLSLIA
         +P  A  LP  +               +D+   + +   SS     T   D     H++   AP  P+H M+TR KS   K    YS+       +L A
Subjt:  LTPQEALSLPDIS---------------NDMFVDFSIFAGSSM---QTTNGDKQGQQHNLKEKAPVEPHH-MVTRGKS--AKDPQLYSTIHKKGHLSLIA

Query:  TKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIV
             EPKS KSAL+   W +AM+EE+ AL +N TW LVP+ S+ N++G KW++KTKL+ DGS+ER KARLVA+G+ QVEG+D+ ETFSPVVKP TIRIV
Subjt:  TKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIV

Query:  LAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIY
        L +A+   W +RQLDVKNAFL+G+L E VFM QPP F HP  P HVC L ++LYGL+QAPRAWF+R S FL+  GFFCS +D S FI   +   + +L+Y
Subjt:  LAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIY

Query:  VDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATE-YRSIVG
        VDDII+TGN+   +  ++ +L  EFA+KDLG +HYFLGI+V     G+ L+Q KYA ++L K  M       TPM        + N L    + Y+S+VG
Subjt:  VDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATE-YRSIVG

Query:  SLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIA
         L YLT TRPDI+Y+VN VCQ +  P     + VKRILRY+ GT  +GI    NSSLKLYAF DADW GCP TRRSTTGYC +LG+NCISW SK+QPT+A
Subjt:  SLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIA

Query:  RSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKES
        RSS+EAEYRA+A T AE+TW++++L+DIG+++ QPP LFCDN+SAL M++NPVFHARTKHIE+DYHFVRE+VALG L+TR+VPS  Q+ADILTKPLS+  
Subjt:  RSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKES

Query:  FKKLRSKLGVRCTS
        F+ LR KLGV+  S
Subjt:  FKKLRSKLGVRCTS

TrEMBL top hitse value%identityAlignment
A0A2N9EWB3 Integrase catalytic domain-containing protein0.0e+0049.96Show/hide
Query:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL
        CQICG+  H+AL C  R+D+ ++  ++I +ALA  ++ + +D    Y D+GAT+H+ +  G L     Y G++ + VGNG  L I+HVG   L +   +L
Subjt:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL

Query:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH
        NL +VLVVP ++KNL+S+ KL  DN C     A +F IK+   G+ +  G K +GLYAL+    +AA  A  + KAP  IWH+R+GH +   L  L + +
Subjt:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH

Query:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK
        ++D+S W K E +C +CQ+ K  RLPF   N+I+  PL KIHCDLWGPAP+ S Q+FKYYV F+DD +RYTWLYPLK KSDFF +F  FQ++VENQF+RK
Subjt:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK

Query:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP
        I+IFQ DGGGEF   AF  HL   GI   +SCP TP+QNGVAERKHRH+VETGL+++F ++ P   W++AF+TAV+LINR+PS  L M TP+ KL G  P
Subjt:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP

Query:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP
        +Y+S++VFGC+CFPYLR+Y+ +KF  ++YPC+F+GYSP+HKGYRCL P + R+Y+SRHV+FDE   P+         T     +  +   E + G    P
Subjt:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP

Query:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------
                    Q    H F ES     T+P ++     +TP   +     QPN +S+     P   T     TPS    S  +    +S  QN      
Subjt:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------

Query:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME
             + P +   L        P  ++   +D S  A  S   T+    G  +      P+ P+      +S     + +      H   I+ K     E
Subjt:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME

Query:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT
        PKS KSAL+  HW+ AM +E+ AL +N TW LVP+ +  NI+GS+W+FKTKLK DGSIER+KARLVA+GY Q+EGLD+ ETFSPV+KPTTIR+VL++A+T
Subjt:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT

Query:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV
         GW LRQLDVKNAFLHG+LKE V+M QPPGF  P  P HVC L++++YGLKQAPRAWF+R S FL+Q GF+CS +D S F+F+++  T+++L+YVDDIIV
Subjt:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV

Query:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL
        T +   H+  LI KLS EFA+KDLGPL+YFLG++V H   G+ LSQ KYA++IL K  M +    GTP++           LVNAT YRSIVG+LQYLTL
Subjt:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL

Query:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE
        TRPD+T+AVN VCQ +  P     +AVKRILRYL GT ++GI    +SSL LY F DADW GCPDTRRSTTGYCI+LGANCISW SKKQ T++RSS+EAE
Subjt:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE

Query:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK
        YRAMA   AELTW++++L+D+G+     P LFCDN SAL M++NPVFHARTKHIE+DYHFVRE+VA G L TRYVPSQ Q+AD+ TK +SK+ F + RSK
Subjt:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK

Query:  LGV
        LGV
Subjt:  LGV

A0A2N9FTN5 Integrase catalytic domain-containing protein0.0e+0049.96Show/hide
Query:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL
        CQICG+  H+AL C  R+D+ ++  ++I +ALA  ++ + +D    Y D+GAT+H+ +  G L     Y G++ + VGNG  L I+HVG   L +   +L
Subjt:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL

Query:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH
        NL +VLVVP ++KNL+S+ KL  DN C     A +F IK+   G+ +  G K +GLYAL+    +AA  A  + KAP  IWH+R+GH +   L  L + +
Subjt:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH

Query:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK
        ++D+S W K E +C +CQ+ K  RLPF   N+I+  PL KIHCDLWGPAP+ S Q+FKYYV F+DD +RYTWLYPLK KSDFF +F  FQ++VENQF+RK
Subjt:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK

Query:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP
        I+IFQ DGGGEF   AF  HL   GI   +SCP TP+QNGVAERKHRH+VETGL+++F ++ P   W++AF+TAV+LINR+PS  L M TP+ KL G  P
Subjt:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP

Query:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP
        +Y+S++VFGC+CFPYLR+Y+ +KF  ++YPC+F+GYSP+HKGYRCL P + R+Y+SRHV+FDE   P+         T     +  +   E + G    P
Subjt:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP

Query:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------
                    Q    H F ES     T+P ++     +TP   +     QPN +S+     P   T     TPS    S  +    +S  QN      
Subjt:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------

Query:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME
             + P +   L        P  ++   +D S  A  S   T+    G  +      P+ P+      +S     + +      H   I+ K     E
Subjt:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME

Query:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT
        PKS KSAL+  HW+ AM +E+ AL +N TW LVP+ +  NI+GS+W+FKTKLK DGSIER+KARLVA+GY Q+EGLD+ ETFSPV+KPTTIR+VL++A+T
Subjt:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT

Query:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV
         GW LRQLDVKNAFLHG+LKE V+M QPPGF  P  P HVC L++++YGLKQAPRAWF+R S FL+Q GF+CS +D S F+F+++  T+++L+YVDDIIV
Subjt:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV

Query:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL
        T +   H+  LI KLS EFA+KDLGPL+YFLG++V H   G+ LSQ KYA++IL K  M +    GTP++           LVNAT YRSIVG+LQYLTL
Subjt:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL

Query:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE
        TRPD+T+AVN VCQ +  P     +AVKRILRYL GT ++GI    +SSL LY F DADW GCPDTRRSTTGYCI+LGANCISW SKKQ T++RSS+EAE
Subjt:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE

Query:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK
        YRAMA   AELTW++++L+D+G+     P LFCDN SAL M++NPVFHARTKHIE+DYHFVRE+VA G L TRYVPSQ Q+AD+ TK +SK+ F + RSK
Subjt:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK

Query:  LGV
        LGV
Subjt:  LGV

A0A2N9IWS3 Integrase catalytic domain-containing protein0.0e+0049.96Show/hide
Query:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL
        CQICG+  H+AL C  R+D+ ++  ++I +ALA  ++ + +D    Y D+GAT+H+ +  G L     Y G++ + VGNG  L I+HVG   L +   +L
Subjt:  CQICGKPNHTALHCRSRFDHTFQL-DDIPKALAALSIHEPNDG-NLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNL

Query:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH
        NL +VLVVP ++KNL+S+ KL  DN C     A +F IK+   G+ +  G K +GLYAL+    +AA  A  + KAP  IWH+R+GH +   L  L + +
Subjt:  NLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKN-AQGQTLGKGYKRKGLYALEEHQQLAAYTA--STKAPFSIWHKRMGHLNDSALKHLCNLH

Query:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK
        ++D+S W K E +C +CQ+ K  RLPF   N+I+  PL KIHCDLWGPAP+ S Q+FKYYV F+DD +RYTWLYPLK KSDFF +F  FQ++VENQF+RK
Subjt:  LLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRK

Query:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP
        I+IFQ DGGGEF   AF  HL   GI   +SCP TP+QNGVAERKHRH+VETGL+++F ++ P   W++AF+TAV+LINR+PS  L M TP+ KL G  P
Subjt:  IKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEP

Query:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP
        +Y+S++VFGC+CFPYLR+Y+ +KF  ++YPC+F+GYSP+HKGYRCL P + R+Y+SRHV+FDE   P+         T     +  +   E + G    P
Subjt:  NYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKP

Query:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------
                    Q    H F ES     T+P ++     +TP   +     QPN +S+     P   T     TPS    S  +    +S  QN      
Subjt:  ENHNNSDLEVAKQCRNAHWFEESEHE-ATIPVANNQLQEETPSQPTQDNEGQPNLASN---QEPTQITCIEDHTPSQLDNSRCVDQLNISGHQN------

Query:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME
             + P +   L        P  ++   +D S  A  S   T+    G  +      P+ P+      +S     + +      H   I+ K     E
Subjt:  ----QNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATK--QEME

Query:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT
        PKS KSAL+  HW+ AM +E+ AL +N TW LVP+ +  NI+GS+W+FKTKLK DGSIER+KARLVA+GY Q+EGLD+ ETFSPV+KPTTIR+VL++A+T
Subjt:  PKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVT

Query:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV
         GW LRQLDVKNAFLHG+LKE V+M QPPGF  P  P HVC L++++YGLKQAPRAWF+R S FL+Q GF+CS +D S F+F+++  T+++L+YVDDIIV
Subjt:  CGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIV

Query:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL
        T +   H+  LI KLS EFA+KDLGPL+YFLG++V H   G+ LSQ KYA++IL K  M +    GTP++           LVNAT YRSIVG+LQYLTL
Subjt:  TGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTL

Query:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE
        TRPD+T+AVN VCQ +  P     +AVKRILRYL GT ++GI    +SSL LY F DADW GCPDTRRSTTGYCI+LGANCISW SKKQ T++RSS+EAE
Subjt:  TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAE

Query:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK
        YRAMA   AELTW++++L+D+G+     P LFCDN SAL M++NPVFHARTKHIE+DYHFVRE+VA G L TRYVPSQ Q+AD+ TK +SK+ F + RSK
Subjt:  YRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSK

Query:  LGV
        LGV
Subjt:  LGV

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0052.9Show/hide
Query:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN
        ++CQIC K NH+AL C +RFD+ +QL++IP+ALAA+++H E  D N Y DSGAT HI NDPGK++    Y+G++ ++VGNGEAL I+H+G  +LKT   +
Subjt:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN

Query:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL
        L L+ +LVVP +KKNLLSI +L  DN CSI F++  FVIK+   Q L +G K+ GLYALEE+  Q    T S+KA   +WH+RMGH    ++K L +   
Subjt:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL

Query:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI
        +++SSW K   +CV+CQL K  +LPF LRN+ISS PL+KIHCDLWGPAP  S Q +KYY  FIDD +RYTWLYPL++KSDFF+ F KFQ LVENQ +R+I
Subjt:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI

Query:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN
        KIFQSDGGGEFQS+ F++HL + GI  Q+SCP TP+QNGVAERKHRH+VE GL+++F +K P   WVDAFLTAV+LINR+PS  L M++P+  LF + P 
Subjt:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN

Query:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE
        Y S+R+FGC+CFPYLR+Y  +KFS +TYPCVF+GYS +HKGYRCL P + R+YISRHVI                    S++++ FL             
Subjt:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE

Query:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT
                +    +N H              N  +  ETP+                         DH       +  + +L++          +I T  
Subjt:  NHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLT

Query:  PQEAL--SLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWK
        P  +    +PD S  + VD S              QGQ  + K        HM+TR K   DP L S +     ++  AT+ ++ EPK++++ LKIPHW 
Subjt:  PQEAL--SLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWK

Query:  AAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAF
         AM+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++Q+ GLD+ ETFSPV+K TTIR++ ++AVT GWK+RQLDVKNAF
Subjt:  AAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAF

Query:  LHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQK
        LHG+LKEEVFM QPPGF +  L  HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  
Subjt:  LHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQK

Query:  LSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQ
        LS EF+LKDLG LHYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+  +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ
Subjt:  LSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQ

Query:  QLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWI
          Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  FCDADW GC +TRRST+GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW+
Subjt:  QLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWI

Query:  SFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        +F+L+DIGI + +PPQL CDN+SAL M +N VFHAR+KHIE+DYHFVRE+VA G+LITR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  SFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

A0A438J6E1 Retrovirus-related Pol polyprotein from transposon RE10.0e+0049.51Show/hide
Query:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN
        ++CQIC K NH+AL C +RFD+++Q ++IP+ALAA+++H E  D N Y DSGATTHI ND                                        
Subjt:  MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIH-EPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSN

Query:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL
                                     + F++  FVIK+   Q L +G K+ GLYALEE+  Q    T S+KA   +WH+RMGH    ++K L +   
Subjt:  LNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQ-QLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHL

Query:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI
        +++SSW K   +CV+CQL K  +LPF LRN+ISS PL+KIHCDLWGPAP  S Q +KYY  FIDD +RYTWLYPL++KSDFF+ F KFQ LVENQ +R+I
Subjt:  LDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKI

Query:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN
        KIFQSDGGGEFQS+ F++HL + GI  Q+SCP TP+QNGVAERKHRH+VE GL+++F +K P   WVDAFLTAV+LINR+PS  L M++P+  LF + P 
Subjt:  KIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPN

Query:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE
        Y S+R+FGC+CFPYLR+Y  +KFS +TYPCVF+GYS +HKGYRCL P + R+YISRHVIF+EN FP+D            +E++ F   +     E    
Subjt:  YSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPE

Query:  NHNNSDLE--VAKQCRNAHWFEESEHEATIPVANNQLQEE-------TPSQPTQ----DNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGH
        +H     +   A Q  N+   ++  H        NQ   E         +Q T+    ++ G P + + +  T      DH       +     L  S H
Subjt:  NHNNSDLE--VAKQCRNAHWFEESEHEATIPVANNQLQEE-------TPSQPTQ----DNEGQPNLASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGH

Query:  QNQNNPQIIQTLT---------PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRG
         N +      TL+         P +  + PDIS  + VD S F     +  + D   Q  +    A                 P   H      HM+TR 
Subjt:  QNQNNPQIIQTLT---------PQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKA-----------------PVEPH------HMVTRG

Query:  KSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYT
        K   DP L S +     ++  A + ++ EPK++++ LKIPHW   M+EE+KAL++N TWDLVP+P  TNI+GSKW+FKTKLKEDG+I+RYKARLVA+G++
Subjt:  KSAKDPQLYSTIHKKGHLSLIATKQEM-EPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYT

Query:  QVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFF
        Q+ GLD+ ETFSPV+K TTIR++ ++AVT GWK+RQLDVKNAFLHG+LKEEVFM QPPGF +  LP HVCKLNRSLYGLKQAPRAWF+RLSQ L+  GF 
Subjt:  QVEGLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFF

Query:  CSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSN
        C  +D S FI +  +  +++LIYVDDIIVTGN    + +LI  LS EF+LKDLG LHYFLG+EV +  +G+ +SQ KY RD+L  TKM+E +   TPM+ 
Subjt:  CSHSDPSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSN

Query:  YTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTT
         +   + D + ++ T+YR +VGSLQYLT TRPDI +AVNK CQ  Q P   DL+AVKRILRYL GT  HGI F+K SSL+L  FCDADW GC +TRRST+
Subjt:  YTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTT

Query:  GYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLI
        GYCIFLGANCISW SK+QPT++RSS+EAEYR++A + AE+TW++F+L+DIGI + +PPQL CDN+SAL M++NPVFHAR+KHIE+DYHFVRE+ A G+LI
Subjt:  GYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLI

Query:  TRYVPSQDQLADILTKPLSKESFKKLRSKLGV
        TR++PS  Q+ADI TK L K SF+  R KLGV
Subjt:  TRYVPSQDQLADILTKPLSKESFKKLRSKLGV

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.5e-14629.94Show/hide
Query:  CQICGKPNHTALHC--------RSRFDHTFQLDDIPKALAALSIHEPNDGNL------YADSGATTHIINDPGKLNLKSIYRGNEKL-------YVGNGE
        C  CG+  H    C            ++  Q+        A  + E N+ ++        DSGA+ H+IND      +S+Y  + ++           GE
Subjt:  CQICGKPNHTALHC--------RSRFDHTFQLDDIPKALAALSIHEPNDGNL------YADSGATTHIINDPGKLNLKSIYRGNEKL-------YVGNGE

Query:  ALDITHVGSGKLKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKG----YKRKGLYALEEHQQLAAYT--ASTKAPF
         +  T  G  +L+     + L++VL       NL+S+ +L    +  +    DK       G T+ K      K  G+          AY+  A  K  F
Subjt:  ALDITHVGSGKLKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKG----YKRKGLYALEEHQQLAAYT--ASTKAPF

Query:  SIWHKRMGHLNDSALKHLCNLHLL-DISSWKKDEL---ICVNCQLSKRSRLPFQLRNEISSV--PLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTW
         +WH+R GH++D  L  +   ++  D S     EL   IC  C   K++RLPF+   + + +  PL  +H D+ GP   ++     Y+V F+D F+ Y  
Subjt:  SIWHKRMGHLNDSALKHLCNLHLL-DISSWKKDEL---ICVNCQLSKRSRLPFQLRNEISSV--PLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTW

Query:  LYPLKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFL
         Y +K KSD F  FQ F    E  F+ K+     D G E+ S   +    + GI + L+ PHTPQ NGV+ER  R + E   +++  +K    +W +A L
Subjt:  LYPLKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFL

Query:  TAVFLINRMPSKTL--NMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDK
        TA +LINR+PS+ L  + +TPY     K+P    +RVFG   + +++N    KF  +++  +FVGY P   G++  D ++ +  ++R V+ DE       
Subjt:  TAVFLINRMPSKTL--NMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDK

Query:  QTNNIQETPKSLEVIEFLGEEWMQGAEEKPENHNNSDLEV--------AKQCRNAHWFEESEHEATIPVANNQ---LQEETPSQPTQDNEGQPNLASNQE
         TN +       E + FL +      E + +N  N   ++        +K+C N  + ++S+        N+    +Q E P++ +++ +    L  ++E
Subjt:  QTNNIQETPKSLEVIEFLGEEWMQGAEEKPENHNNSDLEV--------AKQCRNAHWFEESEHEATIPVANNQ---LQEETPSQPTQDNEGQPNLASNQE

Query:  PTQITCIEDHTPSQLDNSRCVDQLNIS-GHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKS
          +      +  ++    +  D LN S G  N N  +  +T    + + + + + +  ++        ++T       ++ N   K  +  H +     +
Subjt:  PTQITCIEDHTPSQLDNSRCVDQLNIS-GHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKS

Query:  AKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVE
        + D                    E++ +  KS+     W+ A+  E+ A   N+TW +  +P + NI+ S+W+F  K  E G+  RYKARLVA+G+TQ  
Subjt:  AKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVE

Query:  GLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSH
         +DYEETF+PV + ++ R +L++ +    K+ Q+DVK AFL+G LKEE++M  P G    S   +VCKLN+++YGLKQA R WFE   Q L +  F  S 
Subjt:  GLDYEETFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSH

Query:  SDPSFFIFKTNEI--TMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPM-SN
         D   +I     I   + +L+YVDD+++       M    + L  +F + DL  + +F+GI +    D I LSQ  Y + IL K  M   +   TP+ S 
Subjt:  SDPSFFIFKTNEI--TMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPM-SN

Query:  YTSNCANDNELVNATEYRSIVGSLQYLTL-TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSL--KLYAFCDADWGGCPDTRR
              N +E  N T  RS++G L Y+ L TRPD+T AVN + +       +  + +KR+LRYL GT +  + F KN +   K+  + D+DW G    R+
Subjt:  YTSNCANDNELVNATEYRSIVGSLQYLTL-TRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSL--KLYAFCDADWGGCPDTRR

Query:  STTGYCI-FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVAL
        STTGY       N I W +K+Q ++A SS+EAEY A+     E  W+ F+L  I I +  P +++ DN   + ++ NP  H R KHI+I YHF RE+V  
Subjt:  STTGYCI-FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVAL

Query:  GLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV
         ++   Y+P+++QLADI TKPL    F +LR KLG+
Subjt:  GLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-15132.65Show/hide
Query:  VGNGEALDITHVGSGKLKTGLS-NLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALE-EHQQLAAYTASTKAP
        +GN     I  +G   +KT +   L L++V  VP ++ NL  IS +A D D    + A++          + KG  R  LY    E  Q     A  +  
Subjt:  VGNGEALDITHVGSGKLKTGLS-NLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALE-EHQQLAAYTASTKAP

Query:  FSIWHKRMGHLNDSALKHLCNLHLLDISSWKKDELI--CVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYP
          +WHKRMGH+++  L+ L    L+   S+ K   +  C  C   K+ R+ FQ  +E     L+ ++ D+ GP  I S    KY+V FIDD SR  W+Y 
Subjt:  FSIWHKRMGHLNDSALKHLCNLHLLDISSWKKDELI--CVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYP

Query:  LKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAV
        LK K   FQ FQKF  LVE +  RK+K  +SD GGE+ S  F+ +   HGI H+ + P TPQ NGVAER +R +VE   S++  +K P  +W +A  TA 
Subjt:  LKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAV

Query:  FLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNI
        +LINR PS  L  + P      KE +YS ++VFGC+ F ++      K   ++ PC+F+GY     GYR  DP+  ++  SR V+F E+     +   ++
Subjt:  FLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNI

Query:  QETPKSLEVIEFLGEEWMQGAEEKPENHNNSDLEVAKQCRNAHWFEESEHEATIP-VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQL
         E  K+  +  F+                                       TIP  +NN    E+ +    +   QP     Q              QL
Subjt:  QETPKSLEVIEFLGEEWMQGAEEKPENHNNSDLEVAKQCRNAHWFEESEHEATIP-VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQL

Query:  DNSRCVDQLNISGHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHL
        D           G +   +P                                   T G++Q Q     E+  VE         S + P            
Subjt:  DNSRCVDQLNISGHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQLYSTIHKKGHL

Query:  SLIATKQEMEPKSFKSALKIP---HWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVK
          +    + EP+S K  L  P       AM+EEM++L +N T+ LV  P     +  KW+FK K   D  + RYKARLV +G+ Q +G+D++E FSPVVK
Subjt:  SLIATKQEMEPKSFKSALKIP---HWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVK

Query:  PTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEI
         T+IR +L++A +   ++ QLDVK AFLHG L+EE++M QP GF+       VCKLN+SLYGLKQAPR W+ +   F+    +  ++SDP  +  + +E 
Subjt:  PTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEI

Query:  T-MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEV--HHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTS-------NCA
          +I+L+YVDD+++ G  +  + +L   LS  F +KDLGP    LG+++    TS  + LSQ KY   +L +  M  A    TP++ +            
Subjt:  T-MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEV--HHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTS-------NCA

Query:  NDNELVNATEYRSIVGSLQY-LTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIF
         +   +    Y S VGSL Y +  TRPDI +AV  V + L+ P  +  +AVK ILRYL GTT   + F  +  + L  + DAD  G  D R+S+TGY   
Subjt:  NDNELVNATEYRSIVGSLQY-LTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIF

Query:  LGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVP
             ISW+SK Q  +A S++EAEY A   T  E+ W+   L+++G+H  +   ++CD+ SA+ +S N ++HARTKHI++ YH++RE V    L    + 
Subjt:  LGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVP

Query:  SQDQLADILTKPLSKESFKKLRSKLGV
        + +  AD+LTK + +  F+  +  +G+
Subjt:  SQDQLADILTKPLSKESFKKLRSKLGV

P92519 Uncharacterized mitochondrial protein AtMg008103.3e-6652.42Show/hide
Query:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEY
        M +L+YVDDI++TG+S   +  LI +LS  F++KDLGP+HYFLGI++     G+ LSQ KYA  IL    M++     TP+    ++  +  +  + +++
Subjt:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEY

Query:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKK
        RSIVG+LQYLTLTRPDI+YAVN VCQ++  P + D   +KR+LRY+ GT  HG+  +KNS L + AFCD+DW GC  TRRSTTG+C FLG N ISW +K+
Subjt:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKK

Query:  QPTIARSSSEAEYRAMAITTAELTWIS
        QPT++RSS+E EYRA+A+T AELTW S
Subjt:  QPTIARSSSEAEYRAMAITTAELTWIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.4e-25241.58Show/hide
Query:  CQICGKPNHTALHCRSRFDH-----TFQLDDIP----KALAALSIHEP-NDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGK
        CQICG   H+A  C S+  H       Q    P    +  A L++  P +  N   DSGAT HI +D   L+L   Y G + + V +G  + I+H GS  
Subjt:  CQICGKPNHTALHCRSRFDH-----TFQLDDIP----KALAALSIHEP-NDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGK

Query:  LKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLY--ALEEHQQLAAYTA-STKAPFSIWHKRMGHLNDS
        L T    LNL N+L VP++ KNL+S+ +L   N  S+ F    F +K+   G  L +G  +  LY   +   Q ++ + + S+KA  S WH R+GH   S
Subjt:  LKTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLY--ALEEHQQLAAYTA-STKAPFSIWHKRMGHLNDS

Query:  ALKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQK
         L  + + + L + +     L C +C ++K +++PF      S+ PL  I+ D+W  +PILS  +++YYV F+D F+RYTWLYPLK+KS   ++F  F+ 
Subjt:  ALKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQK

Query:  LVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTP
        L+EN+F  +I  F SD GGEF  +A   +  QHGI H  S PHTP+ NG++ERKHRH+VETGL+L+  +  P  YW  AF  AV+LINR+P+  L +++P
Subjt:  LVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTP

Query:  YAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDK---QTNNIQETPKSLEVIEFL
        + KLFG  PNY  +RVFGC C+P+LR Y+  K   ++  CVF+GYS     Y CL   ++R+YISRHV FDEN FPF       + +QE  +    +   
Subjt:  YAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDK---QTNNIQETPKSLEVIEFL

Query:  GEEWMQGAEEKPENHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPT-------QITCIEDHTPSQLDNSRCVD
           W             + +  A  C + H         + P  N+Q+     S    D+    +  S+ EPT       Q T     T +Q  +S+   
Subjt:  GEEWMQGAEEKPENHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPNLASNQEPT-------QITCIEDHTPSQLDNSRCVD

Query:  QLNISGHQNQNNPQIIQTL-TP-QEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLI
        Q N +   N++  Q+ Q+L TP Q + S P  +       +     S+         Q  N   +AP+  H M TR K+   K    YS          +
Subjt:  QLNISGHQNQNNPQIIQTL-TP-QEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTIHKKGHLSLI

Query:  ATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIR
        +   E EP++   ALK   W+ AM  E+ A + NHTWDLV P PSH  I+G +WIF  K   DGS+ RYKARLVA+GY Q  GLDY ETFSPV+K T+IR
Subjt:  ATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPTTIR

Query:  IVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIML
        IVL VAV   W +RQLDV NAFL G L ++V+M+QPPGF     P +VCKL ++LYGLKQAPRAW+  L  +L+  GF  S SD S F+ +  +  + ML
Subjt:  IVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIML

Query:  IYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMS-NYTSNCANDNELVNATEYRSI
        +YVDDI++TGN    +   +  LS  F++KD   LHYFLGIE      G+ LSQ +Y  D+L +T MI A    TPM+ +   +  +  +L + TEYR I
Subjt:  IYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMS-NYTSNCANDNELVNATEYRSI

Query:  VGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPT
        VGSLQYL  TRPDI+YAVN++ Q +  P  + L+A+KRILRYL GT NHGI   K ++L L+A+ DADW G  D   ST GY ++LG + ISW SKKQ  
Subjt:  VGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPT

Query:  IARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSK
        + RSS+EAEYR++A T++E+ WI  +L ++GI + +PP ++CDN+ A  +  NPVFH+R KHI IDYHF+R +V  G L   +V + DQLAD LTKPLS+
Subjt:  IARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSK

Query:  ESFKKLRSKLGV
         +F+   SK+GV
Subjt:  ESFKKLRSKLGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-24240.13Show/hide
Query:  CQICGKPNHTALHCRS--RFDHTF--QLDDIP----KALAALSIHEP-NDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKL
        CQIC    H+A  C    +F  T   Q    P    +  A L+++ P N  N   DSGAT HI +D   L+    Y G + + + +G  + ITH GS  L
Subjt:  CQICGKPNHTALHCRS--RFDHTF--QLDDIP----KALAALSIHEP-NDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKL

Query:  KTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLY--ALEEHQQLAAYTA-STKAPFSIWHKRMGHLNDSA
         T   +L+L  VL VP++ KNL+S+ +L   N  S+ F    F +K+   G  L +G  +  LY   +   Q ++ + +  +KA  S WH R+GH + + 
Subjt:  KTGLSNLNLQNVLVVPSMKKNLLSISKLAYDNDCSIIFTADKFVIKNAQ-GQTLGKGYKRKGLY--ALEEHQQLAAYTA-STKAPFSIWHKRMGHLNDSA

Query:  LKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKL
        L  + + H L + +     L C +C ++K  ++PF      SS PL  I+ D+W  +PILS  +++YYV F+D F+RYTWLYPLK+KS    +F  F+ L
Subjt:  LKHLCNLHLLDISSWKKDELICVNCQLSKRSRLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKL

Query:  VENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPY
        VEN+F  +I    SD GGEF  +  + +L QHGI H  S PHTP+ NG++ERKHRH+VE GL+L+  +  P  YW  AF  AV+LINR+P+  L +Q+P+
Subjt:  VENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCPHTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPY

Query:  AKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEW
         KLFG+ PNY  ++VFGC C+P+LR Y+  K   ++  C F+GYS     Y CL   + R+Y SRHV FDE  FPF      +  + +            
Subjt:  AKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGYRCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEW

Query:  MQGAEEKPENHNNSDLEV------AKQCRNAHWFEESEHEATIP-------VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRC
         Q ++  P   +++ L        A  C   H  + S    + P       V+++ L   + S P   +  +P   S+  P Q T     T +   NS  
Subjt:  MQGAEEKPENHNNSDLEV------AKQCRNAHWFEESEHEATIP-------VANNQLQEETPSQPTQDNEGQPNLASNQEPTQITCIEDHTPSQLDNSRC

Query:  VDQLNIS----GHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGD-----KQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTI
        ++  N +       NQN+P       PQ  +S P I           + SS  T+               +  +APV  H M TR K    K  Q YS  
Subjt:  VDQLNIS----GHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGD-----KQGQQHNLKEKAPVEPHHMVTRGKSA--KDPQLYSTI

Query:  HKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFS
              SL A     EP++   A+K   W+ AM  E+ A + NHTWDLV P P    I+G +WIF  K   DGS+ RYKARLVA+GY Q  GLDY ETFS
Subjt:  HKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLV-PKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFS

Query:  PVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFK
        PV+K T+IRIVL VAV   W +RQLDV NAFL G L +EV+M+QPPGF     P +VC+L +++YGLKQAPRAW+  L  +L+  GF  S SD S F+ +
Subjt:  PVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFK

Query:  TNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTS-NCANDNEL
             + ML+YVDDI++TGN    ++  +  LS  F++K+   LHYFLGIE      G+ LSQ +Y  D+L +T M+ A    TPM+        +  +L
Subjt:  TNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTS-NCANDNEL

Query:  VNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCI
         + TEYR IVGSLQYL  TRPD++YAVN++ Q +  P      A+KR+LRYL GT +HGI   K ++L L+A+ DADW G  D   ST GY ++LG + I
Subjt:  VNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCI

Query:  SWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLA
        SW SKKQ  + RSS+EAEYR++A T++EL WI  +L ++GI +  PP ++CDN+ A  +  NPVFH+R KHI +DYHF+R +V  G L   +V + DQLA
Subjt:  SWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLA

Query:  DILTKPLSKESFKKLRSKLGV
        D LTKPLS+ +F+    K+GV
Subjt:  DILTKPLSKESFKKLRSKLGV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-12143.76Show/hide
Query:  YSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEE
        Y  +    H  L+   +  EP ++  A +   W  AM++E+ A+   HTW++   P +   IG KW++K K   DG+IERYKARLVA+GYTQ EG+D+ E
Subjt:  YSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEE

Query:  TFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGF---QHPSLPQH-VCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSD
        TFSPV K T+++++LA++    + L QLD+ NAFL+G L EE++M  PPG+   Q  SLP + VC L +S+YGLKQA R WF + S  L+  GF  SHSD
Subjt:  TFSPVVKPTTIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGF---QHPSLPQH-VCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSD

Query:  PSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMS-NYTSN
         ++F+  T  + + +L+YVDDII+  N++  ++EL  +L   F L+DLGPL YFLG+E+  ++ GI + Q KYA D+L +T ++       PM  + T +
Subjt:  PSFFIFKTNEITMIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMS-NYTSN

Query:  CANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCI
          +  + V+A  YR ++G L YL +TR DI++AVNK+ Q  + PR+   +AV +IL Y+ GT   G+ +   + ++L  F DA +  C DTRRST GYC+
Subjt:  CANDNELVNATEYRSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCI

Query:  FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRER
        FLG + ISWKSKKQ  +++SS+EAEYRA++  T E+ W++   +++ + + +P  LFCDN +A+ ++ N VFH RTKHIE D H VRER
Subjt:  FLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLKDIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRER

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.8e-1847.73Show/hide
Query:  YLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCI-----FLGA
        YLT+TRPD+T+AVN++ Q     R   ++AV ++L Y+ GT   G+ +   S L+L AF D+DW  CPDTRRS TG+C      FLGA
Subjt:  YLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCI-----FLGA

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.6e-0433.82Show/hide
Query:  HRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCF
        +R ++E   S++ +   P  +  DA  TAV +IN+ PS  +N   P    F   P YS +R FGC  +
Subjt:  HRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCF

ATMG00810.1 DNA/RNA polymerases superfamily protein2.3e-6752.42Show/hide
Query:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEY
        M +L+YVDDI++TG+S   +  LI +LS  F++KDLGP+HYFLGI++     G+ LSQ KYA  IL    M++     TP+    ++  +  +  + +++
Subjt:  MIMLIYVDDIIVTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEY

Query:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKK
        RSIVG+LQYLTLTRPDI+YAVN VCQ++  P + D   +KR+LRY+ GT  HG+  +KNS L + AFCD+DW GC  TRRSTTG+C FLG N ISW +K+
Subjt:  RSIVGSLQYLTLTRPDITYAVNKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKK

Query:  QPTIARSSSEAEYRAMAITTAELTWIS
        QPT++RSS+E EYRA+A+T AELTW S
Subjt:  QPTIARSSSEAEYRAMAITTAELTWIS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.4e-2649.62Show/hide
Query:  MVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVA
        M+TR K+       + ++ K  L+ I T  + EPKS   ALK P W  AM+EE+ AL  N TW LVP P + NI+G KW+FKTKL  DG+++R KARLVA
Subjt:  MVTRGKSAKDPQLYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVA

Query:  QGYTQVEGLDYEETFSPVVKPTTIRIVLAVA
        +G+ Q EG+ + ET+SPVV+  TIR +L VA
Subjt:  QGYTQVEGLDYEETFSPVVKPTTIRIVLAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTGTCAAATTTGTGGAAAGCCAAATCATACTGCCCTACATTGTAGAAGTCGTTTTGACCATACATTCCAGTTAGATGACATACCGAAGGCACTTGCAGCATTAAG
TATACATGAGCCGAATGATGGGAATCTTTATGCAGATTCGGGAGCCACCACTCATATAATAAATGATCCAGGTAAGTTAAACCTCAAAAGCATCTATAGAGGTAATGAAA
AATTATATGTAGGAAATGGAGAGGCCCTAGATATTACTCATGTCGGAAGTGGCAAATTAAAAACTGGCTTAAGCAATTTAAATTTACAAAATGTTTTGGTTGTTCCTAGC
ATGAAAAAGAATTTACTTTCCATTAGTAAACTGGCTTATGATAATGATTGCTCTATTATCTTTACTGCTGACAAGTTTGTTATTAAGAATGCTCAGGGACAGACATTGGG
AAAGGGGTATAAGCGTAAAGGACTATATGCCTTAGAAGAACATCAACAACTTGCTGCCTACACTGCCTCAACAAAGGCCCCCTTCTCAATTTGGCACAAAAGAATGGGAC
ACCTAAATGATAGTGCTTTAAAACACCTATGCAATTTACATCTTCTTGATATATCCAGCTGGAAAAAGGATGAATTAATATGTGTAAACTGTCAACTAAGCAAGCGCTCC
CGTTTACCTTTTCAATTAAGAAATGAAATAAGTAGTGTTCCTTTGAATAAAATCCATTGTGACCTCTGGGGTCCTGCACCTATCCTATCTTGTCAACATTTTAAGTATTA
TGTATGCTTTATTGATGATTTCTCTCGTTATACTTGGTTATATCCATTGAAAAAGAAATCTGACTTCTTTCAAAGCTTTCAAAAGTTTCAAAAATTAGTTGAAAATCAAT
TTGACAGGAAGATAAAGATATTTCAAAGTGATGGAGGAGGTGAGTTCCAATCGTTGGCATTTAAACATCATTTGGAACAACACGGCATCCATCACCAACTATCATGCCCC
CACACTCCTCAACAGAACGGAGTGGCTGAGAGAAAGCATAGGCATGTGGTCGAAACAGGTTTATCCTTAATCTTTCAATCTAAGACTCCTTTTAAGTATTGGGTAGATGC
ATTCTTAACTGCTGTGTTTCTTATAAATAGAATGCCATCCAAAACTTTAAATATGCAAACTCCATATGCTAAACTCTTTGGAAAAGAACCAAATTATAGTAGTATAAGAG
TTTTTGGGTGCAAATGTTTTCCATATTTACGAAATTACTCAAATGATAAGTTCTCTAAAAGGACCTATCCATGTGTGTTTGTTGGATATAGCCCCATTCATAAAGGTTAT
AGATGTCTAGACCCCATTTCAAACCGAATATATATATCTAGACATGTAATTTTTGATGAAAACACCTTTCCCTTTGATAAACAAACTAACAACATACAAGAGACTCCAAA
ATCTCTTGAAGTTATTGAGTTTTTAGGTGAAGAATGGATGCAAGGAGCAGAAGAAAAACCAGAAAATCACAATAATAGTGATCTGGAAGTTGCAAAACAGTGTAGAAATG
CACACTGGTTTGAAGAAAGTGAACATGAAGCAACCATCCCAGTGGCGAATAATCAATTGCAGGAAGAAACACCATCTCAACCTACTCAAGACAATGAAGGACAGCCAAAT
TTAGCATCTAACCAAGAGCCTACACAAATTACATGCATTGAAGACCACACTCCAAGCCAACTAGACAATAGTAGATGTGTGGACCAGCTTAACATCAGTGGGCATCAAAA
CCAGAATAATCCCCAAATCATCCAAACTTTAACCCCACAGGAAGCTCTGTCCTTACCTGATATAAGCAATGACATGTTTGTTGATTTTTCCATTTTTGCAGGAAGCTCAA
TGCAGACCACAAATGGGGATAAACAAGGACAACAACATAACCTTAAGGAAAAAGCTCCTGTTGAGCCTCATCACATGGTCACTAGGGGAAAATCGGCTAAGGATCCTCAA
CTATACTCCACGATACATAAAAAAGGACATTTATCCTTGATTGCCACTAAACAAGAAATGGAGCCAAAGTCTTTCAAATCCGCCCTCAAGATTCCCCATTGGAAGGCGGC
TATGGAAGAGGAAATGAAGGCACTTTTGGAAAACCACACATGGGACCTTGTACCCAAACCATCTCACACAAACATCATTGGGTCAAAATGGATATTTAAGACCAAACTAA
AAGAAGATGGATCCATCGAACGGTACAAGGCTCGCCTAGTTGCTCAGGGATACACTCAAGTTGAGGGACTTGACTATGAAGAGACATTTAGTCCTGTAGTGAAACCAACC
ACTATACGAATTGTCCTAGCTGTGGCAGTAACGTGCGGATGGAAATTAAGGCAACTAGATGTTAAAAATGCCTTCCTCCATGGTTACCTTAAGGAAGAAGTCTTCATGGC
TCAACCACCTGGTTTCCAACATCCAAGCCTTCCTCAACATGTGTGCAAATTAAACCGCTCTCTTTATGGTCTTAAACAAGCCCCTAGAGCTTGGTTCGAAAGACTATCAC
AGTTCCTCGTGCAAACAGGATTCTTCTGCTCTCACTCTGATCCCTCATTTTTCATTTTTAAAACAAATGAGATCACTATGATCATGCTAATCTATGTAGATGATATTATT
GTTACAGGTAACAGTGAAAAGCATATGGAGGAACTCATTCAAAAACTCAGCATAGAGTTTGCCCTCAAAGACCTTGGACCCCTCCACTACTTCCTAGGCATTGAAGTCCA
CCACACATCAGATGGCATCATGCTATCACAAGGAAAATATGCTAGAGACATCCTCATCAAAACAAAAATGATAGAAGCCTCTCAATATGGAACTCCAATGAGCAATTACA
CCTCAAATTGTGCAAATGACAATGAACTTGTCAATGCAACTGAATACAGGAGCATTGTGGGATCACTCCAATATCTGACCCTTACTCGACCTGATATCACTTATGCAGTC
AACAAAGTGTGTCAACAACTCCAAACACCAAGGATAAAGGATCTAAAGGCGGTCAAAAGGATACTACGATATCTAAATGGGACAACAAATCACGGTATATCTTTTTATAA
AAATAGCTCACTTAAACTCTATGCTTTCTGTGATGCAGATTGGGGAGGCTGTCCAGACACTCGAAGAAGTACCACAGGATATTGCATATTCCTTGGAGCAAATTGCATAT
CATGGAAGTCAAAGAAACAACCAACAATTGCAAGGTCAAGTTCAGAAGCAGAATACCGAGCCATGGCAATCACCACAGCTGAGCTCACTTGGATAAGCTTTGTCCTCAAA
GATATTGGCATACATATGATGCAACCTCCACAACTGTTTTGCGACAACATGAGTGCACTACGAATGTCCATCAATCCCGTATTCCATGCTAGGACAAAACACATTGAGAT
CGATTACCATTTTGTAAGAGAAAGAGTAGCTCTTGGTCTCCTCATCACTCGGTATGTTCCCTCTCAAGACCAACTAGCTGACATTCTCACCAAGCCTCTCTCAAAAGAAT
CATTTAAGAAGCTCAGAAGCAAACTTGGCGTCCGCTGCACTTCCTCAAGTTTGAGGAAGAATGAAAGAAAACAAGTCAATCCAACAAGTTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTGTCAAATTTGTGGAAAGCCAAATCATACTGCCCTACATTGTAGAAGTCGTTTTGACCATACATTCCAGTTAGATGACATACCGAAGGCACTTGCAGCATTAAG
TATACATGAGCCGAATGATGGGAATCTTTATGCAGATTCGGGAGCCACCACTCATATAATAAATGATCCAGGTAAGTTAAACCTCAAAAGCATCTATAGAGGTAATGAAA
AATTATATGTAGGAAATGGAGAGGCCCTAGATATTACTCATGTCGGAAGTGGCAAATTAAAAACTGGCTTAAGCAATTTAAATTTACAAAATGTTTTGGTTGTTCCTAGC
ATGAAAAAGAATTTACTTTCCATTAGTAAACTGGCTTATGATAATGATTGCTCTATTATCTTTACTGCTGACAAGTTTGTTATTAAGAATGCTCAGGGACAGACATTGGG
AAAGGGGTATAAGCGTAAAGGACTATATGCCTTAGAAGAACATCAACAACTTGCTGCCTACACTGCCTCAACAAAGGCCCCCTTCTCAATTTGGCACAAAAGAATGGGAC
ACCTAAATGATAGTGCTTTAAAACACCTATGCAATTTACATCTTCTTGATATATCCAGCTGGAAAAAGGATGAATTAATATGTGTAAACTGTCAACTAAGCAAGCGCTCC
CGTTTACCTTTTCAATTAAGAAATGAAATAAGTAGTGTTCCTTTGAATAAAATCCATTGTGACCTCTGGGGTCCTGCACCTATCCTATCTTGTCAACATTTTAAGTATTA
TGTATGCTTTATTGATGATTTCTCTCGTTATACTTGGTTATATCCATTGAAAAAGAAATCTGACTTCTTTCAAAGCTTTCAAAAGTTTCAAAAATTAGTTGAAAATCAAT
TTGACAGGAAGATAAAGATATTTCAAAGTGATGGAGGAGGTGAGTTCCAATCGTTGGCATTTAAACATCATTTGGAACAACACGGCATCCATCACCAACTATCATGCCCC
CACACTCCTCAACAGAACGGAGTGGCTGAGAGAAAGCATAGGCATGTGGTCGAAACAGGTTTATCCTTAATCTTTCAATCTAAGACTCCTTTTAAGTATTGGGTAGATGC
ATTCTTAACTGCTGTGTTTCTTATAAATAGAATGCCATCCAAAACTTTAAATATGCAAACTCCATATGCTAAACTCTTTGGAAAAGAACCAAATTATAGTAGTATAAGAG
TTTTTGGGTGCAAATGTTTTCCATATTTACGAAATTACTCAAATGATAAGTTCTCTAAAAGGACCTATCCATGTGTGTTTGTTGGATATAGCCCCATTCATAAAGGTTAT
AGATGTCTAGACCCCATTTCAAACCGAATATATATATCTAGACATGTAATTTTTGATGAAAACACCTTTCCCTTTGATAAACAAACTAACAACATACAAGAGACTCCAAA
ATCTCTTGAAGTTATTGAGTTTTTAGGTGAAGAATGGATGCAAGGAGCAGAAGAAAAACCAGAAAATCACAATAATAGTGATCTGGAAGTTGCAAAACAGTGTAGAAATG
CACACTGGTTTGAAGAAAGTGAACATGAAGCAACCATCCCAGTGGCGAATAATCAATTGCAGGAAGAAACACCATCTCAACCTACTCAAGACAATGAAGGACAGCCAAAT
TTAGCATCTAACCAAGAGCCTACACAAATTACATGCATTGAAGACCACACTCCAAGCCAACTAGACAATAGTAGATGTGTGGACCAGCTTAACATCAGTGGGCATCAAAA
CCAGAATAATCCCCAAATCATCCAAACTTTAACCCCACAGGAAGCTCTGTCCTTACCTGATATAAGCAATGACATGTTTGTTGATTTTTCCATTTTTGCAGGAAGCTCAA
TGCAGACCACAAATGGGGATAAACAAGGACAACAACATAACCTTAAGGAAAAAGCTCCTGTTGAGCCTCATCACATGGTCACTAGGGGAAAATCGGCTAAGGATCCTCAA
CTATACTCCACGATACATAAAAAAGGACATTTATCCTTGATTGCCACTAAACAAGAAATGGAGCCAAAGTCTTTCAAATCCGCCCTCAAGATTCCCCATTGGAAGGCGGC
TATGGAAGAGGAAATGAAGGCACTTTTGGAAAACCACACATGGGACCTTGTACCCAAACCATCTCACACAAACATCATTGGGTCAAAATGGATATTTAAGACCAAACTAA
AAGAAGATGGATCCATCGAACGGTACAAGGCTCGCCTAGTTGCTCAGGGATACACTCAAGTTGAGGGACTTGACTATGAAGAGACATTTAGTCCTGTAGTGAAACCAACC
ACTATACGAATTGTCCTAGCTGTGGCAGTAACGTGCGGATGGAAATTAAGGCAACTAGATGTTAAAAATGCCTTCCTCCATGGTTACCTTAAGGAAGAAGTCTTCATGGC
TCAACCACCTGGTTTCCAACATCCAAGCCTTCCTCAACATGTGTGCAAATTAAACCGCTCTCTTTATGGTCTTAAACAAGCCCCTAGAGCTTGGTTCGAAAGACTATCAC
AGTTCCTCGTGCAAACAGGATTCTTCTGCTCTCACTCTGATCCCTCATTTTTCATTTTTAAAACAAATGAGATCACTATGATCATGCTAATCTATGTAGATGATATTATT
GTTACAGGTAACAGTGAAAAGCATATGGAGGAACTCATTCAAAAACTCAGCATAGAGTTTGCCCTCAAAGACCTTGGACCCCTCCACTACTTCCTAGGCATTGAAGTCCA
CCACACATCAGATGGCATCATGCTATCACAAGGAAAATATGCTAGAGACATCCTCATCAAAACAAAAATGATAGAAGCCTCTCAATATGGAACTCCAATGAGCAATTACA
CCTCAAATTGTGCAAATGACAATGAACTTGTCAATGCAACTGAATACAGGAGCATTGTGGGATCACTCCAATATCTGACCCTTACTCGACCTGATATCACTTATGCAGTC
AACAAAGTGTGTCAACAACTCCAAACACCAAGGATAAAGGATCTAAAGGCGGTCAAAAGGATACTACGATATCTAAATGGGACAACAAATCACGGTATATCTTTTTATAA
AAATAGCTCACTTAAACTCTATGCTTTCTGTGATGCAGATTGGGGAGGCTGTCCAGACACTCGAAGAAGTACCACAGGATATTGCATATTCCTTGGAGCAAATTGCATAT
CATGGAAGTCAAAGAAACAACCAACAATTGCAAGGTCAAGTTCAGAAGCAGAATACCGAGCCATGGCAATCACCACAGCTGAGCTCACTTGGATAAGCTTTGTCCTCAAA
GATATTGGCATACATATGATGCAACCTCCACAACTGTTTTGCGACAACATGAGTGCACTACGAATGTCCATCAATCCCGTATTCCATGCTAGGACAAAACACATTGAGAT
CGATTACCATTTTGTAAGAGAAAGAGTAGCTCTTGGTCTCCTCATCACTCGGTATGTTCCCTCTCAAGACCAACTAGCTGACATTCTCACCAAGCCTCTCTCAAAAGAAT
CATTTAAGAAGCTCAGAAGCAAACTTGGCGTCCGCTGCACTTCCTCAAGTTTGAGGAAGAATGAAAGAAAACAAGTCAATCCAACAAGTTTATGA
Protein sequenceShow/hide protein sequence
MVCQICGKPNHTALHCRSRFDHTFQLDDIPKALAALSIHEPNDGNLYADSGATTHIINDPGKLNLKSIYRGNEKLYVGNGEALDITHVGSGKLKTGLSNLNLQNVLVVPS
MKKNLLSISKLAYDNDCSIIFTADKFVIKNAQGQTLGKGYKRKGLYALEEHQQLAAYTASTKAPFSIWHKRMGHLNDSALKHLCNLHLLDISSWKKDELICVNCQLSKRS
RLPFQLRNEISSVPLNKIHCDLWGPAPILSCQHFKYYVCFIDDFSRYTWLYPLKKKSDFFQSFQKFQKLVENQFDRKIKIFQSDGGGEFQSLAFKHHLEQHGIHHQLSCP
HTPQQNGVAERKHRHVVETGLSLIFQSKTPFKYWVDAFLTAVFLINRMPSKTLNMQTPYAKLFGKEPNYSSIRVFGCKCFPYLRNYSNDKFSKRTYPCVFVGYSPIHKGY
RCLDPISNRIYISRHVIFDENTFPFDKQTNNIQETPKSLEVIEFLGEEWMQGAEEKPENHNNSDLEVAKQCRNAHWFEESEHEATIPVANNQLQEETPSQPTQDNEGQPN
LASNQEPTQITCIEDHTPSQLDNSRCVDQLNISGHQNQNNPQIIQTLTPQEALSLPDISNDMFVDFSIFAGSSMQTTNGDKQGQQHNLKEKAPVEPHHMVTRGKSAKDPQ
LYSTIHKKGHLSLIATKQEMEPKSFKSALKIPHWKAAMEEEMKALLENHTWDLVPKPSHTNIIGSKWIFKTKLKEDGSIERYKARLVAQGYTQVEGLDYEETFSPVVKPT
TIRIVLAVAVTCGWKLRQLDVKNAFLHGYLKEEVFMAQPPGFQHPSLPQHVCKLNRSLYGLKQAPRAWFERLSQFLVQTGFFCSHSDPSFFIFKTNEITMIMLIYVDDII
VTGNSEKHMEELIQKLSIEFALKDLGPLHYFLGIEVHHTSDGIMLSQGKYARDILIKTKMIEASQYGTPMSNYTSNCANDNELVNATEYRSIVGSLQYLTLTRPDITYAV
NKVCQQLQTPRIKDLKAVKRILRYLNGTTNHGISFYKNSSLKLYAFCDADWGGCPDTRRSTTGYCIFLGANCISWKSKKQPTIARSSSEAEYRAMAITTAELTWISFVLK
DIGIHMMQPPQLFCDNMSALRMSINPVFHARTKHIEIDYHFVRERVALGLLITRYVPSQDQLADILTKPLSKESFKKLRSKLGVRCTSSSLRKNERKQVNPTSL