; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038204 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038204
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr2:13760418..13770045
RNA-Seq ExpressionLag0038204
SyntenyLag0038204
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5532217.1 hypothetical protein RHGRI_026743 [Rhododendron griersonianum]5.6e-20142.84Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNS-SIWHDRLGHPNFTVLKSVLQLLNV-AYSSFSSLCSHCLSGKMSKLSFPLSHTH
        QD  T +IL  GP + GLYP+ T SSA+ S   S +     +RV+S S+WH +LGHP+ T+ +S++    +     F   C  C   K  KL FP+S T 
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNS-SIWHDRLGHPNFTVLKSVLQLLNV-AYSSFSSLCSHCLSGKMSKLSFPLSHTH

Query:  SSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKT-------------LATLFATHGIFHQRSCP
        SS P +L+H D                                   S+V   ++ FK++ +      +KT             L  LF  +G+ HQ SCP
Subjt:  SSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKT-------------LATLFATHGIFHQRSCP

Query:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF
        HTPEQNG+ ERKHRH++E  L+L+    +P  +W  A   A FL NRLP  SL  + P+ +LF   PDY  L+ FGCAC+P L+PY +HKL P++  CVF
Subjt:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF

Query:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ
        +GY    KGY C DP+ +++Y+SRHV F+E  FPF      ++SSS  S+   SS+ P  +  F  +     LP    S++++S SP             
Subjt:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ

Query:  TSIGGCSVESVDTNVPVANEYVDISLMDT--NPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLST-EPSSFTQASKLPEW--
        TS+   + ES  T+   + ++V  SL +T   PS VP                   HPM TRSK+GI K K  L+   S   T EP SF++A     W  
Subjt:  TSIGGCSVESVDTNVPVANEYVDISLMDT--NPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLST-EPSSFTQASKLPEW--

Query:  ----------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFL
                                          IKR+ DGS+ARYK RLVA    Q EG+D+ ETFSPV+K+PT+RIVLSLA HH W LRQLDV N FL
Subjt:  ----------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFL

Query:  HGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQL
        HG + E VYM+QP G+     P+HVC+L K+LYGL+QAPRAW+  F+S L   GF +S +D SLF+ +   ++T++L+YVDDI++TG+D ++I++LI  L
Subjt:  HGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQL

Query:  KELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQ
           F M DLG L YF GLE      GL ++Q KY  D+L++ GM + K  S+P S            P D   +RS+VG+L YLT T+P+ISF+V+ V Q
Subjt:  KELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQ

Query:  FMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQ
         MH P + H  AVKRILRY++GT+  GLH  A SLTL+AF+D+DW G+P DRRSTTGF VF G N ISW  KKQ T+ RSST+AEYRA+A T A+++WIQ
Subjt:  FMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQ

Query:  QLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
        QLLSE+ VS+    +L+CDN+SA+ LA NPVFH+RTKHIE+D H++REQ+
Subjt:  QLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

PRQ49208.1 putative RNA-directed DNA polymerase [Rosa chinensis]1.5e-20945.29Show/hide
Query:  DKRTGRILFHGPSINGLYPLT--TQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQL--LNVAYSSFSSLCSHCLSGKMSKLSFPLSHT
        DK TG IL+ G    GLYP+     S   P+S PS   A LG +V+ SIWH RLGHP+  V+ S+L    ++V     +SLC  CLSGK +KL F LS  
Subjt:  DKRTGRILFHGPSINGLYPLT--TQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQL--LNVAYSSFSSLCSHCLSGKMSKLSFPLSHT

Query:  HSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRSC
         SS P E++HSD                                    +      +F +F  N  S  +K L T               A +GI HQ SC
Subjt:  HSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRSC

Query:  PHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCV
        P+TP+QNG+AERK+RHIVE AL+L++  S+P R+WYHA A + +LINR+P  +L  KSPFE+LF+K P   H+RVFGC CYPLL+PY S+KLQP+T  C+
Subjt:  PHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCV

Query:  FIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSS-----STPSMPSASSTIPTLIRLFQPLDSVE--------ELPTPCESTVSASTS
        FIG+ +GYKG++C  P++++  ISRHV+FDE  F F  S PT S S     +T S+P + ST P    L     SV+          P  C +T S ++ 
Subjt:  FIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSS-----STPSMPSASSTIPTLIRLFQPLDSVE--------ELPTPCESTVSASTS

Query:  PSFCDMTTNTPTVQT-SIGGCSVESV---DTNVPVA---NEYVDIS--LMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKV-YLAKLD
        P     +TN   + T +   C+  S+   DT VPV      Y+ IS  ++D NP   P    +++  L    ++ N HPM TRSK+GI KKKV Y A + 
Subjt:  PSFCDMTTNTPTVQT-SIGGCSVESV---DTNVPVA---NEYVDIS--LMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKV-YLAKLD

Query:  SSLST--EPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVR
        +++S   EP+S  QA K+PEW                                    +K+N DG++AR+K RLVA+ + Q EG DY+ETFSPVV+  TVR
Subjt:  SSLST--EPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVR

Query:  IVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLL
        + L+LAA + W L Q+DVKN FLHG LKEE+YM QP GF +   P+HVC+L KSLYGLKQAPRAW E FTS L   GFK S SDPSLF+++    + YLL
Subjt:  IVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLL

Query:  LYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRL-SIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP--SDATAY
        LYVDDI+LTG++ S I S+   L+  F+M DLG L YF GL+   L S G+ V+Q KY  D+L++ GM  +K+CSTPC    ++       P  +D T Y
Subjt:  LYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRL-SIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP--SDATAY

Query:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQ
        RS+VG L YLTFT+PD++++V+ V QFM +PT  H AAVKRILRYL GT   GL  R++   L AFSD+DW G+P DRRSTTGFV+++G  PISW +KKQ
Subjt:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQ

Query:  STIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
         ++ RSST+AEYRA+A T +EI W++ LL+++ + LP + +L+CDN SAL LA NPV  S+ KH+EVD+HF RE++
Subjt:  STIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

TQD88914.1 hypothetical protein C1H46_025506 [Malus baccata]2.1e-20845.06Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVL--QLLNVAYSSFSSLCSHCLSGKMSKLSFPLSHTH
        QDK T ++L+ G S   +YPL       P + P+     LG ++N S+WH RLGHP  +VL+  L    +  + +S S  C+ CL GK +KL FP+  + 
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVL--QLLNVAYSSFSSLCSHCLSGKMSKLSFPLSHTH

Query:  SSHPLELVHSD--------------------------------SDVAATVSMFKSF---AENLLSFKMKTLAT-------------LFATHGIFHQRSCP
        S  PLE++H+D                                ++ AA   +F  F    +N  S  +K L +                T GI HQ+SCP
Subjt:  SSHPLELVHSD--------------------------------SDVAATVSMFKSF---AENLLSFKMKTLAT-------------LFATHGIFHQRSCP

Query:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF
        +TPEQNG+AERK+RH+VE A++L+ K S+  ++W+HA A +T+L+NRLP+  L   SPFEVL++ PP   HLRVFGCACYP L+PY ++KL P+TT C+F
Subjt:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF

Query:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ
        +GY   YKGY+C      ++ +SRHV+FDE VFP       SSSS +P++ SAS  IP  +           +P P      +S+ PS            
Subjt:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ

Query:  TSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEW-----
          IGG S     +++ ++  + D++      S VP D   +        VSSN HPM+TRSKSGI KKKV+ A +   +  EP SF+ A++  +W     
Subjt:  TSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEW-----

Query:  -------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGD
                                       +KRNPDGSVARYK RLVAK + Q  G+DY ETFSPVVK  TVR++LSLAA  GW L+QLDVKN FLHG 
Subjt:  -------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGD

Query:  LKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKEL
        L EEVYM QPQGFI K  P  VC+L +SLYGLKQAPRAW   F+S L+  GF +S  DPSL+V+    S+  LLLYVDDI+L+G+D  ++ S+ISQL   
Subjt:  LKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKEL

Query:  FDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMH
        FDM DLG L YF GL+ +     L V Q KY  DLL +  M ++K C+TPC               D   YRSI+GAL YLTFT+PDI+++V++V QFMH
Subjt:  FDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMH

Query:  SPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLL
        SP   H  AVKRILRYL+G+L LGL  +  +L + A++D+DW G+P DRRSTTGFVVFLG NPISW +KKQ T+ RSST+AEYRA+ATTTAE++WIQQLL
Subjt:  SPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLL

Query:  SELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQ
         +L +S   + +L+CDN SA+ LA NP+ HS+ KHIE+D HFVRE++Q      QG+  LQ
Subjt:  SELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQ

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata]5.8e-21444.29Show/hide
Query:  MRLLPSQTNLPLVVREG----QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYSSFSSL-
        M  L    N   +V E     QDK T  IL+ G S N +YPL    S+     P +  A +  R+NS++WH RLGHP  +V+K+ L   ++ +    S  
Subjt:  MRLLPSQTNLPLVVREG----QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYSSFSSL-

Query:  -CSHCLSGKMSKLSFPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT------
         C  CL GK + L FP   + S  P E++H+D                                   + V      F++F  N  +  ++ L +      
Subjt:  -CSHCLSGKMSKLSFPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT------

Query:  -------LFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCAC
                    GI H +SCP+TP+QNG+ ERK+RHI E A++L+ +  +P ++WYHA A A +LINR+P+L L  KSPFEVL+   P   HL++FGCAC
Subjt:  -------LFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCAC

Query:  YPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCES
        YP L+PY SHKL P+T++C+F+GY   YKG++C +P   ++ +SRHV+FDE  FP  L +    S S   + S SS +P+      PL S+  +P     
Subjt:  YPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCES

Query:  TVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVV-PTDATKIVGPLDDPTVSS-NCHPMRTRSKSGIVKKKVYLAKLDS
          S+  SP  C   ++ P+     G  ++ S+  ++ +       +L D   S + P      V  L   +V++ N HPM+TRSKSGI KKK   +    
Subjt:  TVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVV-PTDATKIVGPLDDPTVSS-NCHPMRTRSKSGIVKKKVYLAKLDS

Query:  SLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVL
        + + EPSS++ A KL EW                                    IK++PDG+VARYK RLVAK + Q  G+DY ETFSPVVK  TVR++L
Subjt:  SLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVL

Query:  SLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYV
        SLAA +GW L QLDVKN FLHG L EEVYM QPQGF+    P+HVC+L +SLYGLKQAPRAW E FT  L+  GFK+S +DPSLFV+    SI  LLLYV
Subjt:  SLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYV

Query:  DDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGA
        DDI+LTGN  + + S+I QL   FDM +LG L YF GL+ +  S GL V Q KY  DLL +  M + K C TPC   H   T    S SD   YRSIVGA
Subjt:  DDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGA

Query:  LHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRS
        L YLTFT+PDI+++V++V QFMH+P  DH  AVKRILRYLRGT+ LG+H    SL + A++D+DW G+P DRRSTTGFVVFLG+NPISW +KKQ T+ RS
Subjt:  LHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRS

Query:  STKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQSLQET
        ST+AEYRA+ATTTAEI+W+QQLL +L +S P   +L+CDN  A+ LA NP+ HS+ KHIE+D HFVRE++Q      QG+  LQ +  T
Subjt:  STKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQSLQET

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]7.1e-21245.32Show/hide
Query:  QDKRTGRILFHGPSINGLYP--------LTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS--SFSSLCSHCLSGKMSKL
        QDK TGRI+  G    GLYP        L  Q+    +S     T  LG++V  ++WH RLGHP+  V  ++L+  ++  S    SS+C+ CL GK +KL
Subjt:  QDKRTGRILFHGPSINGLYP--------LTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS--SFSSLCSHCLSGKMSKL

Query:  SFPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT----LFATH---------G
         F      + HPLE++HSD                                   S+V      F SF     S  +K   +     +++H         G
Subjt:  SFPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT----LFATH---------G

Query:  IFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQ
        I HQ+SCP+TP+QNG+AERKHRHI+E A++L+   S+P + W+HA A + +LINR+   +L   SPF+ LF   P  SHL+VFGCAC+PLL+   S KLQ
Subjt:  IFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQ

Query:  PRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF---VLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCE-STVSASTSPS
        P+T+QC+FIGY   YKGYLCL+PLT +IY+SRHV+FDE  FP+   + S+  SS  S+PS+      +P+L+     + S    P P + S  S ST P+
Subjt:  PRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF---VLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCE-STVSASTSPS

Query:  FCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSS------------NCHPMRTRSKSGIVKKKVYLAKLD
          + + +TP                     + +VD      +P+ +P D+T+ + P DDP                + HPM+TRSKSGI KKKV+ AKL 
Subjt:  FCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSS------------NCHPMRTRSKSGIVKKKVYLAKLD

Query:  SSLS-TEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRI
        SS+  +EP++F  A K+PEW                                    IK NPDGSVARYK RLVAK Y Q EGVDY ETFSPVVK  TVR+
Subjt:  SSLS-TEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRI

Query:  VLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPS-HVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLL
        +L+LAA   W LRQLDVKN FLHGDL EEVYM QPQGF S   PS +VCRL+KSLYGLKQAPRAW E FTS L   GFKASL+DPSLFV++       LL
Subjt:  VLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPS-HVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLL

Query:  LYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSI
        LYVDDI+LTG+    I  +I  L + FD+ DLG L YF GL+      GL V+Q KY  DLL +  + ++K C+TPC   H  S            YRSI
Subjt:  LYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSI

Query:  VGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTI
        VGAL YLTFT+PDI+F+V++  QFMH P   H+ AVK ILRYL GTL  GL  +   L L A+SD+DW G+P DRRST+G +V+LG++PISW +KKQ T+
Subjt:  VGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTI

Query:  FRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
         RSST+AEYRALA   AE+ W++Q+L +L V L  + +LYCDN S + L+ NPVFHSR KHIE+D HFVRE++
Subjt:  FRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

TrEMBL top hitse value%identityAlignment
A0A2N9EFT0 Uncharacterized protein2.1e-22248.09Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVP-----STITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQ---LLNVAYSSFSS-LCSHCLSGKMSKLS
        QD  +GR L+ G S +GLYP+   SS+   S P        +A LGT+   S+WH RLGHP   VL SVL     L+V  + FSS  C+HC+ GK+ +  
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVP-----STITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQ---LLNVAYSSFSS-LCSHCLSGKMSKLS

Query:  FPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGI
        FP S   ++ PLELVHSD                                   S V AT   F +  EN+L+ ++K L T               +T GI
Subjt:  FPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGI

Query:  FHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQP
         HQ SCPHTP+QNGVAERKHRHIVE AL+LIS+ S+PL+YW +AF+ A +LINR+P+ +L   SP+++LF   PDYS L+ FGC C+PLLRPY  HKL+P
Subjt:  FHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQP

Query:  RTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTI----PTLIRLFQPLDSVEEL--PTPCEST---VSAS
        R++ CVF+GY L  KGYLCL+  T ++ ISRHV F E+ FPF  S  + SS STPS    SS +     T   +  P  S+  L   TP  S+   VSA 
Subjt:  RTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTI----PTLIRLFQPLDSVEEL--PTPCEST---VSAS

Query:  TS-PSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYL-AKLDSSLSTE
        T  PS    TT++P +   +  CSV S                                GP+  P    N HPM+TR KSGI K+K+ L  K  + L TE
Subjt:  TS-PSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYL-AKLDSSLSTE

Query:  PSSFTQASKLPEW----------IKR----NPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEE
        P S+  ASK PEW          ++R     PDGSVARYK RLVAK YHQ+ G+DY+ETFSPVVK  TVR++LS+AA   W LRQLDV N FLH  LKE+
Subjt:  PSSFTQASKLPEW----------IKR----NPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEE

Query:  VYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMT
        VYM QPQGF+    P HVC+L KSLYGLKQAPRAWFE FTS L+  GF AS +DPSLF+     ++ +LL+YVDDI++TGN PS ++SL+ QL   F++ 
Subjt:  VYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMT

Query:  DLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPT
        DLG L+YF GLE    + G  V Q KY  DLL ++ M + K CSTPC T     T  + +P  DAT +RS+VGAL YLTFT+PD+++ V+ + QFMH+PT
Subjt:  DLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPT

Query:  LDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSEL
          HL+A KR+LRY+RG+L LGL     SL L A+SD+DW G+P  RRSTTG++VF+G NP++W++KKQST+ RSST+AEYRALA+  AE+ W++ +L +L
Subjt:  LDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSEL

Query:  FVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
         +SL     L+CDN SAL LA NPVFH+RTKHIEVD HF+R+++
Subjt:  FVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

A0A2N9FXA9 Uncharacterized protein6.0e-22546.68Show/hide
Query:  TGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS----SFSSLCSHCLSGKMSKLSFPLSHTHSS
        +GR+L+ G S NGLYP+ TQ S    S  S+I A L ++    +WH RLGHP+  VL S +  L+   S         C HCL GKM KL F  S   S+
Subjt:  TGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS----SFSSLCSHCLSGKMSKLSFPLSHTHSS

Query:  HPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLA-------------TLFATHGIFHQRSCPHT
         PLELVHSD                                   SDV  T   F++  EN LS K+K L              T  A+HGI H  SCPHT
Subjt:  HPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLA-------------TLFATHGIFHQRSCPHT

Query:  PEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIG
        P+QNG+ ERKHRH++E AL+L+S   + + YW +A + A  LINRLP+ +L +++P+E+LF KPPD  HLR FGC C+P LRPY +HKLQPRTT C+F+G
Subjt:  PEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIG

Query:  YPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ--
        YP   KGY+CLDP T R+YISRHV+F+E  F   LS P S SS +P   S  ST    I        +    TP +S       P    + + TP +Q  
Subjt:  YPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQ--

Query:  TSIGGCSVESVDTNVPVANEYVDISL-MDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEW----
        TSI   ++     + P+       +  + T PSV P  ++ I       + S+N HPM TRSK+GI K K++   +     TEPS++  ASK P+W    
Subjt:  TSIGGCSVESVDTNVPVANEYVDISL-MDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEW----

Query:  --------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHG
                                        +KRN DG+++R+K RLVAK +HQ+ G+D+ ETFSPVVK PTVR++L+LA  + W LRQLDV N FLHG
Subjt:  --------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHG

Query:  DLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKE
         LKEEVYM QP G++S   P+HVC L+KS+YGLKQAPRAWFE FTS L+  GF +S +D SLF+     ++ +LLLYVDDIVLTGN+  +++ LI+ L +
Subjt:  DLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKE

Query:  LFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDISFAVSRVSQF
        +F++ DLG+LS+F GL+ +R S GL +TQ KY  DLL +  M     C TPC   H   +AT  +P +D  AYRS+VGALHYLTFT+PD+SFAV +V QF
Subjt:  LFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDISFAVSRVSQF

Query:  MHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQ
        M++PT  HL A KRILRYL+GTL  GLH      TLSAF+D+DW G+P DRRST+G +VFLG NPI+W  KKQ T+ RSST+AEYRALA+ +AE+ W++ 
Subjt:  MHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQ

Query:  LLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
        LL +L + +    IL+CDN SAL +A NPVFH+RTKHIEVD HF+RE++
Subjt:  LLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

A0A2N9GRJ0 Uncharacterized protein9.6e-22347.31Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVP-----STITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQ---LLNVAYSSFSS-LCSHCLSGKMSKLS
        QD  +GR L+ G S +GLYP+   SS+   S P        +A LGT+   S+WH RLGHP   VL SVL     L+V  + FSS  C+HC+ GK+ +  
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVP-----STITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQ---LLNVAYSSFSS-LCSHCLSGKMSKLS

Query:  FPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGI
        FP S   ++ PLELVHSD                                   S V AT   F +  EN+L+ ++K L T               +T GI
Subjt:  FPLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGI

Query:  FHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQP
         HQ SCPHTP+QNGVAERKHRHIVE AL+LIS+ S+PL+YW +AF+ A +LINR+P+ +L   SP+++LF   PDYS L+ FGC C+PLLRPY  HKL+P
Subjt:  FHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQP

Query:  RTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTI----PTLIRLFQPLDSVEEL--PTPCEST---VSAS
        R++ CVF+GY L  KGYLCL+  T ++ ISRHV F E+ FPF  S  + SS STPS    SS +     T   +  P  S+  L   TP  S+   VSA 
Subjt:  RTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTI----PTLIRLFQPLDSVEEL--PTPCEST---VSAS

Query:  TS-PSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYL-AKLDSSLSTE
        T  PS    TT++P +   +  CSV S                                GP+  P    N HPM+TR KSGI K+K+ L  K  + L TE
Subjt:  TS-PSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYL-AKLDSSLSTE

Query:  PSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAH
        P S+  ASK PEW                                    IKR PDGSVARYK RLVAK YHQ+ G+DY+ETFSPVVK  TVR++LS+AA 
Subjt:  PSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAH

Query:  HGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVL
          W LRQLDV N FLHG LKE+VYM QPQGF+    P HVC+L KSLYGLKQAPRAWFE FTS L+  GF AS +DPSLF+     ++ +LL+YVDDI++
Subjt:  HGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVL

Query:  TGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYL
        TGN PS ++SL+ QL   F++ DLG L+YF GLE    + G  V Q KY  DLL ++ M + K CSTPC T     T  + +P  DAT +RS+VGAL YL
Subjt:  TGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYL

Query:  TFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKA
        TFT+PD+++ V+ + QFMH+PT  HL+A KR+LRY+RG+L LGL     SL L A+SD+DW G+P  RRSTTG++VF+G NP++W++KKQST+ RSST+A
Subjt:  TFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKA

Query:  EYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
        EYRALA+  AE+ W++ +L +L +SL     L+CDN SAL LA NPVFH+RTKHIEVD HF+R+++
Subjt:  EYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

A0A2N9GWH7 Integrase catalytic domain-containing protein3.0e-22445.15Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQS-SASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV---LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLSH
        QD  TG++L+ G S NG+YP+ + +   S  +  +   AQ  +     +WH RLGHP+  +L S+   LQ  +   +S  + C HCL+GKM +L FP+S+
Subjt:  QDKRTGRILFHGPSINGLYPLTTQS-SASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV---LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLSH

Query:  THSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRS
           S P  L+H+D                                   SD     S F++      S  +K L T               A HGI HQ S
Subjt:  THSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRS

Query:  CPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQC
        CPHTP+QNGVAERKHRH+++ AL+L+S+  +P+ +W HA + AT ++NRLP+ +L +K+P+E+LF+KPPD SH R FGC C+PLL PY +HKLQP+T  C
Subjt:  CPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQC

Query:  VFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPT
        VF+GYP   KGYLCLDPLT+R+Y SRHV+F+E +FP ++ +P +S+S+     S  + + TL+ L             C  T   + +PS        P 
Subjt:  VFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPT

Query:  VQTSIGGCSVESVDTNVPVANEYVDISL-------MDTNPSVVPTDATKI-VGPLDD------------PTVSSN-----------CHPMRTRSKSGIVK
         Q      S    D+++P  N    IS+       + T+P+++PTD   I V P++             PTV+S             HPM+TRSKSGI K
Subjt:  VQTSIGGCSVESVDTNVPVANEYVDISL-------MDTNPSVVPTDATKI-VGPLDD------------PTVSSN-----------CHPMRTRSKSGIVK

Query:  KKV-YLAKLDSSLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSP
         KV Y A++DS+L TEP+S+T ASK P+W                                    +K + DG++AR+K RLVAK +HQ+ GVD++ETFSP
Subjt:  KKV-YLAKLDSSLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSP

Query:  VVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNI
        VVK PTVR++LSLA   GW LRQLDVKN FLHG LKEEVYM QPQGFI  Q PSHVC+L KS+YGLKQAPRAWFE FTS L+  GF AS +D SLF+   
Subjt:  VVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNI

Query:  RGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPS
          +I YLLLYVDDIVLT N P+Y+  L++QL  +FD+ DLGSL YF GL+  R S GL + Q KY  DLL +  M ++K   +P       S        
Subjt:  RGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPS

Query:  DATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISW
        D   YRS+VGALHYLTFT+PDISF+V +V Q+M +PT  HLAA KRILRY+RGTL  G+     SL LSA++D+DW G+P DRRST+G++V+LG+NPI+W
Subjt:  DATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISW

Query:  ITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQSLQE
          KKQ T+ RSST++EYRALA  +AE+ W++ LL +L + L  + IL+CDN SAL +A NPVFH+RTKHIEVD HFVRE++   D   Q    L  L +
Subjt:  ITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQSLQE

A0A2N9HZ49 Uncharacterized protein3.7e-22246.19Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQ---SSASP-SSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS----SFSSLCSHCLSGKMSKLSF
        QD  +G++L+ G S NGLYP+ TQ    S SP SS  S+I A L +R    +WH RLGHP+  VL S +  L+   S         C HCL GKM KL F
Subjt:  QDKRTGRILFHGPSINGLYPLTTQ---SSASP-SSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYS----SFSSLCSHCLSGKMSKLSF

Query:  PLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLATL-------------FATHGIF
          S   S+HPLELVHSD                                   SDV  T   F++  ENLLS K+K L T               A+ GI 
Subjt:  PLSHTHSSHPLELVHSD-----------------------------------SDVAATVSMFKSFAENLLSFKMKTLATL-------------FATHGIF

Query:  HQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPR
        H  SCPHTP+QNG+ ERKHRH++E AL+L+S   + + +W +A + A  +INRLP+  L +++P+E+LF KPPD +HL+ FGC C+P LRPY +HKLQPR
Subjt:  HQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPR

Query:  TTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTT
        +T C+F+GYP   KGY+CLDP T R+YISRHV+F+E  F   LS P S S+ TP      ST P L  L          P     +   S+ P    + +
Subjt:  TTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTT

Query:  NTPTVQ--TSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTV---SSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQA
         TP VQ   S+  C + S     P  N  ++        S+  T A  I+     P +   S+N HPM TRSK+GI K K +   +     TEPS++  A
Subjt:  NTPTVQ--TSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTV---SSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQA

Query:  SKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQ
        SK P+W                                    +KRN DG+++RYK RLVAK +HQ+ G+D+ ETFS +VK PTVR++L+LA  + W LRQ
Subjt:  SKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQ

Query:  LDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSY
        LDV+N FLHG LKEEVYM QP G+++   P+HVCRL+KS+YGLKQAPRAWFE FT+ L+  GF +S +D SLF+ +   ++ +LLLYVDDIVLTGN+  +
Subjt:  LDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSY

Query:  ITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDI
        ++ LI+ L ++F++ DLG+LS+F GL+ +R S GL +TQ KY  DLL +  M     C+TPC   H   +AT  +P +D  AYRS+VGALHYLTFT+PD+
Subjt:  ITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSP-SDATAYRSIVGALHYLTFTQPDI

Query:  SFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALAT
        SFAV +V QF+++PT  HL A KRILRYL+GTL  GLH      TLSAF+D+DW G+P DRRST+G +VFLG NPI+W  KKQ T+ RSST+AEYRALA+
Subjt:  SFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALAT

Query:  TTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL
         +AE+ W++ LL +L + +    IL+CDN SAL +A NPVFH+RTKHIEVD HF+RE++
Subjt:  TTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-9229.95Show/hide
Query:  NSSIWHDRLGHPNFTVLKSVLQ--------LLNVAYSSFSSLCSHCLSGKMSKLSFP--LSHTHSSHPLELVHSD-------------------------
        N  +WH+R GH +   L  + +        LLN    S   +C  CL+GK ++L F      TH   PL +VHSD                         
Subjt:  NSSIWHDRLGHPNFTVLKSVLQ--------LLNVAYSSFSSLCSHCLSGKMSKLSFP--LSHTHSSHPLELVHSD-------------------------

Query:  ----------SDVAATVSMFKSF-AENLLSFKMKT---------------LATLFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRY
                  SDV    SMF+ F A++   F +K                +       GI +  + PHTP+ NGV+ER  R I E A +++S   +   +
Subjt:  ----------SDVAATVSMFKSF-AENLLSFKMKT---------------LATLFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRY

Query:  WYHAFACATFLINRLPSLSL--GNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGY-PLGYKGYLCLDPLTERIYISRHVVFDE
        W  A   AT+LINR+PS +L   +K+P+E+   K P   HLRVFG   Y  ++     K   ++ + +F+GY P G+K +   D + E+  ++R VV DE
Subjt:  WYHAFACATFLINRLPSLSL--GNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGY-PLGYKGYLCLDPLTERIYISRHVVFDE

Query:  ------HVFPF-VLSSPTSSSSSTPSMPSASSTI--PTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSV--ESVDTNVPVAN
                  F  +    S  S   + P+ S  I         +  D+++ L    ES      + S   + T  P             +S ++N    N
Subjt:  ------HVFPF-VLSSPTSSSSSTPSMPSASSTI--PTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSV--ESVDTNVPVAN

Query:  E--------YVDISLMDTNPSVVPTDAT----KIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSL-----------STEPSSFTQ--------
        E        +++ S    NP+      T    K +G +D+PT +     +  RS+    K ++   + D+SL           +  P+SF +        
Subjt:  E--------YVDISLMDTNPSVVPTDAT----KIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSL-----------STEPSSFTQ--------

Query:  --------------------ASKLPE--------W---IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVK
                             +K PE        W   +K N  G+  RYK RLVA+ + Q+  +DYEETF+PV +  + R +LSL   +   + Q+DVK
Subjt:  --------------------ASKLPE--------W---IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVK

Query:  NVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSIT---YLLLYVDDIVLTGNDPSYI
          FL+G LKEE+YM+ PQG        +VC+LNK++YGLKQA R WFE F  +L  C F  S  D  +++ + +G+I    Y+LLYVDD+V+   D + +
Subjt:  NVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSIT---YLLLYVDDIVLTGNDPSYI

Query:  TSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDA-TAYRSIVGALHYLTF-TQPDI
         +    L E F MTDL  + +F G+  +     + ++Q  Y   +L +F M      STP  +    +   L S  D  T  RS++G L Y+   T+PD+
Subjt:  TSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDA-TAYRSIVGALHYLTF-TQPDI

Query:  SFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLR---ASSLTLSAFSDSDWTGNPTDRRSTTGFVV-FLGANPISWITKKQSTIFRSSTKAEYR
        + AV+ +S++      +    +KR+LRYL+GT+ + L  +   A    +  + DSDW G+  DR+STTG++      N I W TK+Q+++  SST+AEY 
Subjt:  SFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLR---ASSLTLSAFSDSDWTGNPTDRRSTTGFVV-FLGANPISWITKKQSTIFRSSTKAEYR

Query:  ALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQS
        AL     E +W++ LL+ + + L   + +Y DNQ  + +A NP  H R KHI++  HF REQ+Q+
Subjt:  ALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-11032.52Show/hide
Query:  VNSSIWHDRLGHPNFTVLKSVLQLLNVAYSSFSSL--CSHCLSGKMSKLSFPLSHTHSSHPLELVHSD--------------------------------
        ++  +WH R+GH +   L+ + +   ++Y+  +++  C +CL GK  ++SF  S     + L+LV+SD                                
Subjt:  VNSSIWHDRLGHPNFTVLKSVLQLLNVAYSSFSSL--CSHCLSGKMSKLSFPLSHTHSSHPLELVHSD--------------------------------

Query:  ---SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATF
             V      F +  E     K+K L +               ++HGI H+++ P TP+ NGVAER +R IVE   S++    +P  +W  A   A +
Subjt:  ---SDVAATVSMFKSFAENLLSFKMKTLAT-------------LFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATF

Query:  LINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEH------------
        LINR PS+ L  + P  V   K   YSHL+VFGC  +  +      KL  ++  C+FIGY     GY   DP+ +++  SR VVF E             
Subjt:  LINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEH------------

Query:  ---------VFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLD-SVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEY
                   P   ++PTS+ S+T  +         +I   + LD  VEE+  P                 T        +       V++    + EY
Subjt:  ---------VFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLD-SVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEY

Query:  VDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEWIKRNPDGSVARYKTRLVAKEYHQREG
        V IS  D  P  +      +  P  +  + +    M +  K+G  K    L +L       P       KL    K++ D  + RYK RLV K + Q++G
Subjt:  VDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEWIKRNPDGSVARYKTRLVAKEYHQREG

Query:  VDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLS
        +D++E FSPVVK  ++R +LSLAA    ++ QLDVK  FLHGDL+EE+YM+QP+GF        VC+LNKSLYGLKQAPR W+  F S +    +  + S
Subjt:  VDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIRCGFKASLS

Query:  DPSL-FVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEF--KRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTG
        DP + F R    +   LLLYVDD+++ G D   I  L   L + FDM DLG      G++   +R S  L ++Q KY   +L RF M  AK  STP + G
Subjt:  DPSL-FVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEF--KRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTG

Query:  HFSSTATLCSPS-------DATAYRSIVGALHY-LTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNP
        H   +  +C  +           Y S VG+L Y +  T+PDI+ AV  VS+F+ +P  +H  AVK ILRYLRGT    L    S   L  ++D+D  G+ 
Subjt:  HFSSTATLCSPS-------DATAYRSIVGALHY-LTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNP

Query:  TDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQ
         +R+S+TG++       ISW +K Q  +  S+T+AEY A   T  E+IW+++ L EL +   +  ++YCD+QSA+ L++N ++H+RTKHI+V  H++RE 
Subjt:  TDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQ

Query:  L
        +
Subjt:  L

P92519 Uncharacterized mitochondrial protein AtMg008106.5e-5151.11Show/hide
Query:  YLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAY
        YLLLYVDDI+LTG+  + +  LI QL   F M DLG + YF G++ K    GL ++Q KY   +L   GM + K  STP      SS +T   P D + +
Subjt:  YLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAY

Query:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKK
        RSIVGAL YLT T+PDIS+AV+ V Q MH PTL     +KR+LRY++GT+  GL++ + S L + AF DSDW G  + RRSTTGF  FLG N ISW  K+
Subjt:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKK

Query:  QSTIFRSSTKAEYRALATTTAEIIW
        Q T+ RSST+ EYRALA T AE+ W
Subjt:  QSTIFRSSTKAEYRALATTTAEIIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-18039.62Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV-----LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLS
        +D  TG  L  G + + LY     SS      P ++ A   ++   S WH RLGHP  ++L SV     L +LN ++   S  CS CL  K +K+ F  S
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV-----LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLS

Query:  HTHSSHPLELVHSD----------------------------------SDVAATVSMFKSFAENLLSFKMKT-----------LATLFATHGIFHQRSCP
          +S+ PLE ++SD                                  S V  T   FK+  EN    ++ T           L   F+ HGI H  S P
Subjt:  HTHSSHPLELVHSD----------------------------------SDVAATVSMFKSFAENLLSFKMKT-----------LATLFATHGIFHQRSCP

Query:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF
        HTPE NG++ERKHRHIVE  L+L+S  S+P  YW +AFA A +LINRLP+  L  +SPF+ LF   P+Y  LRVFGCACYP LRPY  HKL  ++ QCVF
Subjt:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF

Query:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF--------------------------------VLSSPTSS----SSSTPSMPSA-----------
        +GY L    YLCL   T R+YISRHV FDE+ FPF                                VL +P+ S    +++ PS PSA           
Subjt:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF--------------------------------VLSSPTSS----SSSTPSMPSA-----------

Query:  ----------SSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVV---PTDATK
                  SS  PT  R   P  + +  PT  ++   +S + S  + T  +P+          +S  ++          S   T PS++   P    +
Subjt:  ----------SSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMTTNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVV---PTDATK

Query:  IVGPLDDPTVSSNCHPMRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW-------------------------------------IKRNPDG
        IV   ++     N H M TR+K+GI+K    Y   +  +  +EP +  QA K   W                                      K N DG
Subjt:  IVGPLDDPTVSSNCHPMRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW-------------------------------------IKRNPDG

Query:  SVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRA
        S+ RYK RLVAK Y+QR G+DY ETFSPV+K  ++RIVL +A    W +RQLDV N FL G L ++VYM QP GFI K  P++VC+L K+LYGLKQAPRA
Subjt:  SVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRA

Query:  WFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVR
        W+    + L+  GF  S+SD SLFV     SI Y+L+YVDDI++TGNDP+ + + +  L + F + D   L YF G+E KR+  GL ++Q +Y +DLL R
Subjt:  WFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVR

Query:  FGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAF
          M  AK  +TP +     S  +    +D T YR IVG+L YL FT+PDIS+AV+R+SQFMH PT +HL A+KRILRYL GT   G+ L + ++L+L A+
Subjt:  FGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAF

Query:  SDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIE
        SD+DW G+  D  ST G++V+LG +PISW +KKQ  + RSST+AEYR++A T++E+ WI  LL+EL + L +  ++YCDN  A  L  NPVFHSR KHI 
Subjt:  SDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIE

Query:  VDLHFVREQLQS
        +D HF+R Q+QS
Subjt:  VDLHFVREQLQS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-17839.68Show/hide
Query:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV-----LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLS
        +D  TG  L  G + + LY     SS + S   S       ++   S WH RLGHP+  +L SV     L +LN ++   S  CS C   K  K+ F  S
Subjt:  QDKRTGRILFHGPSINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSV-----LQLLNVAYSSFSSLCSHCLSGKMSKLSFPLS

Query:  HTHSSHPLELVHSD----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-----------LFATHGIFHQRSCP
           SS PLE ++SD                                  S V  T  +FKS  EN    ++ TL +             + HGI H  S P
Subjt:  HTHSSHPLELVHSD----------------------------------SDVAATVSMFKSFAENLLSFKMKTLAT-----------LFATHGIFHQRSCP

Query:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF
        HTPE NG++ERKHRHIVEM L+L+S  SVP  YW +AF+ A +LINRLP+  L  +SPF+ LF +PP+Y  L+VFGCACYP LRPY  HKL+ ++ QC F
Subjt:  HTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLLRPYVSHKLQPRTTQCVF

Query:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF------VLSSPTSSSSSTPSMPSASSTIPTLIRLFQP------LDSVEELPTP----CESTVSAS
        +GY L    YLCL   T R+Y SRHV FDE  FPF      V +S    S S P+ PS ++   T + L  P      LD+    P+     C + VS+S
Subjt:  IGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPF------VLSSPTSSSSSTPSMPSASSTIPTLIRLFQP------LDSVEELPTP----CESTVSAS

Query:  TSP--SFCDMTTNTPTV------QTSIGGCSVESVDTNVPVANEYVDISLMDTNPSV-------------VPTDATKIVGPLDDPTVSS-----------
          P  S    +++ PT       Q +      ++ ++N P+ N     S    +P+              +PT +T I  P + P+ SS           
Subjt:  TSP--SFCDMTTNTPTV------QTSIGGCSVESVDTNVPVANEYVDISLMDTNPSV-------------VPTDATKIVGPLDDPTVSS-----------

Query:  -------------NCHPMRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW-------------------------------------IKRNPD
                     N H M TR+K GI K  + Y      + ++EP +  QA K   W                                      K N D
Subjt:  -------------NCHPMRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW-------------------------------------IKRNPD

Query:  GSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPR
        GS+ RYK RLVAK Y+QR G+DY ETFSPV+K  ++RIVL +A    W +RQLDV N FL G L +EVYM QP GF+ K  P +VCRL K++YGLKQAPR
Subjt:  GSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPR

Query:  AWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLV
        AW+    + L+  GF  S+SD SLFV     SI Y+L+YVDDI++TGND   +   +  L + F + +   L YF G+E KR+  GL ++Q +YT+DLL 
Subjt:  AWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLV

Query:  RFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSA
        R  M  AK  +TP +T    +  +     D T YR IVG+L YL FT+PD+S+AV+R+SQ+MH PT DH  A+KR+LRYL GT   G+ L + ++L+L A
Subjt:  RFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSA

Query:  FSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHI
        +SD+DW G+  D  ST G++V+LG +PISW +KKQ  + RSST+AEYR++A T++E+ WI  LL+EL + L    ++YCDN  A  L  NPVFHSR KHI
Subjt:  FSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHI

Query:  EVDLHFVREQLQS
         +D HF+R Q+QS
Subjt:  EVDLHFVREQLQS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.9e-9642.4Show/hide
Query:  IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQF----PSHVCRLNK
        IK N DG++ RYK RLVAK Y Q+EG+D+ ETFSPV K  +V+++L+++A + + L QLD+ N FL+GDL EE+YM+ P G+ ++Q     P+ VC L K
Subjt:  IKRNPDGSVARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQF----PSHVCRLNK

Query:  SLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVT
        S+YGLKQA R WF  F+ +LI  GF  S SD + F++        +L+YVDDI++  N+ + +  L SQLK  F + DLG L YF GLE  R + G+ + 
Subjt:  SLYGLKQAPRAWFECFTSSLIRCGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVT

Query:  QIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGL-H
        Q KY +DLL   G+   K  S P       S  +     DA AYR ++G L YL  T+ DISFAV+++SQF  +P L H  AV +IL Y++GT+  GL +
Subjt:  QIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGL-H

Query:  LRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARN
           + + L  FSD+ +      RRST G+ +FLG + ISW +KKQ  + +SS +AEYRAL+  T E++W+ Q   EL + L +  +L+CDN +A+ +A N
Subjt:  LRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKKQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARN

Query:  PVFHSRTKHIEVDLHFVREQ--LQSFDAFSQGSFDLQ----SLQETLIFGVLKDLADSFNLLYLQGL
         VFH RTKHIE D H VRE+   Q+  ++S  ++D Q         ++ G +  +   F L  L+ L
Subjt:  PVFHSRTKHIEVDLHFVREQ--LQSFDAFSQGSFDLQ----SLQETLIFGVLKDLADSFNLLYLQGL

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.2e-1347.73Show/hide
Query:  YLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASS-LTLSAFSDSDWTGNPTDRRSTTGF-----VVFLGA
        YLT T+PD++FAV+R+SQF  +     + AV ++L Y++GT+  GL   A+S L L AF+DSDW   P  RRS TGF     + FLGA
Subjt:  YLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASS-LTLSAFSDSDWTGNPTDRRSTTGF-----VVFLGA

ATMG00810.1 DNA/RNA polymerases superfamily protein4.6e-5251.11Show/hide
Query:  YLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAY
        YLLLYVDDI+LTG+  + +  LI QL   F M DLG + YF G++ K    GL ++Q KY   +L   GM + K  STP      SS +T   P D + +
Subjt:  YLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSSTATLCSPSDATAY

Query:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKK
        RSIVGAL YLT T+PDIS+AV+ V Q MH PTL     +KR+LRY++GT+  GL++ + S L + AF DSDW G  + RRSTTGF  FLG N ISW  K+
Subjt:  RSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHL-RASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITKK

Query:  QSTIFRSSTKAEYRALATTTAEIIW
        Q T+ RSST+ EYRALA T AE+ W
Subjt:  QSTIFRSSTKAEYRALATTTAEIIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)9.2e-0832Show/hide
Query:  MRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQR
        M TRSK+GI K    Y   + +++  EP S   A K P W                                     K + DG++ R K RLVAK +HQ 
Subjt:  MRTRSKSGIVK-KKVYLAKLDSSLSTEPSSFTQASKLPEW------------------------------------IKRNPDGSVARYKTRLVAKEYHQR

Query:  EGVDYEETFSPVVKKPTVRIVLSLA
        EG+ + ET+SPVV+  T+R +L++A
Subjt:  EGVDYEETFSPVVKKPTVRIVLSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTAACTAGAGCTAGGTCAAATGTCCGTTTTACCCCTAAAGTTACATCTTGCTCCTTAAGTCCCACTGATCCTCTAATGAACAATTTGGTTTATGGTTTTACCAC
TAAACCAAGTCCCTTTCGGGCCAATGAGATGGTGGGGCCCCTTGATCAAGACCCAAAGTCAGCACTTAAGGGAATAACCTCTCTACTATCCTTGAAACGGGTAGGAGTGA
ATTCCTTCTTGCAGGATTATGTCCACAACTATCTACCCGGTCTCATCCCCAAAATGTGCAACCAAAGTTCCACATTGGCTAGACAAGGGGATGATCATGGATTTATAAGG
GAAGACAAGTACCTCCATTGGTATGGGGCTTTTTGGGTGGATCCAAAGCAAAGCCATGAGGACTTATGCCCAAAGTGGACAATATCATACCATTGTGGAGATCTGTGTCG
TGTCAGAAGCCGACCCTCGGCCCGCTTGCATGGACCGAGCTCTTCTGCTTCCGTTTGGTCCCTGGTGCCCCCGGCCGCCCCAGTTCCACTTGGTTTAGTTCGAATCGCCT
CTGAATGCCTAAAACATGTCATGCGTCTTCTCCCCTCTCAAACAAATTTACCGTTGGTGGTACGTGAAGGTCAGGACAAACGAACGGGTCGCATACTCTTCCATGGGCCC
AGTATTAATGGCCTTTATCCTCTGACTACTCAGTCCTCCGCTTCTCCATCCTCTGTTCCTAGCACTATTACTGCTCAACTAGGTACCAGAGTGAACTCTTCCATTTGGCA
TGACCGGTTAGGACATCCTAACTTTACTGTTCTTAAATCTGTACTACAGTTGTTAAATGTAGCATATTCCTCTTTCTCCTCGTTATGTTCACATTGTTTGAGTGGTAAGA
TGAGCAAGTTATCTTTTCCCTTGTCTCATACTCACTCTTCTCACCCCTTGGAACTTGTTCATAGTGATTCTGATGTTGCTGCCACTGTTTCCATGTTTAAATCATTTGCT
GAAAATCTTCTTTCTTTCAAGATGAAAACTCTTGCAACTCTTTTTGCTACACATGGCATTTTTCATCAAAGATCTTGTCCTCATACACCCGAGCAAAATGGGGTAGCCGA
GAGAAAACATCGCCATATTGTTGAGATGGCCCTTTCTTTGATATCCAAGTTTTCTGTTCCTCTTCGCTACTGGTATCATGCTTTTGCCTGTGCTACCTTCCTTATCAATC
GCTTGCCCTCTCTCTCCCTCGGTAATAAATCTCCTTTTGAGGTTCTATTTCGTAAGCCACCAGATTACTCACACTTACGGGTTTTCGGTTGTGCTTGTTATCCGCTCCTT
CGTCCTTATGTCTCTCACAAACTCCAACCTCGAACTACTCAGTGTGTGTTCATAGGTTATCCCCTTGGCTACAAAGGCTATCTATGCCTTGATCCTCTTACCGAACGCAT
TTATATCTCTCGACATGTTGTCTTTGATGAACATGTCTTTCCCTTTGTTCTTTCTTCACCTACCTCAAGTTCATCATCCACTCCTTCCATGCCTTCTGCTTCCTCTACTA
TTCCCACTCTTATTCGACTTTTTCAGCCCTTAGATTCAGTTGAGGAGTTGCCCACACCTTGTGAGTCTACTGTATCTGCTTCTACCTCTCCAAGTTTCTGTGATATGACT
ACTAATACTCCTACTGTTCAGACTTCTATTGGAGGTTGTAGTGTTGAATCTGTTGATACTAATGTTCCTGTTGCAAATGAATATGTTGATATTTCTCTTATGGATACTAA
TCCCTCTGTTGTGCCTACTGATGCCACGAAAATTGTTGGGCCTCTTGATGACCCCACTGTCTCCTCCAATTGCCATCCCATGAGAACTCGGTCCAAGTCTGGTATTGTCA
AGAAAAAGGTTTACTTGGCCAAACTTGACTCCTCACTCTCTACTGAGCCATCATCCTTTACACAAGCTTCTAAGTTGCCCGAATGGATCAAACGCAATCCGGATGGTAGT
GTGGCACGTTATAAGACACGTTTGGTTGCGAAAGAATATCATCAAAGAGAGGGTGTCGATTATGAGGAAACGTTCAGCCCTGTAGTCAAGAAACCTACTGTTAGGATTGT
TCTGTCCTTGGCTGCTCATCATGGTTGGGATCTTCGTCAGTTGGATGTTAAAAATGTGTTCCTACATGGTGATTTGAAGGAGGAGGTCTACATGCAACAACCTCAAGGAT
TCATTAGCAAACAATTCCCTTCTCATGTTTGTCGCCTAAATAAGTCTCTTTATGGACTCAAACAAGCCCCTCGCGCTTGGTTTGAGTGCTTTACAAGCTCCTTGATCCGT
TGTGGTTTCAAAGCCTCTCTATCTGATCCTTCACTATTTGTTAGGAATATCAGAGGCTCCATCACTTATTTGTTGCTTTATGTTGACGATATTGTACTTACTGGCAACGA
TCCTTCATATATTACCTCTCTTATTTCTCAGTTGAAGGAATTGTTTGATATGACTGATTTAGGAAGCCTATCTTACTTCTTTGGACTTGAATTCAAGCGTTTGTCTATTG
GTCTTTGTGTTACTCAGATAAAGTATACTATGGATCTGTTAGTTCGGTTTGGTATGGCTGAAGCTAAGATTTGCTCTACACCTTGTTCTACTGGCCATTTTTCTTCCACG
GCAACTCTTTGTTCTCCTTCTGATGCCACTGCATATCGAAGTATAGTTGGAGCTCTCCATTACCTCACGTTTACTCAACCAGACATATCGTTTGCCGTGAGTAGGGTTTC
ACAGTTTATGCATTCGCCTACACTTGATCATTTAGCTGCTGTCAAGCGTATCTTGAGGTATCTGCGTGGTACATTACAGCTGGGATTGCATCTTCGTGCTAGTTCTTTAA
CTCTCAGTGCTTTCTCGGACTCGGATTGGACTGGTAATCCGACTGATCGACGGTCTACTACTGGTTTTGTGGTTTTCTTGGGTGCAAATCCTATCTCTTGGATTACCAAG
AAACAATCCACAATCTTTCGCAGCTCAACCAAGGCTGAATACCGTGCATTGGCTACTACTACTGCTGAGATTATCTGGATTCAACAGCTTCTTAGTGAATTGTTCGTGTC
TCTGCCTCAGTCTCTTATACTTTACTGTGATAATCAGTCTGCCTTGCAGTTGGCTCGAAATCCAGTGTTCCATAGTCGAACGAAACATATTGAAGTTGACCTTCATTTTG
TTCGAGAACAGTTACAATCCTTTGATGCTTTCTCTCAAGGTAGCTTTGATCTACAAAGTCTTCAAGAAACTTTAATCTTCGGAGTTTTGAAGGATCTTGCAGACAGCTTC
AATCTTCTATATCTTCAGGGTCTTACAGAGAGCTTTGATCTTCTGAATCTTACAGAGGCTTCAATCTTCAGAGTCTTGGACGAAGCTTCAAGAGGAATGAAGCTTCAATC
TTCAGGTCTTGTAGAAGGAATGAAGCTTCAATCTTCAAGTCTTGTAGGAGGATCAGAGCTTCAGTCTTCAGAATCTTCTGAGAGTTTCAATTTTCCTGCCACCCCCTCAA
ATGAGCCAAGGTCTTCTATTTATAGAGTTTCCCAACGACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTAACTAGAGCTAGGTCAAATGTCCGTTTTACCCCTAAAGTTACATCTTGCTCCTTAAGTCCCACTGATCCTCTAATGAACAATTTGGTTTATGGTTTTACCAC
TAAACCAAGTCCCTTTCGGGCCAATGAGATGGTGGGGCCCCTTGATCAAGACCCAAAGTCAGCACTTAAGGGAATAACCTCTCTACTATCCTTGAAACGGGTAGGAGTGA
ATTCCTTCTTGCAGGATTATGTCCACAACTATCTACCCGGTCTCATCCCCAAAATGTGCAACCAAAGTTCCACATTGGCTAGACAAGGGGATGATCATGGATTTATAAGG
GAAGACAAGTACCTCCATTGGTATGGGGCTTTTTGGGTGGATCCAAAGCAAAGCCATGAGGACTTATGCCCAAAGTGGACAATATCATACCATTGTGGAGATCTGTGTCG
TGTCAGAAGCCGACCCTCGGCCCGCTTGCATGGACCGAGCTCTTCTGCTTCCGTTTGGTCCCTGGTGCCCCCGGCCGCCCCAGTTCCACTTGGTTTAGTTCGAATCGCCT
CTGAATGCCTAAAACATGTCATGCGTCTTCTCCCCTCTCAAACAAATTTACCGTTGGTGGTACGTGAAGGTCAGGACAAACGAACGGGTCGCATACTCTTCCATGGGCCC
AGTATTAATGGCCTTTATCCTCTGACTACTCAGTCCTCCGCTTCTCCATCCTCTGTTCCTAGCACTATTACTGCTCAACTAGGTACCAGAGTGAACTCTTCCATTTGGCA
TGACCGGTTAGGACATCCTAACTTTACTGTTCTTAAATCTGTACTACAGTTGTTAAATGTAGCATATTCCTCTTTCTCCTCGTTATGTTCACATTGTTTGAGTGGTAAGA
TGAGCAAGTTATCTTTTCCCTTGTCTCATACTCACTCTTCTCACCCCTTGGAACTTGTTCATAGTGATTCTGATGTTGCTGCCACTGTTTCCATGTTTAAATCATTTGCT
GAAAATCTTCTTTCTTTCAAGATGAAAACTCTTGCAACTCTTTTTGCTACACATGGCATTTTTCATCAAAGATCTTGTCCTCATACACCCGAGCAAAATGGGGTAGCCGA
GAGAAAACATCGCCATATTGTTGAGATGGCCCTTTCTTTGATATCCAAGTTTTCTGTTCCTCTTCGCTACTGGTATCATGCTTTTGCCTGTGCTACCTTCCTTATCAATC
GCTTGCCCTCTCTCTCCCTCGGTAATAAATCTCCTTTTGAGGTTCTATTTCGTAAGCCACCAGATTACTCACACTTACGGGTTTTCGGTTGTGCTTGTTATCCGCTCCTT
CGTCCTTATGTCTCTCACAAACTCCAACCTCGAACTACTCAGTGTGTGTTCATAGGTTATCCCCTTGGCTACAAAGGCTATCTATGCCTTGATCCTCTTACCGAACGCAT
TTATATCTCTCGACATGTTGTCTTTGATGAACATGTCTTTCCCTTTGTTCTTTCTTCACCTACCTCAAGTTCATCATCCACTCCTTCCATGCCTTCTGCTTCCTCTACTA
TTCCCACTCTTATTCGACTTTTTCAGCCCTTAGATTCAGTTGAGGAGTTGCCCACACCTTGTGAGTCTACTGTATCTGCTTCTACCTCTCCAAGTTTCTGTGATATGACT
ACTAATACTCCTACTGTTCAGACTTCTATTGGAGGTTGTAGTGTTGAATCTGTTGATACTAATGTTCCTGTTGCAAATGAATATGTTGATATTTCTCTTATGGATACTAA
TCCCTCTGTTGTGCCTACTGATGCCACGAAAATTGTTGGGCCTCTTGATGACCCCACTGTCTCCTCCAATTGCCATCCCATGAGAACTCGGTCCAAGTCTGGTATTGTCA
AGAAAAAGGTTTACTTGGCCAAACTTGACTCCTCACTCTCTACTGAGCCATCATCCTTTACACAAGCTTCTAAGTTGCCCGAATGGATCAAACGCAATCCGGATGGTAGT
GTGGCACGTTATAAGACACGTTTGGTTGCGAAAGAATATCATCAAAGAGAGGGTGTCGATTATGAGGAAACGTTCAGCCCTGTAGTCAAGAAACCTACTGTTAGGATTGT
TCTGTCCTTGGCTGCTCATCATGGTTGGGATCTTCGTCAGTTGGATGTTAAAAATGTGTTCCTACATGGTGATTTGAAGGAGGAGGTCTACATGCAACAACCTCAAGGAT
TCATTAGCAAACAATTCCCTTCTCATGTTTGTCGCCTAAATAAGTCTCTTTATGGACTCAAACAAGCCCCTCGCGCTTGGTTTGAGTGCTTTACAAGCTCCTTGATCCGT
TGTGGTTTCAAAGCCTCTCTATCTGATCCTTCACTATTTGTTAGGAATATCAGAGGCTCCATCACTTATTTGTTGCTTTATGTTGACGATATTGTACTTACTGGCAACGA
TCCTTCATATATTACCTCTCTTATTTCTCAGTTGAAGGAATTGTTTGATATGACTGATTTAGGAAGCCTATCTTACTTCTTTGGACTTGAATTCAAGCGTTTGTCTATTG
GTCTTTGTGTTACTCAGATAAAGTATACTATGGATCTGTTAGTTCGGTTTGGTATGGCTGAAGCTAAGATTTGCTCTACACCTTGTTCTACTGGCCATTTTTCTTCCACG
GCAACTCTTTGTTCTCCTTCTGATGCCACTGCATATCGAAGTATAGTTGGAGCTCTCCATTACCTCACGTTTACTCAACCAGACATATCGTTTGCCGTGAGTAGGGTTTC
ACAGTTTATGCATTCGCCTACACTTGATCATTTAGCTGCTGTCAAGCGTATCTTGAGGTATCTGCGTGGTACATTACAGCTGGGATTGCATCTTCGTGCTAGTTCTTTAA
CTCTCAGTGCTTTCTCGGACTCGGATTGGACTGGTAATCCGACTGATCGACGGTCTACTACTGGTTTTGTGGTTTTCTTGGGTGCAAATCCTATCTCTTGGATTACCAAG
AAACAATCCACAATCTTTCGCAGCTCAACCAAGGCTGAATACCGTGCATTGGCTACTACTACTGCTGAGATTATCTGGATTCAACAGCTTCTTAGTGAATTGTTCGTGTC
TCTGCCTCAGTCTCTTATACTTTACTGTGATAATCAGTCTGCCTTGCAGTTGGCTCGAAATCCAGTGTTCCATAGTCGAACGAAACATATTGAAGTTGACCTTCATTTTG
TTCGAGAACAGTTACAATCCTTTGATGCTTTCTCTCAAGGTAGCTTTGATCTACAAAGTCTTCAAGAAACTTTAATCTTCGGAGTTTTGAAGGATCTTGCAGACAGCTTC
AATCTTCTATATCTTCAGGGTCTTACAGAGAGCTTTGATCTTCTGAATCTTACAGAGGCTTCAATCTTCAGAGTCTTGGACGAAGCTTCAAGAGGAATGAAGCTTCAATC
TTCAGGTCTTGTAGAAGGAATGAAGCTTCAATCTTCAAGTCTTGTAGGAGGATCAGAGCTTCAGTCTTCAGAATCTTCTGAGAGTTTCAATTTTCCTGCCACCCCCTCAA
ATGAGCCAAGGTCTTCTATTTATAGAGTTTCCCAACGACTTTAA
Protein sequenceShow/hide protein sequence
MFVTRARSNVRFTPKVTSCSLSPTDPLMNNLVYGFTTKPSPFRANEMVGPLDQDPKSALKGITSLLSLKRVGVNSFLQDYVHNYLPGLIPKMCNQSSTLARQGDDHGFIR
EDKYLHWYGAFWVDPKQSHEDLCPKWTISYHCGDLCRVRSRPSARLHGPSSSASVWSLVPPAAPVPLGLVRIASECLKHVMRLLPSQTNLPLVVREGQDKRTGRILFHGP
SINGLYPLTTQSSASPSSVPSTITAQLGTRVNSSIWHDRLGHPNFTVLKSVLQLLNVAYSSFSSLCSHCLSGKMSKLSFPLSHTHSSHPLELVHSDSDVAATVSMFKSFA
ENLLSFKMKTLATLFATHGIFHQRSCPHTPEQNGVAERKHRHIVEMALSLISKFSVPLRYWYHAFACATFLINRLPSLSLGNKSPFEVLFRKPPDYSHLRVFGCACYPLL
RPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPLTERIYISRHVVFDEHVFPFVLSSPTSSSSSTPSMPSASSTIPTLIRLFQPLDSVEELPTPCESTVSASTSPSFCDMT
TNTPTVQTSIGGCSVESVDTNVPVANEYVDISLMDTNPSVVPTDATKIVGPLDDPTVSSNCHPMRTRSKSGIVKKKVYLAKLDSSLSTEPSSFTQASKLPEWIKRNPDGS
VARYKTRLVAKEYHQREGVDYEETFSPVVKKPTVRIVLSLAAHHGWDLRQLDVKNVFLHGDLKEEVYMQQPQGFISKQFPSHVCRLNKSLYGLKQAPRAWFECFTSSLIR
CGFKASLSDPSLFVRNIRGSITYLLLYVDDIVLTGNDPSYITSLISQLKELFDMTDLGSLSYFFGLEFKRLSIGLCVTQIKYTMDLLVRFGMAEAKICSTPCSTGHFSST
ATLCSPSDATAYRSIVGALHYLTFTQPDISFAVSRVSQFMHSPTLDHLAAVKRILRYLRGTLQLGLHLRASSLTLSAFSDSDWTGNPTDRRSTTGFVVFLGANPISWITK
KQSTIFRSSTKAEYRALATTTAEIIWIQQLLSELFVSLPQSLILYCDNQSALQLARNPVFHSRTKHIEVDLHFVREQLQSFDAFSQGSFDLQSLQETLIFGVLKDLADSF
NLLYLQGLTESFDLLNLTEASIFRVLDEASRGMKLQSSGLVEGMKLQSSSLVGGSELQSSESSESFNFPATPSNEPRSSIYRVSQRL