; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022881 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022881
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:40047306..40053150
RNA-Seq ExpressionLag0022881
SyntenyLag0022881
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07344.1 hypothetical protein Prudu_019255 [Prunus dulcis]5.8e-9828.47Show/hide
Query:  LRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVGSTSAKELWLSLEKRYSSK
        +RL+  N+V W YQ+ S+L+ + L+GH D S   P K+V S +   TTE++  Y QW   D++L++LL ATLS  A+ +V+G  SA++ WLSL  RY+S 
Subjt:  LRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVGSTSAKELWLSLEKRYSSK

Query:  TRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTMKKNTTADII
        +R+ + +L++ L++I K   +SVEKF  R+K + ++LAAA  K+ D++I++  LNGL  +F   +T +  R   + L++  A L + EK ++    A  +
Subjt:  TRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTMKKNTTADII

Query:  PTTMAAFKDHSP--TGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLS-------NGSNSRGI-LG
          +M      SP   G     +     +G A+     +P+     S+   R         G  Q+      G     ++S S       +GS +  +  G
Subjt:  PTTMAAFKDHSP--TGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLS-------NGSNSRGI-LG

Query:  SGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDL---------------ANLNLSNNYNGE
         GS +G+   + S  G   +NS    + CQIC K GH AL+CY+R NF+YQG  PP  + AM        NDL                 L  S  Y+ E
Subjt:  SGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDL---------------ANLNLSNNYNGE

Query:  ESVTV----GASINGLYLI----PSPSALSNSSMQPK-----------------------NFNFHA----KQECSLWHQRFGHPAPKTLHSSL-------
        + +T+    G S  GLY I    P PS +  S +                          N   HA    + +  LWH R GH +   LH  L       
Subjt:  ESVTV----GASINGLYLI----PSPSALSNSSMQPK-----------------------NFNFHA----KQECSLWHQRFGHPAPKTLHSSL-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHL
                                                                          +   FLINRMPS  LNM SP+  LFG  PDL  L
Subjt:  ------------------------------------------------------------------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHL

Query:  KVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQN
        K+FG A YPLL+P   HKL+P++ Q IFLGY + +KG +CY+  T K ++S HVI +E ++P       P                         T +Q 
Subjt:  KVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQN

Query:  PCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRN
           +PL                            S  + +  G LP+ V   +++N HPM+TR+KS          +  SS+ T                
Subjt:  PCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRN

Query:  AMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV--FAQEGCCVFILIGK
        AM EE  AL QQ  GT  ++                P P                          T++     L  Y+ +RLV   F+QE          
Subjt:  AMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV--FAQEGCCVFILIGK

Query:  KRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHGH------------------------LRKSLYGLKQSPRAWFDRFTTYLLTLGFVA
           +D     +PVV+  T+R+IL LA  + W L QLDVKNAFLHG                         L+KSLYGLKQ+PRAW  +FT YL TLGF  
Subjt:  KRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHGH------------------------LRKSLYGLKQSPRAWFDRFTTYLLTLGFVA

Query:  SAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDG-IFVNQAKYLNDLLQTSGMATAKTCLTPMST
        S +DPSLF++    +   LLLYVDDII+T SD + +  + + L+  F++ DLG LKYFLGLEV   + G IFVNQAKY  DL++ + M T K C TP   
Subjt:  SAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDG-IFVNQAKYLNDLLQTSGMATAKTCLTPMST

Query:  TVDLY--AFAPLFKDATLYRQLVDSLQYLTFTR
           +      PL  + TL+R +V +LQYLTFTR
Subjt:  TVDLY--AFAPLFKDATLYRQLVDSLQYLTFTR

KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa]1.2e-11636.58Show/hide
Query:  SSSAMVISSTNSSM-ASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNA
        +S+ +  SS+N S+  S LFLLSNICNLVP+RLDS+NYVLWKYQVSSILKAHSLFGHID++LP P K + SST    +EI+ +YLQW SRDQALITL+N 
Subjt:  SSSAMVISSTNSSM-ASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNA

Query:  TLSPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRT
        TLS  ALAHVV S S+K LWLSLEKRYSS TRSNIL+LRSALY+I K  SES+++++ RIK LV+KLAAASV ++DEEILVHTLNGL   F+AFRTSIRT
Subjt:  TLSPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRT

Query:  RSGSLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMG
        RSG+++LE+LH LL +EE  + K +  + IPT MAAF      G RGRGRR  H  G        +PNF    S S+     +NF               
Subjt:  RSGSLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMG

Query:  HVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDLANLNLSN
                    S SRG+   GSN                                                                           N
Subjt:  HVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDLANLNLSN

Query:  NYNGEESVTVGASINGLYLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAPKTLHSSLSRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVF
        N+     V  G  IN                                                                                     
Subjt:  NYNGEESVTVGASINGLYLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAPKTLHSSLSRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVF

Query:  GCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCS
                                                                                             T +NL   PS     
Subjt:  GCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCS

Query:  VPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMC
                                                            N H M+TRAKS IFKPKAF  TT  ++PT PTSYTEASKYPEWR  M 
Subjt:  VPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMC

Query:  EEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPL-GWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRI
        EEFNALQ Q  GT +++ +  + S   +G + W   I   L G   RH                                               K R +
Subjt:  EEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPL-GWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRI

Query:  DQGFLQNPV-VKKPTIRVILTLAAQYGWPLNQLDVKNAFLHGH---LRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVD
         +G+ Q  V + +PT                    +N     H   L KSLYGLKQ+PRAWF+RFT+YL TLGFVAS ADPSLFIRSV  S TYLLLYVD
Subjt:  DQGFLQNPV-VKKPTIRVILTLAAQYGWPLNQLDVKNAFLHGH---LRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVD

Query:  DIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYRQL
        DIIVT  D  YI  L+ QLA EF+ISDLG LKYFLGLE+ SS DGI VNQA+YLNDLL TSGM +AK+C+TP+ST++DLY  AP F D +LYR+L
Subjt:  DIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYRQL

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata]5.6e-10127.47Show/hide
Query:  ISSTNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQS----STGTDTTEISNDYLQWNSRDQALITLLNATLS
        ++ T S ++S   L+ ++ N V ++LD SNYV W +Q+  +L+ + +FG +D S+  P K+  S     T T++  I++DY  W   D+AL+TL+ ATLS
Subjt:  ISSTNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQS----STGTDTTEISNDYLQWNSRDQALITLLNATLS

Query:  PLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSG
          AL+ V+G  S++++W +L++R+S+ TR++I++++  L +I+K  SES++ +  RIKD  ++LAA  V I DE+I++  L GL  +F+  +  IR R  
Subjt:  PLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSG

Query:  SLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRG---RRNLHGRGRAAQPFASSPNF---PFHGSI-----------SDGRGAHSNFR
         ++L++L + L AEE T+ +      I + M A       GG   G     + H    +  P +S+P F   PF   +             G G ++NFR
Subjt:  SLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRG---RRNLHGRGRAAQPFASSPNF---PFHGSI-----------SDGRGAHSNFR

Query:  --KFGPQ--------QTFGPNHMGHVFGQQYSLSN---GSNSRGI-----LGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMN-
           F P+            P   G  F QQ S +N   GS+ + +        G    QI +     G  +   H  +  CQIC++ GH+A  C++R + 
Subjt:  --KFGPQ--------QTFGPNHMGHVFGQQYSLSN---GSNSRGI-----LGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMN-

Query:  --------FSYQGR-------------------------HPPTQLAAMA-------------------------WM-----QCSYENDLANLNLSNNYNG
                FS                             HPP    AM                          W+          +DL+N++++  Y+ 
Subjt:  --------FSYQGR-------------------------HPPTQLAAMA-------------------------WM-----QCSYENDLANLNLSNNYNG

Query:  EESVTVGASINGLYL----------------------IPSPS-------------------------------------ALSNSSMQP----------KN
         ++VT GA+  GL++                      +P  S                                      LSN+++ P            
Subjt:  EESVTVGASINGLYL----------------------IPSPS-------------------------------------ALSNSSMQP----------KN

Query:  FNFHAKQECSLWHQRFGHPAPKTLHSSLSR----------------------------------------------------------------------
             +   +LWH R GHPA   + ++LS+                                                                      
Subjt:  FNFHAKQECSLWHQRFGHPAPKTLHSSLSR----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---LVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQ
            V+LINRMP+  L+M SPFE L+  +P L HLK+FGCACYP LKP R HKL PKTS+ IFLGY   +KG IC+NP   K IVS HV+F E  FP + 
Subjt:  ---LVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQ

Query:  LSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVE---------------
        L ++  +S +   S +            S  PS      PL + IP+P     + S   S    +        + T+ S+P  +                
Subjt:  LSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVE---------------

Query:  -----------------TTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGK
                         +   VN HPM+TR+KS IFK K   S  V +    P+SY+ A K  EWR AM +E +AL QQ+  T   +  D       L  
Subjt:  -----------------TTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGK

Query:  RSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV--FAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPL
          W   I       ++HP                      +  Y+ +RLV   F+QE             +D     +PVVK  T+R++L+LAA  GW L
Subjt:  RSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV--FAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPL

Query:  NQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDL
        NQLDVKNAFLHG                         L++SLYGLKQ+PRAW +RFT +LLTLGF +S ADPSLF++  N+S   LLLYVDDII+T + +
Subjt:  NQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDL

Query:  TYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAF-APLFKDATLYRQLVDSLQYLTFTR
          +  + +QL  EF + +LG L YFLGL++   + G+FV+Q+KY+ DLL  + M   K CLTP      L    +P F D   YR +V +LQYLTFTR
Subjt:  TYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAF-APLFKDATLYRQLVDSLQYLTFTR

WP_081894301.1 DDE-type integrase/transposase/recombinase [Acetobacter malorum]5.1e-11028.59Show/hide
Query:  MVISSTNSSMASPLF-LLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQ--SSTGTDTTEISNDYLQWNSRDQALITLLNATL
        M+ ++    + SP+  L+ N+   V ++LD +NY++W YQ+  +L++H + G +D S   P +FV      G +T     +Y  W   D+AL+ L+  TL
Subjt:  MVISSTNSSMASPLF-LLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQ--SSTGTDTTEISNDYLQWNSRDQALITLLNATL

Query:  SPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRS
        SP A++ ++G TSA E+W++L  R+S+ T+++I +++  L +I+K  SES+ K+  RIKD+ + L+AA V  DD++I++  L GL  +++ FRT IR R 
Subjt:  SPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRS

Query:  GSLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFG-----PN
          ++L+D  A L AEE T++ N  +    T M A  + S   G      + H +G     F+   + P+HGS S+   +  ++   GP    G      N
Subjt:  GSLTLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFG-----PN

Query:  HMGHVFGQQYSLSN----GSNSRGIL-------------GSGSNSGQIIENGSAFGSAYNNSHGG------RISCQICSKFGHSALDCYNRMNFSYQGRH
        + G   G+  S SN    G+NS GIL             G+G  + QI        S     H         + CQIC KFGHSAL CY+R NFSYQGR 
Subjt:  HMGHVFGQQYSLSN----GSNSRGIL-------------GSGSNSGQIIENGSAFGSAYNNSHGG------RISCQICSKFGHSALDCYNRMNFSYQGRH

Query:  PPTQLAAM------------AWM-----QCSYENDLANLNLSNNYNGEESVTV-----------------------------------------------
        PP+ L  M             W+          +DL NL  +  + G +++T                                                
Subjt:  PPTQLAAM------------AWM-----QCSYENDLANLNLSNNYNGEESVTV-----------------------------------------------

Query:  -------------------------GASINGLY---------LIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAP--------------------
                                 G   +GLY         ++P  S  S+S    +          SLWH R GHP+                     
Subjt:  -------------------------GASINGLY---------LIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAP--------------------

Query:  ---------------------------KTLHS--------------------------------------------------------------------
                                   + LHS                                                                    
Subjt:  ---------------------------KTLHS--------------------------------------------------------------------

Query:  ----------------------------------------------SLSRL------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCA
                                                      S ++L            V+LINRMP S+L+M SP+  LF     L HLKVFG +
Subjt:  ----------------------------------------------SLSRL------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCA

Query:  CYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTP--SQNPCSV
        CYPLLKP   +KLQPKT Q IFLGY   +KG IC+NP + +F VS HV+F+ET FP+  L  KP+   +    T    T L+T  N+  +P  S +  S+
Subjt:  CYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTP--SQNPCSV

Query:  PLT--------------------NPIPLPTTGRTNPSIVES--HDNITCTDGSCQIAETVGSLPS--------------IVETTNNVNIHPMRTRAKSRI
        PL+                    +PI +P    ++P + +   H + + T  S  +  +  S  S              IV   ++V++HPM+TR+KS I
Subjt:  PLT--------------------NPIPLPTTGRTNPSIVES--HDNITCTDGSCQIAETVGSLPS--------------IVETTNNVNIHPMRTRAKSRI

Query:  FKPKAFLS--TTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPT
         K KAF++  ++V      P+++  ASK  EW++AM +E +AL  Q                      +W L    PL  G+   +V        +K P 
Subjt:  FKPKAFLS--TTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPT

Query:  SLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG-----------------------
          + +     Y+ +RLV              ++  ID G   +PVVK  T+R+IL LAAQ+ W L QLDVKNAFLHG                       
Subjt:  SLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG-----------------------

Query:  --HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSST
           L KSLYGLKQ+PRAW D+FT++L  LGF AS ADPSLF++  +     LLLYVDDII+T S  + I ++   L   F + DLG L YFLGL++   +
Subjt:  --HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSST

Query:  DGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYA-FAPLFKDATLYRQLVDSLQYLTFTR
         G+FV+Q KY+ +LL    M+ +K C TP   +  L       +     YR +V +LQYLTFTR
Subjt:  DGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYA-FAPLFKDATLYRQLVDSLQYLTFTR

XP_018499249.1 PREDICTED: uncharacterized protein LOC108865949 [Pyrus x bretschneideri]4.3e-10129.21Show/hide
Query:  LSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVGSTSAKELWL
        ++N+  +VP +L+  N++ W+     +LK   L G ++     P +F++ S+GT     S  +  W  RDQ L+  +N+TLS   L   +G   ++ LW 
Subjt:  LSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVGSTSAKELWL

Query:  SLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTM
        SLE+R+S  +R+++  LRS + +I+K    S+  F   IKD+ +KLAAA   + + +++ + L+GL D++ +F  SI TR+ SLT ++LH LL ++E ++
Subjt:  SLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTM

Query:  KKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLSNGSN-----SR
        +K  T  +  +T A F  +                  AAQP     NF    +    RG   N  ++   +TFG N+  +        +N +N     S 
Subjt:  KKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLSNGSN-----SR

Query:  GILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-------WMQCSYEN-----DLANLNLSNNYNG
        GILGSG          S   S  ++S G  + CQ+C +FGH A  C     F+      PT ++AM        W+  S  +     D + LN +  Y+G
Subjt:  GILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-------WMQCSYEN-----DLANLNLSNNYNG

Query:  EESVTVGAS---------------------INGLYLIPSPS-------------------------------------ALSNSSMQPKNFN---------
         + + VG                       +N + L+P  S                                       S   + P  +N         
Subjt:  EESVTVGAS---------------------INGLYLIPSPS-------------------------------------ALSNSSMQPKNFN---------

Query:  ------FHAKQECSLWHQRFGHPAPKTLHS--------------SLSRL---------------------------------------------------
                AK +  +WH+R GHP+   LHS              SLS L                                                   
Subjt:  ------FHAKQECSLWHQRFGHPAPKTLHS--------------SLSRL---------------------------------------------------

Query:  ---------------------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYN
                                   ++LINR+PS S  + SP+E LF  +P  H LK FGCACYP L+P  + KL  K+ Q +F GY L+  G  C++
Subjt:  ---------------------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYN

Query:  PHTKKFIVSHHVIFHETIFPFSQLSSK---PNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIA
        P + +  +S HV+F E+ FP+  LSS+   P+      + T     L L + +L   PS    S   T P+   T    +  ++ES         S    
Subjt:  PHTKKFIVSHHVIFHETIFPFSQLSSK---PNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIA

Query:  ETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLST---------TVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPT
        E+  SLP        VN H M TRAK+ I KPK F +T         +++++P  P+++ +ASK   W +AM  EF ALQ    GT  ++  +  ++   
Subjt:  ETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLST---------TVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPT

Query:  LGKRSWPLPI-CRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV---FAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQ
        L    W   +  +P G  ER+                             +RLV      QEG            +D     +PV K  TIR++L++A  
Subjt:  LGKRSWPLPI-CRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVV---FAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQ

Query:  YGWPLNQLDVKNAFLHGH------------------------LRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIV
        Y W ++QLDV NAFLHGH                        LRKSLYGLKQ+PRAW++ F T LL+LGF  S +DPSLFI+  + S T++L+YVDDII+
Subjt:  YGWPLNQLDVKNAFLHGH------------------------LRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIV

Query:  TRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYRQLVDSLQYLTF
        T S  T    +   L   F + DLG + YFLG+EVH S  G+ ++Q KY  DLL+ + M  AK C TP+S T  L     L  D T YR  V +LQYLT+
Subjt:  TRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYRQLVDSLQYLTF

Query:  TRRTL
        TR  L
Subjt:  TRRTL

TrEMBL top hitse value%identityAlignment
A0A2N9EFT0 Uncharacterized protein1.5e-13631.81Show/hide
Query:  NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVV
        N+ +++P+ LLSNI NLV ++LD++NY++WKYQV+SIL+A+SL   ID S P P KF++   G+ +  ++++Y +W SRD+ L+T+LNATLSP  L+ VV
Subjt:  NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVV

Query:  GSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLH
        G  SA+ +W +LEKR++S  RSNIL L+  L+ + K +++ V+ F  R+K+  +KL A  V I DEEIL   L GL  +F + R++IRTR+  ++ ++L 
Subjt:  GSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLH

Query:  ALLNAEEKTMKKNTTADIIPTTMAAFK-------DHSP-----TGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHM
         LL+AEE ++K N  A   P  MA          +H+P     +  RGRGR N +GRGR              G  ++GRG   N       Q F  N  
Subjt:  ALLNAEEKTMKKNTTADIIPTTMAAFK-------DHSP-----TGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHM

Query:  GHVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA----------------
                                           FG++ N S   R  CQIC K GH ALDCY+RM++SYQGRHPP +LAA+A                
Subjt:  GHVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA----------------

Query:  -----------------WMQ----------------------------CSYENDLANLNLSNNYNGEESVTV-------------GASINGLYLIPSPSA
                         W Q                              +  DL NLN   +Y G + V++             G+S +GLY I   S+
Subjt:  -----------------WMQ----------------------------CSYENDLANLNLSNNYNGEESVTV-------------GASINGLYLIPSPSA

Query:  LS------NSSMQPKNFNFHAKQECSLWHQRFGHPAPKTLHSSL--------------------------------------------------------
                +SS  P +     K   S+WH R GHP    LHS L                                                        
Subjt:  LS------NSSMQPKNFNFHAKQECSLWHQRFGHPAPKTLHSSL--------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKK
                           S  ++LINRMP+ +L  +SP++ LF   PD   LK FGC C+PLL+P  +HKL+P++S  +FLGY L+ KG +C N  T K
Subjt:  -------------------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKK

Query:  FIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTS----DPTLLLTLLNLSTTPSQNPC--SVPLTNPIPLPTTGRTNPSIV---ESHDNITCTDGSCQI
         ++S HV FHE  FPF   +S  + ST S T  +S     P    ++  L   PS  P   S PL++ +P  +     PS +    S  +I+C   SC  
Subjt:  FIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTS----DPTLLLTLLNLSTTPSQNPC--SVPLTNPIPLPTTGRTNPSIV---ESHDNITCTDGSCQI

Query:  AETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLST-TVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWP
          +V S P +      +N HPM+TR KS I K K  L T T++ + T P SY  ASKYPEW++AM +E+ ALQ+Q+                     +W 
Subjt:  AETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLST-TVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWP

Query:  LPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVK
            +P G   R+                             +RLV              ++  +D     +PVVK  T+R+IL++AAQ+ W L QLDV 
Subjt:  LPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVK

Query:  NAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVL
        NAFLH                          L+KSLYGLKQ+PRAWF+RFT+ LL LGF AS ADPSLFI   + +  +LL+YVDDII+T +  + +  L
Subjt:  NAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVL

Query:  QKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDL-YAFAPLFKDATLYRQLVDSLQYLTFTRRTL
         +QLAT F++ DLG L YFLGLEV  S  G FV+Q KY +DLLQ   M   K C TP  T+V L         DAT +R LV +LQYLTFTR  L
Subjt:  QKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDL-YAFAPLFKDATLYRQLVDSLQYLTFTRRTL

A0A2N9FTM6 Uncharacterized protein6.4e-13533.36Show/hide
Query:  STNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAH
        S++SSM +PL +LSN+ NL+ ++LDSSN+++WK+Q+SSILKA+S+   +D ++P P +F+  + G  T+  + ++  WN+RDQAL+TL+N+TLS   L+ 
Subjt:  STNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAH

Query:  VVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLED
        VVG  SA+ +W +LE+R++S +R+N+L L+  L+++KK   ESV  +  ++K+  +KL A  + ID+EE+L   L GL  ++  F ++IRTR+  ++ E+
Subjt:  VVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLED

Query:  LHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSL
        +  LL  EE ++ + + +      MA F   +P        RN          F S  +F  + S S GRG  ++ R  G +     NH       QYS 
Subjt:  LHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSL

Query:  SNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-----------WMQCSYEND-----L
           S S+G   S  +S              N   G R  CQIC K GH ALDCY+RM+F+YQGRHPP +LAAMA           W+  +   D     +
Subjt:  SNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-----------WMQCSYEND-----L

Query:  ANLNLSNNYNGEESVTVG----ASINGL--------------------YLI---PSPSALSNSSMQ---PKNFNF-------------------------
         NLN   +Y G E V VG      IN +                    +LI   PS   L  ++++    K+  F                         
Subjt:  ANLNLSNNYNGEESVTVG----ASINGL--------------------YLI---PSPSALSNSSMQ---PKNFNF-------------------------

Query:  -----HAKQECSLWHQRFGHPAPKTL----HSSL---------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKT
             H  Q+  +  ++  H     L    H+SL         +  + LINR+P+  L+  SP+E LF   PD+ HLK FGC C+PLL+P   HKLQP++
Subjt:  -----HAKQECSLWHQRFGHPAPKTL----HSSL---------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKT

Query:  SQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTST-TSDPTLLLTLLNLS----TTPSQ----NPCSVPLTNP-IPLP
        +  IFLGYP   KG +C +P T +  +S HV+F+ET F    L S+  +  T  TST  S P LL+ L N S    T P++    +P  V + +P +P+ 
Subjt:  SQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTST-TSDPTLLLTLLNLS----TTPSQ----NPCSVPLTNP-IPLP

Query:  TTGRTNPSIVESHDNITCTDGSCQIAETVGS---LPSI----------VETTNN---------------------------------------VNIHPMR
             +P    S ++ T +  S + A +V +   +PS+           +TT++                                       V+ HPM+
Subjt:  TTGRTNPSIVESHDNITCTDGSCQIAETVGS---LPSI----------VETTNN---------------------------------------VNIHPMR

Query:  TRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFS
        TRA++ IFKPK + +       T P +Y  ASKYP W  AM EEF+ALQ+Q                     ++W L    PL  G+   +V        
Subjt:  TRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFS

Query:  QKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------
                 + V ++ R S   +   +   V     ++  ID     +PVVK PT+R+IL LA  Y WPL QLDV+NAFLHG                  
Subjt:  QKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------

Query:  ------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEV
               L KS+YGLKQ+PRAWF+ FTT LLTLGF  S+AD SLFI        +LLLYVDDI++T ++  ++  L   L+  F++ D+G L YFLGL++
Subjt:  ------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEV

Query:  HSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPM--STTVDLYAFAPLFKDATLYRQLVDSLQYLTFTRRTL
          S+ G+ + Q KY  DLL    M     C TP   +T +      PL  D   YR LV +L YLTFTR  L
Subjt:  HSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPM--STTVDLYAFAPLFKDATLYRQLVDSLQYLTFTRRTL

A0A2N9H491 Uncharacterized protein1.9e-13934.94Show/hide
Query:  SMASSSAMVISST-----NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQA
        SMAS S+   ++       +S+ +PL LLSN+ NL+ ++LDS+N+++WK+Q+SSILKA+S+   +D ++P P +F+  + G  TT ++ D+  WN+RDQA
Subjt:  SMASSSAMVISST-----NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQA

Query:  LITLLNATLSPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSA
        L+TL+N+TLSP  L+ VVG  SA+ +W +LE R++S +R+N+L L+  L+++KK  +ESV  +  ++K+  +KL A    ID+EE+L   L GL  ++  
Subjt:  LITLLNATLSPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSA

Query:  FRTSIRTRSGSLTLEDLHALLNAEEKT-MKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQ
        F ++IRTR+  +  E++  LL  EE++ ++ + +   +   MA F   +            H R       +S+    F+ + S GRG   N  + G   
Subjt:  FRTSIRTRSGSLTLEDLHALLNAEEKT-MKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQ

Query:  TFGPNHMGHVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWM-------
         F  N+       QYS SN SN      S S   Q          A     G R  CQIC K GH ALDCY+RM+F+YQGRHPP +LAAMA +       
Subjt:  TFGPNHMGHVFGQQYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWM-------

Query:  ------------QCSYENDLANLNLSNNYNGEESVTVG--------ASINGLYLIPSPSALSNSSMQPKNFNF-HAKQECSLWHQRFGHPAPKTL----H
                    QC+  +   +  + N Y    ++ +             G Y     +A   SS      +  H  Q+  L  ++  H     L    H
Subjt:  ------------QCSYENDLANLNLSNNYNGEESVTVG--------ASINGLYLIPSPSALSNSSMQPKNFNF-HAKQECSLWHQRFGHPAPKTL----H

Query:  SSL---------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHV
        +SL         +  + LINR+P+  L+  SP+E LF   PD+ HL+ FGC C+P L+P   HKLQP+T+  IFLGYP   KG IC +P TK+  +S HV
Subjt:  SSL---------SRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHV

Query:  IFHETIF-PFSQLSSKPNISTTSTTST-TSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDG-----------SCQIAET
        +F+E  F P   L+S P   T   TST  S P LL+ L N S+T   N  S  L    PLP +     S+VES   +  TD            S   +  
Subjt:  IFHETIF-PFSQLSSKPNISTTSTTST-TSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDG-----------SCQIAET

Query:  VGSLPSI-----------VETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPT
          SLPS            +     VN HPM TR+K+ IFKPK+F +TT+    T P ++  ASKYP W  AM +E+ ALQ+Q                  
Subjt:  VGSLPSI-----------VETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPT

Query:  LGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWP
            +W L +  PLG                      +  + V ++ R S   +   +   V     ++  ID     +PVVK PT+R+IL L   Y WP
Subjt:  LGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWP

Query:  LNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSD
        L QLD++NAFLHG                         L KS+YGLKQ+PRAWF+ FTT LL LGF++S+AD SLFI     +  +LLLYVDDI++T ++
Subjt:  LNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSD

Query:  LTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPL-FKDATLYRQLVDSLQYLTFTRR
         +++  L   L+  F++ D+G L YFLGL++  S+ G+ + Q KY  DLL    M     C TP    V L +   +   D   YR LV +L YLTFTR 
Subjt:  LTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPL-FKDATLYRQLVDSLQYLTFTRR

Query:  TL
         L
Subjt:  TL

A0A2N9HIT4 Uncharacterized protein9.9e-13632.31Show/hide
Query:  NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVV
        +++ +SPL LL+N+ NL+  +LDSSNY++WK Q++++L A+S+  H+D S+P+P +F+ + TG     ++ D+L WN +D+AL+TLL +T S   LA VV
Subjt:  NSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVV

Query:  GSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLH
        G +S++E+W +LE+R++S  RSN+L L+  L SIKK  +E+V  +  RIK + +KL+A  V  D EE+    L GL  +F+ F ++IRTR   L+LE L 
Subjt:  GSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLH

Query:  ALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLSN
         LL  EE++M +  T  +  + +A F  H+     G    N  GRGR               S S GRG       F   Q F   +    +  QY    
Subjt:  ALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLSN

Query:  GSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-------------WMQCSYENDLANLNL
            + I  S     Q    G +           R +CQIC K GH A+DCY+RMNF+YQG++P T+LAAMA             W+  +   D    N 
Subjt:  GSNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMA-------------WMQCSYENDLANLNL

Query:  SN-----NYNGEESVTV-------------------------------------GASINGLYLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAP
        +N      Y G+E V+V                                     G S NG+Y I SP  +  ++      +  +  +  LWH R GHP+ 
Subjt:  SN-----NYNGEESVTV-------------------------------------GASINGLYLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAP

Query:  KTLHS----------------------------------------------------------------------------SLSRLVFLINRMPSSSLNM
        K L +                                                                            ++S    LIN++P+ +L  
Subjt:  KTLHS----------------------------------------------------------------------------SLSRLVFLINRMPSSSLNM

Query:  SSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNI-STTSTTSTTS
         SP+ETL+  +PDL HLK FGC C+PLL P   HKL PKT+  +F+GYPL+ KG  C NP T +   S HV+F+ET+FP  + S+  +  S  S  S ++
Subjt:  SSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNI-STTSTTSTTS

Query:  DPTLLLTLL---------------------------NLS---------------------------TTPSQNPC--SVPLTNPIPLPTTGRTNPSIVESH
        D  L   LL                           NL+                           T  S NP   +VP  + +PLPT    NP+   + 
Subjt:  DPTLLLTLL---------------------------NLS---------------------------TTPSQNPC--SVPLTNPIPLPTTGRTNPSIVESH

Query:  DNITCTDGSCQIAETV-GSLPSIVETTN----NVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIIT
                S   A TV  ++P    +T+     V  HPM+TR+KS I+KPK   +  +    T PTSYT ASK+  W  AM EEF ALQ+Q+  T +++ 
Subjt:  DNITCTDGSCQIAETV-GSLPSIVETTN----NVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIIT

Query:  QDGIHSFPTLGKRSWPLPICR-PLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVI
                       PLP  +  +G    + + +HS                 +  Y+ +RLV              ++  ID     +PV+K PT+R++
Subjt:  QDGIHSFPTLGKRSWPLPICR-PLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVI

Query:  LTLAAQYGWPLNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLY
        L+LA    WPL QLDVKNAFLHG                         L+KS+YGLKQ+PRAWF+ FTT L  LGF+AS AD SLFI + NK   YLLLY
Subjt:  LTLAAQYGWPLNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLY

Query:  VDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPM--STTVDLYAFAPLFKDATLYRQLV
        VDDI++T +   ++  L  +L++ F + DLG+L YFLGL++  +   +++NQAKY  DLL+   M   K   +P   +T + L+   PL  D   YR +V
Subjt:  VDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPM--STTVDLYAFAPLFKDATLYRQLV

Query:  DSLQYLTFTR
         +L YLTFTR
Subjt:  DSLQYLTFTR

A0A2N9INQ9 Uncharacterized protein2.7e-14134.24Show/hide
Query:  MVISSTNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPL
        MV     SS+ SPL LLS++ NL+P++LDS+N+++WK+Q+SSIL+A+S+   +D ++  P +F+  + G  TT ++ ++  W+ RDQAL+TL+N+TLSP 
Subjt:  MVISSTNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPL

Query:  ALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSL
         L  VVG  SA+ +W +LE R++S +RSN+L L+  L+++KK  SESV  +  +IK+  +KL A    ID+EE+    L GL  +F  F ++IRTR+  +
Subjt:  ALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSL

Query:  TLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQ
        + E++  LL  EE+++ ++  +     +MA F   +P        RN           +S  +F  H + + GRG ++N+R  G    F  N+  + F Q
Subjt:  TLEDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQ

Query:  QYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAY-NNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDLANLNLSNNYNG
                N++       +S    +  + F   +   S   R  CQIC K GH ALDCY+RM+F+YQGRHPP +LAAMA    S +    +      Y G
Subjt:  QYSLSNGSNSRGILGSGSNSGQIIENGSAFGSAY-NNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDLANLNLSNNYNG

Query:  EESVTVG-------------------------ASINGL--YLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAPK----------TLHSSLSRL-
         + V VG                         AS+  L    I               FN H      + H    H   +           + S+L+ L 
Subjt:  EESVTVG-------------------------ASINGL--YLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAPK----------TLHSSLSRL-

Query:  ----------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHH
                        V LINR+P+ +L+  +P+E LF   PDL HL+ FGC C+P L+P   HKLQP+T+  +FLGYP   KG IC +PHT +  +S H
Subjt:  ----------------VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHH

Query:  VIFHETIF-PFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGR---------TNPS--------IVESHDNITCTDGS
        V+F+E+ F     L S+   S  S  +++ D T  L +L  S TP   P S P ++ +P+P+             +PS        + + H  I  +  S
Subjt:  VIFHETIF-PFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGR---------TNPS--------IVESHDNITCTDGS

Query:  CQI-AETVGSLPSIVETT--NNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLG
          +   T  S+P ++ +T     N HPM TR+K+ IFKPK F ++      T P++Y  ASK+P+W  AM EE++ALQ+Q   T  ++        P  G
Subjt:  CQI-AETVGSLPSIVETT--NNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLG

Query:  KRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLN
        K    +  C+   W                          V ++ R S   +   +   V     ++  ID     +PVVK PT+R+IL LA  Y WPL 
Subjt:  KRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLN

Query:  QLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLT
        QLDV+NAFLHG                         L KS+YGLKQ+PRAWF+ FTT L+TLGF +S AD SLFI S   +  +LLLYVDDI++T ++  
Subjt:  QLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLT

Query:  YIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAF--APLFKDATLYRQLVDSLQYLTFTRRT
        ++  L   L+  F++ DLG L +FLGL++  S+ G+ + Q KY  DLL    M     C TP    V L A    PL  D   YR LV +L YLTFTR  
Subjt:  YIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAF--APLFKDATLYRQLVDSLQYLTFTRRT

Query:  L
        L
Subjt:  L

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-1924.71Show/hide
Query:  FLINRMPSSSLNMSS--PFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGY-PLDFKGCICYNPHTKKFIVSHHVIF-----------
        +LINR+PS +L  SS  P+E      P L HL+VFG   Y  +K  +Q K   K+ + IF+GY P  FK    ++   +KFIV+  V+            
Subjt:  FLINRMPSSSLNMSS--PFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGY-PLDFKGCICYNPHTKKFIVSHHVIF-----------

Query:  -HETIF----PFSQLSSKPN----ISTTSTTSTTSDPTLLLTLLNLSTTPSQN-PCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLP
          ET+F      S+  + PN    I  T   + + +   +  L +   + ++N P           P   +   +I    D+          ++      
Subjt:  -HETIF----PFSQLSSKPN----ISTTSTTSTTSDPTLLLTLLNLSTTPSQN-PCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLP

Query:  SIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTL----GKRSWPLPICR
         + E+  + N +  R    +   K     + T +         +E  K  + + +  EE N+L +      + I  D  +SF  +     K SW   I  
Subjt:  SIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTL----GKRSWPLPICR

Query:  PLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQN----------PVVKKPTIRVILTLAAQYGWPLN
         L         +H + +    T T     E   I   SR V   +       +  K R + +GF Q           PV +  + R IL+L  QY   ++
Subjt:  PLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQN----------PVVKKPTIRVILTLAAQYGWPLN

Query:  QLDVKNAFLHGHLR----------------------KSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFI---RSVNKSFTYLLLYVDDIIVTRSDL
        Q+DVK AFL+G L+                      K++YGLKQ+ R WF+ F   L    FV S+ D  ++I    ++N++  Y+LLYVDD+++   D+
Subjt:  QLDVKNAFLHGHLR----------------------KSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFI---RSVNKSFTYLLLYVDDIIVTRSDL

Query:  TYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDA-TLYRQLVDSLQYLTFTRR
        T +   ++ L  +F+++DL  +K+F+G+ +    D I+++Q+ Y+  +L    M       TP+ + ++ Y      +D  T  R L+  L Y+    R
Subjt:  TYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDA-TLYRQLVDSLQYLTFTRR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.6e-2927.15Show/hide
Query:  FLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKP
        +LINR PS  L    P            HLKVFGC  +  +   ++ KL  K+   IF+GY  +  G   ++P  KK I S  V+F E+           
Subjt:  FLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKP

Query:  NISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFK
         + T +  S      ++   + + +T S NP S   T    +   G     ++E  +         Q+ E V  +     T       P+R   + R+ +
Subjt:  NISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFK

Query:  PKAFLST--TVSSVPTYPTSYTEASKYPE---WRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTP
         + + ST   + S    P S  E   +PE      AM EE  +LQ           ++G +    L K   PL  C+   W      V    +D   K  
Subjt:  PKAFLST--TVSSVPTYPTSYTEASKYPE---WRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTP

Query:  TSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG----------------------
                L  Y+ +RLVV   E         +K+ ID   + +PVVK  +IR IL+LAA     + QLDVK AFLHG                      
Subjt:  TSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG----------------------

Query:  --HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNK-SFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEV--H
           L KSLYGLKQ+PR W+ +F +++ +  ++ + +DP ++ +  ++ +F  LLLYVDD+++   D   I  L+  L+  F + DLG  +  LG+++   
Subjt:  --HLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNK-SFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEV--H

Query:  SSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMS
         ++  ++++Q KY+  +L+   M  AK   TP++
Subjt:  SSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMS

P92519 Uncharacterized mitochondrial protein AtMg008104.9e-1538.94Show/hide
Query:  YLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYR
        YLLLYVDDI++T S  T + +L  QL++ F + DLG + YFLG+++ +   G+F++Q KY   +L  +GM   K   TP+   ++       + D + +R
Subjt:  YLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYR

Query:  QLVDSLQYLTFTR
         +V +LQYLT TR
Subjt:  QLVDSLQYLTFTR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.3e-6331.26Show/hide
Query:  VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQL---
        V+LINR+P+  L + SPF+ LFG +P+   L+VFGCACYP L+P  QHKL  K+ Q +FLGY L     +C +  T +  +S HV F E  FPFS     
Subjt:  VFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQL---

Query:  ------------------SSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQN----------------PCSVPLTNPI---PLPTTGRTNP-----SIVE
                          ++ P  +      + SDP    T  +  + P +N                P S   T P    P PTT  T       S   
Subjt:  ------------------SSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQN----------------PCSVPLTNPI---PLPTTGRTNP-----SIVE

Query:  SHDNITCTDGSCQIAETVGS-------------------------------LPSIVETTNN-----VNIHPMRTRAKSRIFKPKAFLSTTVS-SVPTYPT
        +  N    +   Q+A+++ +                                P + +  NN     +N H M TRAK+ I KP    S  VS +  + P 
Subjt:  SHDNITCTDGSCQIAETVGS-------------------------------LPSIVETTNN-----VNIHPMRTRAKSRIFKPKAFLSTTVS-SVPTYPT

Query:  SYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQ
        +  +A K   WRNAM  E NA                      +G  +W L    P      H  +      F++K  +   L      Y+ +RLV    
Subjt:  SYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQ

Query:  EGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRF
                  ++  +D     +PV+K  +IR++L +A    WP+ QLDV NAFL G                         LRK+LYGLKQ+PRAW+   
Subjt:  EGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------------HLRKSLYGLKQSPRAWFDRF

Query:  TTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMAT
          YLLT+GFV S +D SLF+    KS  Y+L+YVDDI++T +D T +      L+  F + D   L YFLG+E      G+ ++Q +Y+ DLL  + M T
Subjt:  TTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMAT

Query:  AKTCLTPM--STTVDLYAFAPLFKDATLYRQLVDSLQYLTFTR
        AK   TPM  S  + LY+   L  D T YR +V SLQYL FTR
Subjt:  AKTCLTPM--STTVDLYAFAPLFKDATLYRQLVDSLQYLTFTR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.2e-5931.01Show/hide
Query:  PKTLHS-SLSRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFH
        PKT    + S  V+LINR+P+  L + SPF+ LFG  P+   LKVFGCACYP L+P  +HKL+ K+ Q  F+GY L     +C +  T +   S HV F 
Subjt:  PKTLHS-SLSRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQHIFLGYPLDFKGCICYNPHTKKFIVSHHVIFH

Query:  ETIFPFS------------QLSSKPNISTTSTTSTT---------------------SDPTLLLTLLNLS----------------TTPSQN---PCSVP
        E  FPFS            +  S PN  + +T  TT                     S P+ L T    S                T PS N   P + P
Subjt:  ETIFPFS------------QLSSKPNISTTSTTSTT---------------------SDPTLLLTLLNLS----------------TTPSQN---PCSVP

Query:  ------------LTNP-------------IPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSI--------VETTNNVNIHPMRTRAKSRIFKPK
                    L NP              PLP +  ++P I     +I+  +     + +   LP +        V     VN H M TRAK  I KP 
Subjt:  ------------LTNP-------------IPLPTTGRTNPSIVESHDNITCTDGSCQIAETVGSLPSI--------VETTNNVNIHPMRTRAKSRIFKPK

Query:  AFLSTTVS-SVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQ
           S   S +  + P +  +A K   WR AM  E NA                      +G  +W L    P        +     R    K   S    
Subjt:  AFLSTTVS-SVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRSWPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQ

Query:  EVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------------HLRK
          L  Y+ +RLV              ++  +D     +PV+K  +IR++L +A    WP+ QLDV NAFL G                         LRK
Subjt:  EVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG------------------------HLRK

Query:  SLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVN
        ++YGLKQ+PRAW+    TYLLT+GFV S +D SLF+    +S  Y+L+YVDDI++T +D   +      L+  F + +   L YFLG+E      G+ ++
Subjt:  SLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVN

Query:  QAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFA-PLFKDATLYRQLVDSLQYLTFTRRTL
        Q +Y  DLL  + M TAK   TPM+T+  L   +     D T YR +V SLQYL FTR  L
Subjt:  QAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFA-PLFKDATLYRQLVDSLQYLTFTRRTL

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.4e-0924.22Show/hide
Query:  LFLLSNICNLVPLRLD--SSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLAL-AHVVGSTS
        ++ +SNI + +P+ LD   SNY  W+    +   +  + GHID              GT     +ND + W  RD  +   L  TL+P       V S++
Subjt:  LFLLSNICNLVPLRLD--SSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLAL-AHVVGSTS

Query:  AKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLN
        ++++WL ++ ++ +   +  L L S L + K +    V  +  ++K L + L    V + D  ++++ LNGL+  F      I+ R    + +D   +L 
Subjt:  AKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLN

Query:  AEE----KTMKKNTT-------------ADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNH
         EE    + +K N T             ++  P T       +  G RGRGR N   RGR  +   S  N P   S +      ++++ +     + P  
Subjt:  AEE----KTMKKNTT-------------ADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNH

Query:  MGHVFGQQYSLSNGSNSRGILG
                Y  +NG N  G+LG
Subjt:  MGHVFGQQYSLSNGSNSRGILG

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-3235.77Show/hide
Query:  KKRRIDQGFLQ----------NPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG----------------------------HLRKSLYGLKQSPRAW
        K R + +G+ Q          +PV K  ++++IL ++A Y + L+QLD+ NAFL+G                            +L+KS+YGLKQ+ R W
Subjt:  KKRRIDQGFLQ----------NPVVKKPTIRVILTLAAQYGWPLNQLDVKNAFLHG----------------------------HLRKSLYGLKQSPRAW

Query:  FDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTS
        F +F+  L+  GFV S +D + F++     F  +L+YVDDII+  ++   +  L+ QL + F++ DLG LKYFLGLE+  S  GI + Q KY  DLL  +
Subjt:  FDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTS

Query:  GMATAKTCLTPMSTTVDLYAFA-PLFKDATLYRQLVDSLQYLTFTR
        G+   K    PM  +V   A +   F DA  YR+L+  L YL  TR
Subjt:  GMATAKTCLTPMSTTVDLYAFA-PLFKDATLYRQLVDSLQYLTFTR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.0e-0821.77Show/hide
Query:  VPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVG-STSAKELWLSLEKRY
        V L L+  NY +W+    ++  +  + GHID           SST T  TE      +W  RD  +   +  T++   L  ++    +A++LWLSLE  +
Subjt:  VPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATLSPLALAHVVG-STSAKELWLSLEKRY

Query:  SSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTMKKNTTA
             +  L+  + L +   +   SV ++  ++K L + L      I D  +++H LNGL++ +      I+ +S   +  +  ++L  EE  +   + +
Subjt:  SSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTLEDLHALLNAEEKTMKKNTTA

Query:  DIIPTTMAAFKD----------------HSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFG--PQQTFGPNHMGHVF
         +  T   +  +                H+     GRGR     RG               G  SDGR  ++N  +    P   +GP    +++
Subjt:  DIIPTTMAAFKD----------------HSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFG--PQQTFGPNHMGHVF

ATMG00810.1 DNA/RNA polymerases superfamily protein3.5e-1638.94Show/hide
Query:  YLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYR
        YLLLYVDDI++T S  T + +L  QL++ F + DLG + YFLG+++ +   G+F++Q KY   +L  +GM   K   TP+   ++       + D + +R
Subjt:  YLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIFVNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYR

Query:  QLVDSLQYLTFTR
         +V +LQYLT TR
Subjt:  QLVDSLQYLTFTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATGGCCTCCTCTTCAGCAATGGTTATCTCCTCCACCAACTCATCAATGGCTTCCCCTCTATTTCTTCTATCTAATATTTGTAACTTGGTTCCACTTCGA
TTGGACTCGAGCAATTATGTCCTTTGGAAGTATCAAGTCTCTTCAATTCTGAAAGCCCACTCCTTATTTGGGCATATCGACGAATCTCTTCCTCAACCACACAAG
TTTGTTCAATCATCCACCGGCACCGATACAACGGAGATCAGTAATGATTATCTTCAATGGAATTCTCGCGATCAAGCTCTCATCACCCTACTCAACGCCACCTTG
TCTCCCTTGGCCTTGGCTCACGTTGTTGGATCAACCTCTGCAAAGGAACTTTGGCTTTCATTAGAAAAGCGATACTCGTCCAAAACTCGCTCCAACATTCTTGAG
TTAAGATCTGCTCTCTACTCTATTAAGAAACTTTCTTCTGAATCTGTTGAAAAGTTTTCTTGCAGAATCAAAGATCTAGTTGAAAAACTTGCAGCAGCCTCAGTT
AAGATTGATGATGAAGAAATATTAGTCCATACCTTGAATGGATTATCGGACGACTTCAGCGCTTTCCGGACGTCGATTCGAACTCGCAGTGGCTCCCTAACTCTT
GAAGATCTTCATGCTCTTCTTAATGCTGAAGAGAAAACGATGAAGAAGAACACTACTGCCGACATCATACCCACGACAATGGCGGCATTCAAAGATCATTCTCCA
ACCGGCGGTCGTGGACGAGGACGACGAAACCTCCATGGCAGGGGACGTGCGGCGCAACCCTTTGCTTCCAGCCCTAATTTTCCTTTTCACGGGTCAATATCAGAC
GGCAGGGGAGCTCATTCCAATTTCCGAAAATTTGGGCCACAACAGACCTTTGGGCCGAATCACATGGGCCACGTTTTTGGGCAGCAATATTCTCTCTCAAATGGG
TCAAACAGTCGTGGCATCCTTGGATCAGGTTCAAACTCGGGTCAAATAATTGAAAATGGGTCAGCTTTCGGGTCTGCATATAATAATTCACATGGAGGTCGAATT
TCATGTCAAATTTGCTCCAAATTTGGTCATAGTGCACTTGATTGTTACAATCGGATGAATTTCTCTTATCAGGGGCGACATCCTCCAACCCAATTAGCAGCTATG
GCGTGGATGCAATGTTCATATGAAAATGATTTGGCCAATCTCAATTTATCGAACAACTATAATGGAGAAGAATCGGTCACAGTGGGTGCTAGCATCAATGGTCTC
TATCTGATTCCCAGCCCTTCAGCTCTATCTAATTCCAGTATGCAACCCAAAAATTTTAATTTTCATGCAAAACAGGAGTGTTCTCTGTGGCATCAACGGTTTGGA
CATCCTGCTCCTAAGACTTTACATTCTAGTTTGTCTCGTCTTGTATTTCTAATAAATAGAATGCCTTCTTCATCCCTAAATATGTCTTCTCCGTTTGAAACACTT
TTTGGCTACACTCCTGATTTGCACCATTTAAAAGTTTTTGGGTGTGCATGTTATCCTTTACTAAAACCAACCAGACAGCATAAACTACAACCTAAAACATCTCAG
CATATTTTCCTTGGATATCCTCTTGATTTCAAAGGTTGCATATGTTATAATCCTCACACTAAGAAATTCATTGTCTCTCATCATGTCATCTTCCATGAAACTATT
TTTCCTTTTAGCCAACTCTCTTCCAAACCAAACATTTCTACCACTTCTACCACTTCTACCACTAGTGATCCAACTCTATTACTCACCCTCCTAAACTTATCTACA
ACACCGTCTCAAAATCCCTGCTCTGTACCTCTCACTAATCCTATCCCATTACCTACTACCGGCCGTACTAATCCATCCATAGTTGAATCACACGATAATATTACT
TGTACTGATGGCTCATGTCAGATTGCAGAAACTGTAGGTTCACTGCCCTCCATCGTTGAAACTACAAACAATGTGAACATTCATCCTATGCGAACTCGGGCGAAG
TCTAGAATTTTTAAGCCTAAAGCCTTCCTATCCACTACGGTATCTTCAGTACCTACTTATCCTACATCTTACACTGAAGCTTCTAAATACCCTGAATGGAGAAAT
GCCATGTGTGAGGAGTTCAATGCACTTCAACAACAAGAGTGGGGAACTGGAAACATAATCACACAAGATGGAATTCACTCATTCCCGACTCTAGGGAAAAGGTCT
TGGCCTTTACCTATATGTAGACCCTTGGGGTGGGGTGAAAGGCATCCTATTGTCTCTCATTCTCTCAGAGATTTCTCCCAAAAGACTCCCACAAGTCTCCTGCTT
CAAGAAGTCTTAGAGATATACCGGTGTAGCCGATTGGTGGTGTTCGCACAAGAGGGTTGTTGCGTTTTCATTCTTATTGGCAAGAAAAGACGAATTGATCAAGGC
TTTCTACAAAATCCAGTGGTTAAGAAGCCAACAATTAGGGTCATACTTACTCTAGCTGCTCAATATGGATGGCCCTTGAATCAGTTGGATGTAAAAAATGCATTC
CTACATGGCCACCTTAGGAAGAGTCTTTATGGTCTTAAACAGTCTCCTCGCGCTTGGTTTGATCGTTTCACAACGTACTTATTGACTTTGGGATTTGTTGCTTCT
GCTGCTGACCCGTCTTTATTTATACGTTCTGTTAACAAGTCCTTTACATATTTATTACTCTATGTGGACGATATTATTGTAACTAGATCTGATCTTACATATATT
TTTGTGCTACAGAAGCAACTAGCCACTGAATTTCAGATTTCAGATCTTGGAGCATTGAAGTACTTCCTTGGTTTAGAAGTTCACTCCTCCACCGATGGAATTTTC
GTTAATCAGGCCAAGTATCTCAATGATCTGCTTCAAACATCAGGCATGGCAACTGCTAAGACATGCTTAACACCTATGTCCACCACAGTTGATTTATACGCTTTT
GCACCCTTATTCAAGGATGCAACTCTCTATCGTCAACTAGTCGATTCTCTTCAGTATCTCACATTCACAAGACGGACATTACCTTTGCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAATGGCCTCCTCTTCAGCAATGGTTATCTCCTCCACCAACTCATCAATGGCTTCCCCTCTATTTCTTCTATCTAATATTTGTAACTTGGTTCCACTTCGA
TTGGACTCGAGCAATTATGTCCTTTGGAAGTATCAAGTCTCTTCAATTCTGAAAGCCCACTCCTTATTTGGGCATATCGACGAATCTCTTCCTCAACCACACAAG
TTTGTTCAATCATCCACCGGCACCGATACAACGGAGATCAGTAATGATTATCTTCAATGGAATTCTCGCGATCAAGCTCTCATCACCCTACTCAACGCCACCTTG
TCTCCCTTGGCCTTGGCTCACGTTGTTGGATCAACCTCTGCAAAGGAACTTTGGCTTTCATTAGAAAAGCGATACTCGTCCAAAACTCGCTCCAACATTCTTGAG
TTAAGATCTGCTCTCTACTCTATTAAGAAACTTTCTTCTGAATCTGTTGAAAAGTTTTCTTGCAGAATCAAAGATCTAGTTGAAAAACTTGCAGCAGCCTCAGTT
AAGATTGATGATGAAGAAATATTAGTCCATACCTTGAATGGATTATCGGACGACTTCAGCGCTTTCCGGACGTCGATTCGAACTCGCAGTGGCTCCCTAACTCTT
GAAGATCTTCATGCTCTTCTTAATGCTGAAGAGAAAACGATGAAGAAGAACACTACTGCCGACATCATACCCACGACAATGGCGGCATTCAAAGATCATTCTCCA
ACCGGCGGTCGTGGACGAGGACGACGAAACCTCCATGGCAGGGGACGTGCGGCGCAACCCTTTGCTTCCAGCCCTAATTTTCCTTTTCACGGGTCAATATCAGAC
GGCAGGGGAGCTCATTCCAATTTCCGAAAATTTGGGCCACAACAGACCTTTGGGCCGAATCACATGGGCCACGTTTTTGGGCAGCAATATTCTCTCTCAAATGGG
TCAAACAGTCGTGGCATCCTTGGATCAGGTTCAAACTCGGGTCAAATAATTGAAAATGGGTCAGCTTTCGGGTCTGCATATAATAATTCACATGGAGGTCGAATT
TCATGTCAAATTTGCTCCAAATTTGGTCATAGTGCACTTGATTGTTACAATCGGATGAATTTCTCTTATCAGGGGCGACATCCTCCAACCCAATTAGCAGCTATG
GCGTGGATGCAATGTTCATATGAAAATGATTTGGCCAATCTCAATTTATCGAACAACTATAATGGAGAAGAATCGGTCACAGTGGGTGCTAGCATCAATGGTCTC
TATCTGATTCCCAGCCCTTCAGCTCTATCTAATTCCAGTATGCAACCCAAAAATTTTAATTTTCATGCAAAACAGGAGTGTTCTCTGTGGCATCAACGGTTTGGA
CATCCTGCTCCTAAGACTTTACATTCTAGTTTGTCTCGTCTTGTATTTCTAATAAATAGAATGCCTTCTTCATCCCTAAATATGTCTTCTCCGTTTGAAACACTT
TTTGGCTACACTCCTGATTTGCACCATTTAAAAGTTTTTGGGTGTGCATGTTATCCTTTACTAAAACCAACCAGACAGCATAAACTACAACCTAAAACATCTCAG
CATATTTTCCTTGGATATCCTCTTGATTTCAAAGGTTGCATATGTTATAATCCTCACACTAAGAAATTCATTGTCTCTCATCATGTCATCTTCCATGAAACTATT
TTTCCTTTTAGCCAACTCTCTTCCAAACCAAACATTTCTACCACTTCTACCACTTCTACCACTAGTGATCCAACTCTATTACTCACCCTCCTAAACTTATCTACA
ACACCGTCTCAAAATCCCTGCTCTGTACCTCTCACTAATCCTATCCCATTACCTACTACCGGCCGTACTAATCCATCCATAGTTGAATCACACGATAATATTACT
TGTACTGATGGCTCATGTCAGATTGCAGAAACTGTAGGTTCACTGCCCTCCATCGTTGAAACTACAAACAATGTGAACATTCATCCTATGCGAACTCGGGCGAAG
TCTAGAATTTTTAAGCCTAAAGCCTTCCTATCCACTACGGTATCTTCAGTACCTACTTATCCTACATCTTACACTGAAGCTTCTAAATACCCTGAATGGAGAAAT
GCCATGTGTGAGGAGTTCAATGCACTTCAACAACAAGAGTGGGGAACTGGAAACATAATCACACAAGATGGAATTCACTCATTCCCGACTCTAGGGAAAAGGTCT
TGGCCTTTACCTATATGTAGACCCTTGGGGTGGGGTGAAAGGCATCCTATTGTCTCTCATTCTCTCAGAGATTTCTCCCAAAAGACTCCCACAAGTCTCCTGCTT
CAAGAAGTCTTAGAGATATACCGGTGTAGCCGATTGGTGGTGTTCGCACAAGAGGGTTGTTGCGTTTTCATTCTTATTGGCAAGAAAAGACGAATTGATCAAGGC
TTTCTACAAAATCCAGTGGTTAAGAAGCCAACAATTAGGGTCATACTTACTCTAGCTGCTCAATATGGATGGCCCTTGAATCAGTTGGATGTAAAAAATGCATTC
CTACATGGCCACCTTAGGAAGAGTCTTTATGGTCTTAAACAGTCTCCTCGCGCTTGGTTTGATCGTTTCACAACGTACTTATTGACTTTGGGATTTGTTGCTTCT
GCTGCTGACCCGTCTTTATTTATACGTTCTGTTAACAAGTCCTTTACATATTTATTACTCTATGTGGACGATATTATTGTAACTAGATCTGATCTTACATATATT
TTTGTGCTACAGAAGCAACTAGCCACTGAATTTCAGATTTCAGATCTTGGAGCATTGAAGTACTTCCTTGGTTTAGAAGTTCACTCCTCCACCGATGGAATTTTC
GTTAATCAGGCCAAGTATCTCAATGATCTGCTTCAAACATCAGGCATGGCAACTGCTAAGACATGCTTAACACCTATGTCCACCACAGTTGATTTATACGCTTTT
GCACCCTTATTCAAGGATGCAACTCTCTATCGTCAACTAGTCGATTCTCTTCAGTATCTCACATTCACAAGACGGACATTACCTTTGCAGTAA
Protein sequenceShow/hide protein sequence
MSMASSSAMVISSTNSSMASPLFLLSNICNLVPLRLDSSNYVLWKYQVSSILKAHSLFGHIDESLPQPHKFVQSSTGTDTTEISNDYLQWNSRDQALITLLNATL
SPLALAHVVGSTSAKELWLSLEKRYSSKTRSNILELRSALYSIKKLSSESVEKFSCRIKDLVEKLAAASVKIDDEEILVHTLNGLSDDFSAFRTSIRTRSGSLTL
EDLHALLNAEEKTMKKNTTADIIPTTMAAFKDHSPTGGRGRGRRNLHGRGRAAQPFASSPNFPFHGSISDGRGAHSNFRKFGPQQTFGPNHMGHVFGQQYSLSNG
SNSRGILGSGSNSGQIIENGSAFGSAYNNSHGGRISCQICSKFGHSALDCYNRMNFSYQGRHPPTQLAAMAWMQCSYENDLANLNLSNNYNGEESVTVGASINGL
YLIPSPSALSNSSMQPKNFNFHAKQECSLWHQRFGHPAPKTLHSSLSRLVFLINRMPSSSLNMSSPFETLFGYTPDLHHLKVFGCACYPLLKPTRQHKLQPKTSQ
HIFLGYPLDFKGCICYNPHTKKFIVSHHVIFHETIFPFSQLSSKPNISTTSTTSTTSDPTLLLTLLNLSTTPSQNPCSVPLTNPIPLPTTGRTNPSIVESHDNIT
CTDGSCQIAETVGSLPSIVETTNNVNIHPMRTRAKSRIFKPKAFLSTTVSSVPTYPTSYTEASKYPEWRNAMCEEFNALQQQEWGTGNIITQDGIHSFPTLGKRS
WPLPICRPLGWGERHPIVSHSLRDFSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCVFILIGKKRRIDQGFLQNPVVKKPTIRVILTLAAQYGWPLNQLDVKNAF
LHGHLRKSLYGLKQSPRAWFDRFTTYLLTLGFVASAADPSLFIRSVNKSFTYLLLYVDDIIVTRSDLTYIFVLQKQLATEFQISDLGALKYFLGLEVHSSTDGIF
VNQAKYLNDLLQTSGMATAKTCLTPMSTTVDLYAFAPLFKDATLYRQLVDSLQYLTFTRRTLPLQ