; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016048 (gene) of Snake gourd v1 genome

Gene IDTan0016048
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionENDO3c domain-containing protein
Genome locationLG05:1234835..1238856
RNA-Seq ExpressionTan0016048
SyntenyTan0016048
Gene Ontology termsGO:0006260 - DNA replication (biological process)
GO:0006281 - DNA repair (biological process)
GO:0080111 - DNA demethylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
GO:0035514 - DNA demethylase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR044811 - DNA glycosylase, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN46937.2 hypothetical protein Csa_020879 [Cucumis sativus]6.9e-26151.17Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP
        MAE  L     KHV  V MEP+ISTPLQQE KRKRP N   + T+R KKPR+KMYRPKVIGEG++RKSKG NT    +Q V     TP P TPN V+PK 
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP

Query:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL
        RTRKPR LPK +T+K V  +  S                                  EK          K  E DN V+AI SCR+LVLVENELE EK  
Subjt:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL

Query:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA
         EV   EAK+A  +N    V++T VG +E  C        + +ECK KRSS     KI E KPYGLR  R KG  SR KLLPFFF K+KR+PMVRRCNLA
Subjt:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA

Query:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------
        SLFA  VC QL RN HKHA  S+K E  N N IVPIVGW LK P      +        NC+SRDD   TK  E   QS                     
Subjt:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------

Query:  ------------------------------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIR
                                                         S+L++D RD S GTT+TD      V+ E TP KG AQT++   S  FSD+R
Subjt:  ------------------------------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIR

Query:  SEGTLWKIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------W
        SEG L K+GN+ H + +S HN + TM+WLDI HFLTNS+ L+ R+  N P   +SIPRII      GS G+  R QDFTL                   W
Subjt:  SEGTLWKIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------W

Query:  LTGETRANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKGTDSGNEKQIIPYARKRGKKTSNREHN
           E   NN+           GV  EE DS  K  LVPYAADG YNEA F+N+S  VGTNN+ GLSH + QK+GT S +EKQIIPYARK GKK S  EHN
Subjt:  LTGETRANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKGTDSGNEKQIIPYARKRGKKTSNREHN

Query:  PSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSV
        P+ + GMQGAIVPHPKSL STKKKE GRVYL+PRDIT+WK+L ++ S+  KE+ D   E+WW NER++FRGRI+AFNA MH+ILGDRRFSPWKGSVVDSV
Subjt:  PSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSV

Query:  VGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSI
        VGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL          +E+NKATGDPEEA+EL SA+DAISSQ   GSSI
Subjt:  VGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSI

Query:  KQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLC
        K++ D TLLSSTCLEDDCGTCL  NLD TDN  LHS+KST  +EPYSS+Q ST SCGSN++N+  E +E+  ++Q PIS NLN SDT+H  RS GKC+  
Subjt:  KQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLC

Query:  SKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIV
        ++CI KS+ GLENNAED   CE  + VDLQFT     Q  I S+ KFQNQEIQL GDV A+C LCS++N                 + K EAGSQ S  +
Subjt:  SKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIV

Query:  DLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
        D S  L D DVE+VQS ESV+  S+NTN  +++EK+  +  LE R+ N  NDEK+
Subjt:  DLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

XP_008458622.1 PREDICTED: protein ROS1-like isoform X1 [Cucumis melo]1.1e-26351.01Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK
        MAE  L     KHV  V MEP+ISTPLQQE KRKRPQNN  + T+R KKPRKKMYRPKVIGEG++RKSKG N TPAK       +P TP P TP+RV+PK
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK

Query:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA
          TRKPR LPK +T+K V  +  S                                  EK          K  E DNRV+AI SCR+LVLVENELE EK 
Subjt:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA

Query:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL
          EV   E KEA+++NS   V++T V  +E  C        + +ECK KRSS     KI ERKPYGLR  R KG  SR KLLPF F KRKR+PMVRRCNL
Subjt:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL

Query:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------
        ASLF   VC QL RN HKHA  S K E  N N IVPIVGW LK P   RN++K+Q        NC+S DD   TK  E   QS                 
Subjt:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------

Query:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT
                                                                                              S+L+++ RD S GT
Subjt:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT

Query:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR
        TITDV+ E   T+G +QT++  AS KFSD+RSEG L K+GN+ H ++L  HN + TM+WLDI HFLTNS+  + RS  N PE  +SIPRI       GS 
Subjt:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR

Query:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS
        G+  R QDF               TL    W   E   NN+          GV GEE DS  K  LVPYAADGRYNEA  RN+S  V TNN+ GLSH + 
Subjt:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS

Query:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR
        QK+ T S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPH KSL STKKKE+GRV L+PRDI +WK+L ++ S+  KE+ D   E+WW NER++FR
Subjt:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR

Query:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN
        GRI+AFNA MHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL         +E+N
Subjt:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN

Query:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD
        KATGDPEEA+EL S +DAISSQD  G SIK++ D TLLSS CLEDDCGTCLS NLD TDN  LHSNKST  +EPYSSSQ+STSSCGSNQ+N+  ES+E+ 
Subjt:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD

Query:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT
         ++Q PIS N N+SDTMH  RS GKC+  ++CI KS+ GLENNAED  RCE  I VDLQF     +Q    S  KFQNQEIQL GDV A+C LCS++N  
Subjt:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT

Query:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                       + K EAGSQ  S I D S+K+D DVEKVQSQESV Q S+NTN+ +++EK+ A+  LE  + N  NDEK+
Subjt:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

XP_011658415.1 uncharacterized protein LOC101216331 isoform X1 [Cucumis sativus]2.3e-25649.33Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP
        MAE  L     KHV  V MEP+ISTPLQQE KRKRP N   + T+R KKPR+KMYRPKVIGEG++RKSKG NT    +Q V     TP P TPN V+PK 
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP

Query:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL
        RTRKPR LPK +T+K V  +  S                                  EK          K  E DN V+AI SCR+LVLVENELE EK  
Subjt:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL

Query:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA
         EV   EAK+A  +N    V++T VG +E  C        + +ECK KRSS     KI E KPYGLR  R KG  SR KLLPFFF K+KR+PMVRRCNLA
Subjt:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA

Query:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------
        SLFA  VC QL RN HKHA  S+K E  N N IVPIVGW LK P      +        NC+SRDD   TK  E   QS                     
Subjt:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------

Query:  ------------------------------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIR
                                                         S+L++D RD S GTT+TD      V+ E TP KG AQT++   S  FSD+R
Subjt:  ------------------------------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIR

Query:  SEGTLWKIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------W
        SEG L K+GN+ H + +S HN + TM+WLDI HFLTNS+ L+ R+  N P   +SIPRII      GS G+  R QDFTL                   W
Subjt:  SEGTLWKIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------W

Query:  LTGETRANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------
           E   NN+           GV  EE DS  K  LVPYAADG YNEA F+N+S  VGTNN+ GLSH                                 
Subjt:  LTGETRANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------

Query:  ------------DNSQKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEAD
                    D+ QK+GT S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPHPKSL STKKKE GRVYL+PRDIT+WK+L ++ S+  KE+ D
Subjt:  ------------DNSQKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEAD

Query:  INNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGY
           E+WW NER++FRGRI+AFNA MH+ILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  
Subjt:  INNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGY

Query:  FL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSS
        FL          +E+NKATGDPEEA+EL SA+DAISSQ   GSSIK++ D TLLSSTCLEDDCGTCL  NLD TDN  LHS+KST  +EPYSS+Q ST S
Subjt:  FL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSS

Query:  CGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLA
        CGSN++N+  E +E+  ++Q PIS NLN SDT+H  RS GKC+  ++CI KS+ GLENNAED   CE  + VDLQFT     Q  I S+ KFQNQEIQL 
Subjt:  CGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLA

Query:  GDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
        GDV A+C LCS++N                 + K EAGSQ S  +D S  L D DVE+VQS ESV+  S+NTN  +++EK+  +  LE R+ N  NDEK+
Subjt:  GDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

XP_016902242.1 PREDICTED: protein ROS1-like isoform X2 [Cucumis melo]3.9e-26451.19Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK
        MAE  L     KHV  V MEP+ISTPLQQE KRKRPQNN  + T+R KKPRKKMYRPKVIGEG++RKSKG N TPAK       +P TP P TP+RV+PK
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK

Query:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA
          TRKPR LPK +T+K V  +  S                                  EK          K  E DNRV+AI SCR+LVLVENELE EK 
Subjt:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA

Query:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL
          EV   E KEA+++NS   V++T V  +E  C        + +ECK KRSS     KI ERKPYGLR  R KG  SR KLLPF F KRKR+PMVRRCNL
Subjt:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL

Query:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------
        ASLF   VC QL RN HKHA  S K E  N N IVPIVGW LK P   RN++K+Q        NC+S DD   TK  E   QS                 
Subjt:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------

Query:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT
                                                                                              S+L+++ RD S GT
Subjt:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT

Query:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR
        TITDV+ E   T+G +QT++  AS KFSD+RSEG L K+GN+ H ++L  HN + TM+WLDI HFLTNS+  + RS  N PE  +SIPRI       GS 
Subjt:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR

Query:  GSQIRQQDF---------------TLPQSKWLTGETRANNN------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKG
        G+  R QDF               TL    W   E   NN+      GV GEE DS  K  LVPYAADGRYNEA  RN+S  V TNN+ GLSH + QK+ 
Subjt:  GSQIRQQDF---------------TLPQSKWLTGETRANNN------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKG

Query:  TDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRID
        T S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPH KSL STKKKE+GRV L+PRDI +WK+L ++ S+  KE+ D   E+WW NER++FRGRI+
Subjt:  TDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRID

Query:  AFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESNKATG
        AFNA MHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL         +E+NKATG
Subjt:  AFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESNKATG

Query:  DPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQ
        DPEEA+EL S +DAISSQD  G SIK++ D TLLSS CLEDDCGTCLS NLD TDN  LHSNKST  +EPYSSSQ+STSSCGSNQ+N+  ES+E+  ++Q
Subjt:  DPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQ

Query:  IPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGG
         PIS N N+SDTMH  RS GKC+  ++CI KS+ GLENNAED  RCE  I VDLQF     +Q    S  KFQNQEIQL GDV A+C LCS++N      
Subjt:  IPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGG

Query:  IKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                   + K EAGSQ  S I D S+K+D DVEKVQSQESV Q S+NTN+ +++EK+ A+  LE  + N  NDEK+
Subjt:  IKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

XP_031742557.1 uncharacterized protein LOC101216331 isoform X2 [Cucumis sativus]2.5e-25849.58Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP
        MAE  L     KHV  V MEP+ISTPLQQE KRKRP N   + T+R KKPR+KMYRPKVIGEG++RKSKG NT    +Q V     TP P TPN V+PK 
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP

Query:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL
        RTRKPR LPK +T+K V  +  S                                  EK          K  E DN V+AI SCR+LVLVENELE EK  
Subjt:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL

Query:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA
         EV   EAK+A  +N    V++T VG +E  C        + +ECK KRSS     KI E KPYGLR  R KG  SR KLLPFFF K+KR+PMVRRCNLA
Subjt:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA

Query:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------
        SLFA  VC QL RN HKHA  S+K E  N N IVPIVGW LK P      +        NC+SRDD   TK  E   QS                     
Subjt:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------

Query:  ------------------------------------------------PSVLKSDDRDASVGTTITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLW
                                                         S+L++D RD S GTT+TDV+ E TP KG AQT++   S  FSD+RSEG L 
Subjt:  ------------------------------------------------PSVLKSDDRDASVGTTITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLW

Query:  KIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------WLTGETR
        K+GN+ H + +S HN + TM+WLDI HFLTNS+ L+ R+  N P   +SIPRII      GS G+  R QDFTL                   W   E  
Subjt:  KIGNVIHTEKLS-HNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------WLTGETR

Query:  ANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------------
         NN+           GV  EE DS  K  LVPYAADG YNEA F+N+S  VGTNN+ GLSH                                       
Subjt:  ANNN-----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------------

Query:  ------DNSQKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKW
              D+ QK+GT S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPHPKSL STKKKE GRVYL+PRDIT+WK+L ++ S+  KE+ D   E+W
Subjt:  ------DNSQKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKW

Query:  WANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----
        W NER++FRGRI+AFNA MH+ILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL    
Subjt:  WANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----

Query:  ------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQE
              +E+NKATGDPEEA+EL SA+DAISSQ   GSSIK++ D TLLSSTCLEDDCGTCL  NLD TDN  LHS+KST  +EPYSS+Q ST SCGSN++
Subjt:  ------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQE

Query:  NKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRAR
        N+  E +E+  ++Q PIS NLN SDT+H  RS GKC+  ++CI KS+ GLENNAED   CE  + VDLQFT     Q  I S+ KFQNQEIQL GDV A+
Subjt:  NKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRAR

Query:  CCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
        C LCS++N                 + K EAGSQ S  +D S  L D DVE+VQS ESV+  S+NTN  +++EK+  +  LE R+ N  NDEK+
Subjt:  CCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

TrEMBL top hitse value%identityAlignment
A0A0A0KB10 ENDO3c domain-containing protein9.4e-24846.36Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP
        MAE  L     KHV  V MEP+ISTPLQQE KRKRP N   + T+R KKPR+KMYRPKVIGEG++RKSKG NT    +Q V     TP P TPN V+PK 
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP

Query:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL
        RTRKPR LPK +T+K V  +  S                                  EK          K  E DN V+AI SCR+LVLVENELE EK  
Subjt:  RTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKAL

Query:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA
         EV   EAK+A  +N    V++T VG +E  C        + +ECK KRSS     KI E KPYGLR  R KG  SR KLLPFFF K+KR+PMVRRCNLA
Subjt:  AEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLA

Query:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------
        SLFA  VC QL RN HKHA  S+K E  N N IVPIVGW LK P      +        NC+SRDD   TK  E   QS                     
Subjt:  SLFA--VCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQS---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKLS-HNAK
                                  S+L++D RD S GTT+TD      V+ E TP KG AQT++   S  FSD+RSEG L K+GN+ H + +S HN +
Subjt:  -------------------------PSVLKSDDRDASVGTTITD------VHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKLS-HNAK

Query:  TTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------WLTGETRANNN-----------GV
         TM+WLDI HFLTNS+ L+ R+  N P   +SIPRII      GS G+  R QDFTL                   W   E   NN+           GV
Subjt:  TTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSK---------------WLTGETRANNN-----------GV

Query:  PGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------------------DNSQKKGTDSG
          EE DS  K  LVPYAADG YNEA F+N+S  VGTNN+ GLSH                                             D+ QK+GT S 
Subjt:  PGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSH---------------------------------------------DNSQKKGTDSG

Query:  NEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNA
        +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPHPKSL STKKKE GRVYL+PRDIT+WK+L ++ S+  KE+ D   E+WW NER++FRGRI+AFNA
Subjt:  NEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNA

Query:  RMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPE
         MH+ILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL          +E+NKATGDPE
Subjt:  RMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPE

Query:  EAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPI
        EA+EL SA+DAISSQ   GSSIK++ D TLLSSTCLEDDCGTCL  NLD TDN  LHS+KST  +EPYSS+Q ST SCGSN++N+  E +E+  ++Q PI
Subjt:  EAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPI

Query:  SENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKK
        S NLN SDT+H  RS GKC+  ++CI KS+ GLENNAED   CE  + VDLQFT     Q  I S+ KFQNQEIQL GDV A+C LCS++N         
Subjt:  SENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKK

Query:  TTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                + K EAGSQ S  +D S  L D DVE+VQS ESV+  S+NTN  +++EK+  +  LE R+ N  NDEK+
Subjt:  TTEIYNYYQEKTEAGSQPSPIVDLSKKL-DIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

A0A1S3C8A0 protein ROS1-like isoform X15.5e-26451.01Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK
        MAE  L     KHV  V MEP+ISTPLQQE KRKRPQNN  + T+R KKPRKKMYRPKVIGEG++RKSKG N TPAK       +P TP P TP+RV+PK
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK

Query:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA
          TRKPR LPK +T+K V  +  S                                  EK          K  E DNRV+AI SCR+LVLVENELE EK 
Subjt:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA

Query:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL
          EV   E KEA+++NS   V++T V  +E  C        + +ECK KRSS     KI ERKPYGLR  R KG  SR KLLPF F KRKR+PMVRRCNL
Subjt:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL

Query:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------
        ASLF   VC QL RN HKHA  S K E  N N IVPIVGW LK P   RN++K+Q        NC+S DD   TK  E   QS                 
Subjt:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------

Query:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT
                                                                                              S+L+++ RD S GT
Subjt:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT

Query:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR
        TITDV+ E   T+G +QT++  AS KFSD+RSEG L K+GN+ H ++L  HN + TM+WLDI HFLTNS+  + RS  N PE  +SIPRI       GS 
Subjt:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR

Query:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS
        G+  R QDF               TL    W   E   NN+          GV GEE DS  K  LVPYAADGRYNEA  RN+S  V TNN+ GLSH + 
Subjt:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS

Query:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR
        QK+ T S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPH KSL STKKKE+GRV L+PRDI +WK+L ++ S+  KE+ D   E+WW NER++FR
Subjt:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR

Query:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN
        GRI+AFNA MHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL         +E+N
Subjt:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN

Query:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD
        KATGDPEEA+EL S +DAISSQD  G SIK++ D TLLSS CLEDDCGTCLS NLD TDN  LHSNKST  +EPYSSSQ+STSSCGSNQ+N+  ES+E+ 
Subjt:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD

Query:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT
         ++Q PIS N N+SDTMH  RS GKC+  ++CI KS+ GLENNAED  RCE  I VDLQF     +Q    S  KFQNQEIQL GDV A+C LCS++N  
Subjt:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT

Query:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                       + K EAGSQ  S I D S+K+D DVEKVQSQESV Q S+NTN+ +++EK+ A+  LE  + N  NDEK+
Subjt:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

A0A1S4E1Z0 protein ROS1-like isoform X21.9e-26451.19Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK
        MAE  L     KHV  V MEP+ISTPLQQE KRKRPQNN  + T+R KKPRKKMYRPKVIGEG++RKSKG N TPAK       +P TP P TP+RV+PK
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK

Query:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA
          TRKPR LPK +T+K V  +  S                                  EK          K  E DNRV+AI SCR+LVLVENELE EK 
Subjt:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA

Query:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL
          EV   E KEA+++NS   V++T V  +E  C        + +ECK KRSS     KI ERKPYGLR  R KG  SR KLLPF F KRKR+PMVRRCNL
Subjt:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL

Query:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------
        ASLF   VC QL RN HKHA  S K E  N N IVPIVGW LK P   RN++K+Q        NC+S DD   TK  E   QS                 
Subjt:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------

Query:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT
                                                                                              S+L+++ RD S GT
Subjt:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT

Query:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR
        TITDV+ E   T+G +QT++  AS KFSD+RSEG L K+GN+ H ++L  HN + TM+WLDI HFLTNS+  + RS  N PE  +SIPRI       GS 
Subjt:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR

Query:  GSQIRQQDF---------------TLPQSKWLTGETRANNN------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKG
        G+  R QDF               TL    W   E   NN+      GV GEE DS  K  LVPYAADGRYNEA  RN+S  V TNN+ GLSH + QK+ 
Subjt:  GSQIRQQDF---------------TLPQSKWLTGETRANNN------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKG

Query:  TDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRID
        T S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPH KSL STKKKE+GRV L+PRDI +WK+L ++ S+  KE+ D   E+WW NER++FRGRI+
Subjt:  TDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRID

Query:  AFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESNKATG
        AFNA MHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL         +E+NKATG
Subjt:  AFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESNKATG

Query:  DPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQ
        DPEEA+EL S +DAISSQD  G SIK++ D TLLSS CLEDDCGTCLS NLD TDN  LHSNKST  +EPYSSSQ+STSSCGSNQ+N+  ES+E+  ++Q
Subjt:  DPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQ

Query:  IPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGG
         PIS N N+SDTMH  RS GKC+  ++CI KS+ GLENNAED  RCE  I VDLQF     +Q    S  KFQNQEIQL GDV A+C LCS++N      
Subjt:  IPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGG

Query:  IKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                   + K EAGSQ  S I D S+K+D DVEKVQSQESV Q S+NTN+ +++EK+ A+  LE  + N  NDEK+
Subjt:  IKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

A0A5A7SVP4 Protein ROS1-like isoform X15.5e-26451.01Show/hide
Query:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK
        MAE  L     KHV  V MEP+ISTPLQQE KRKRPQNN  + T+R KKPRKKMYRPKVIGEG++RKSKG N TPAK       +P TP P TP+RV+PK
Subjt:  MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPN-TPAKHKQQVSPKPETPRPATPNRVLPK

Query:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA
          TRKPR LPK +T+K V  +  S                                  EK          K  E DNRV+AI SCR+LVLVENELE EK 
Subjt:  PRTRKPRALPKQKTQKFVVPKNKS----------------------------------EK--------STKLLEDDNRVNAIVSCRDLVLVENELENEKA

Query:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL
          EV   E KEA+++NS   V++T V  +E  C        + +ECK KRSS     KI ERKPYGLR  R KG  SR KLLPF F KRKR+PMVRRCNL
Subjt:  LAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNL

Query:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------
        ASLF   VC QL RN HKHA  S K E  N N IVPIVGW LK P   RN++K+Q        NC+S DD   TK  E   QS                 
Subjt:  ASLF--AVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVS---SKFNCNSRDDVESTKPKEFDSQS-----------------

Query:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT
                                                                                              S+L+++ RD S GT
Subjt:  -------------------------------------------------------------------------------------PSVLKSDDRDASVGT

Query:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR
        TITDV+ E   T+G +QT++  AS KFSD+RSEG L K+GN+ H ++L  HN + TM+WLDI HFLTNS+  + RS  N PE  +SIPRI       GS 
Subjt:  TITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKL-SHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSR

Query:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS
        G+  R QDF               TL    W   E   NN+          GV GEE DS  K  LVPYAADGRYNEA  RN+S  V TNN+ GLSH + 
Subjt:  GSQIRQQDF---------------TLPQSKWLTGETRANNN----------GVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS

Query:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR
        QK+ T S +EKQIIPYARK GKK S  EHNP+ + GMQGAIVPH KSL STKKKE+GRV L+PRDI +WK+L ++ S+  KE+ D   E+WW NER++FR
Subjt:  QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFR

Query:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN
        GRI+AFNA MHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPL      TENY G+E+FCIQQSTQR +  FL         +E+N
Subjt:  GRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL---------LESN

Query:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD
        KATGDPEEA+EL S +DAISSQD  G SIK++ D TLLSS CLEDDCGTCLS NLD TDN  LHSNKST  +EPYSSSQ+STSSCGSNQ+N+  ES+E+ 
Subjt:  KATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELD

Query:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT
         ++Q PIS N N+SDTMH  RS GKC+  ++CI KS+ GLENNAED  RCE  I VDLQF     +Q    S  KFQNQEIQL GDV A+C LCS++N  
Subjt:  CKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQF-----TQRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQT

Query:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD
                       + K EAGSQ  S I D S+K+D DVEKVQSQESV Q S+NTN+ +++EK+ A+  LE  + N  NDEK+
Subjt:  NNGGIKKTTEIYNYYQEKTEAGSQ-PSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKD

A0A6J1H5D5 protein ROS1-like2.5e-21650.64Show/hide
Query:  AEISLPFTN-EKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP
        AEI+ P  N +KH NG+ MEPTISTPL  E KRKRPQ     +  R+KK +KKMYRPKVIGEG+ RK        +  Q V PKP   RP    RV PKP
Subjt:  AEISLPFTN-EKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKP

Query:  RTRKPRALP----------KQKTQKFVVPKNKSEKSTKLLEDDNRV--NAIVSCRDLVLVENELENEKA----------LAEVGGTEAKEAEEDNSKSIV
        R +  + +           ++  Q  ++ K     S + L ++  +  NA VSC+DLVL+ NEL+NEKA           AEV   + KE+E D+SKS V
Subjt:  RTRKPRALP----------KQKTQKFVVPKNKSEKSTKLLEDDNRV--NAIVSCRDLVLVENELENEKA----------LAEVGGTEAKEAEEDNSKSIV

Query:  IQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLASLFAVCKQLLRNTHKHAAMS
        IQT VGS  RL EW  G   IPQ+C+ KRSS  RP KIT+R PYGLR    KG  SRN L PF +CKRKRSPMVRRCN+AS   VCKQL RNTHKHA+ +
Subjt:  IQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERKPYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLASLFAVCKQLLRNTHKHAAMS

Query:  KKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQSPSVLKSDDRDASVGTTITDVHNEGTPTKGSAQTAISPASFKFS
        KKA     NG         +  +    ++K ++       S    E     +   QS S L S+DR+ASV T I +          S QT +   +FKF+
Subjt:  KKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKEFDSQSPSVLKSDDRDASVGTTITDVHNEGTPTKGSAQTAISPASFKFS

Query:  DIRSEGTLWKIGNVIHTEKLSHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSKWLTGETRANNNGVP
         I  EGT WKIGN I  E LSHN K TMRWLDIR FLTN K +   +EKN     IS+PRII      GS  S IRQQDFT P S     ETR   N + 
Subjt:  DIRSEGTLWKIGNVIHTEKLSHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII-----AGSRGSQIRQQDFTLPQSKWLTGETRANNNGVP

Query:  GEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS-QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKK
           ++  R             N    R      G +NSTGL+H NS QK+GT SG+EKQI+PYARK+G KTS  EHN SYVHGM+GAIVPHP+SL STKK
Subjt:  GEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNS-QKKGTDSGNEKQIIPYARKRGKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKK

Query:  KEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAA
        K +GRV LDPRDIT+W L+TQ+ASD G E+ D++ EKWWA+EREIFR RIDAFNARMH+ILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAA
Subjt:  KEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAA

Query:  TFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLS
        TFPL ATRNHTE YQGQ+VFC QQSTQR KGYFL          LESNK TGD EE E+L SANDAI SQDF+GSS+KQSLD+TL SSTC +DDCG  L 
Subjt:  TFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFL----------LESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLS

Query:  TNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEG
        TNLDGTDNTVLHSNKST VQEPYSSS +STSS  SNQEN+ILES+E                                                      
Subjt:  TNLDGTDNTVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEG

Query:  TIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKLDIDVEKVQSQESVIQP-
         +EVDLQ T     Q SI S  +FQNQEIQL GDV        ++  +N+   +K T+I     + T   S+  P +D +  L    + ++   SV +P 
Subjt:  TIEVDLQFT-----QRSIDSLGKFQNQEIQLAGDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKLDIDVEKVQSQESVIQP-

Query:  SDNTNDTEDREKQGA
        S +  D+ D E  G+
Subjt:  SDNTNDTEDREKQGA

SwissProt top hitse value%identityAlignment
B8YIE8 Protein ROS1C2.2e-3152.24Show/hide
Query:  GAIVPHPKSLKSTKK-KEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQ
        GA+VP+       KK +   +V LD     +W LL   A+D   +  D++ E+WW  ERE+F+GR ++F ARM ++ GDRRFSPWKGSVVDSVVGVFLTQ
Subjt:  GAIVPHPKSLKSTKK-KEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQ

Query:  NVSDHLSSSAYMSLAATFPLRATRNHTENYQGQE
        NV+DHLSSSAYM+LAA+FP  +  N  +   GQ+
Subjt:  NVSDHLSSSAYMSLAATFPLRATRNHTENYQGQE

C7IW64 Protein ROS1A8.2e-3143.5Show/hide
Query:  HGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLT-QDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGV
        +G  G IVP    +K  +K+   +V LDP    +WKLL   D SD   E  D + EKW   ER+IF+GR+D+F ARMH++ GDRRFSPWKGSVVDSVVGV
Subjt:  HGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLT-QDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGV

Query:  FLTQNVSDHLSSSAYMSLAATFPLRATRN--------HTENYQGQEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLD
        FLTQNVSDHLSSSA+M+LAA FP++   +        HT +  G +   +  ++ +++G  L++    T     +   T   +  +S + LGSS    +D
Subjt:  FLTQNVSDHLSSSAYMSLAATFPLRATRN--------HTENYQGQEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLD

Q8LK56 Transcriptional activator DEMETER3.6e-3442.73Show/hide
Query:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF
        G K   +E N   ++   GA+VP+    +S K+K   +V +D     IW LL     +  G EE D   EKWW  ER +FRGR D+F ARMH++ GDRRF
Subjt:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF

Query:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT
        SPWKGSVVDSV+GVFLTQNVSDHLSSSA+MSLAA FP  L ++R    N +               E+   Q+  Q      +  S   +G  E+  +  
Subjt:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT

Query:  SANDAISSQDFLGSSIKQSLDETLLSS
         +N  I   +FL  SI Q+L+E +LSS
Subjt:  SANDAISSQDFLGSSIKQSLDETLLSS

Q9SJQ6 DNA glycosylase/AP lyase ROS15.3e-3032.57Show/hide
Query:  GMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFL
        G  GAIVP        K +   +V LD     +WKLL ++ +  G + +D    KWW  ER +FRGR D+F ARMH++ GDRRF+PWKGSVVDSVVGVFL
Subjt:  GMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFL

Query:  TQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQ-------QSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETL
        TQNVSDHLSSSA+MSLA+ FP+    +   +     +  IQ       ++      +           P+E ++   +N+   S   +  S  +S+D+T 
Subjt:  TQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQ-------QSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETL

Query:  LSSTCLEDDCGTCLSTNLDGTDN-----TVLHSNKSTLV--QEPYSSSQSSTSSCGSNQE-----------NKILESEELDCKNQIPISENLNASDTMHM
         S   ++ D     S  +D TD       +  S  S L       S +  +T   GS+ E            K+L+  ++  ++   +S N++  D    
Subjt:  LSSTCLEDDCGTCLSTNLDGTDN-----TVLHSNKSTLV--QEPYSSSQSSTSSCGSNQE-----------NKILESEELDCKNQIPISENLNASDTMHM

Query:  LRSF
        ++ F
Subjt:  LRSF

Q9SR66 DEMETER-like protein 23.4e-2951.16Show/hide
Query:  HGMQGAIVPHPKSLKSTK------KKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVD
        H  + A++ + KS +  K      KK+  +V LDP    +WKLL       G + +D    KWW  ER +F GR ++F ARM V+ G+R FSPWKGSVVD
Subjt:  HGMQGAIVPHPKSLKSTK------KKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVD

Query:  SVVGVFLTQNVSDHLSSSAYMSLAATFPL
        SVVGVFLTQNV+DH SSSAYM LAA FP+
Subjt:  SVVGVFLTQNVSDHLSSSAYMSLAATFPL

Arabidopsis top hitse value%identityAlignment
AT2G36490.1 demeter-like 13.8e-3132.57Show/hide
Query:  GMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFL
        G  GAIVP        K +   +V LD     +WKLL ++ +  G + +D    KWW  ER +FRGR D+F ARMH++ GDRRF+PWKGSVVDSVVGVFL
Subjt:  GMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFL

Query:  TQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQ-------QSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETL
        TQNVSDHLSSSA+MSLA+ FP+    +   +     +  IQ       ++      +           P+E ++   +N+   S   +  S  +S+D+T 
Subjt:  TQNVSDHLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQ-------QSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETL

Query:  LSSTCLEDDCGTCLSTNLDGTDN-----TVLHSNKSTLV--QEPYSSSQSSTSSCGSNQE-----------NKILESEELDCKNQIPISENLNASDTMHM
         S   ++ D     S  +D TD       +  S  S L       S +  +T   GS+ E            K+L+  ++  ++   +S N++  D    
Subjt:  LSSTCLEDDCGTCLSTNLDGTDN-----TVLHSNKSTLV--QEPYSSSQSSTSSCGSNQE-----------NKILESEELDCKNQIPISENLNASDTMHM

Query:  LRSF
        ++ F
Subjt:  LRSF

AT3G10010.1 demeter-like 22.4e-3051.16Show/hide
Query:  HGMQGAIVPHPKSLKSTK------KKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVD
        H  + A++ + KS +  K      KK+  +V LDP    +WKLL       G + +D    KWW  ER +F GR ++F ARM V+ G+R FSPWKGSVVD
Subjt:  HGMQGAIVPHPKSLKSTK------KKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVD

Query:  SVVGVFLTQNVSDHLSSSAYMSLAATFPL
        SVVGVFLTQNV+DH SSSAYM LAA FP+
Subjt:  SVVGVFLTQNVSDHLSSSAYMSLAATFPL

AT4G34060.1 demeter-like protein 31.3e-2351.79Show/hide
Query:  LKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSA
        +K+ KK    +V LDP  I  W +L  + S   +   D   E  W  EREIF+ RID F  RMH + G+R+F  WKGSVVDSVVGVFLTQN +D+LSS+A
Subjt:  LKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSA

Query:  YMSLAATFPLRA
        +MS+AA FP+ A
Subjt:  YMSLAATFPLRA

AT5G04560.1 HhH-GPD base excision DNA repair family protein2.5e-3542.73Show/hide
Query:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF
        G K   +E N   ++   GA+VP+    +S K+K   +V +D     IW LL     +  G EE D   EKWW  ER +FRGR D+F ARMH++ GDRRF
Subjt:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF

Query:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT
        SPWKGSVVDSV+GVFLTQNVSDHLSSSA+MSLAA FP  L ++R    N +               E+   Q+  Q      +  S   +G  E+  +  
Subjt:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT

Query:  SANDAISSQDFLGSSIKQSLDETLLSS
         +N  I   +FL  SI Q+L+E +LSS
Subjt:  SANDAISSQDFLGSSIKQSLDETLLSS

AT5G04560.2 HhH-GPD base excision DNA repair family protein2.5e-3542.73Show/hide
Query:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF
        G K   +E N   ++   GA+VP+    +S K+K   +V +D     IW LL     +  G EE D   EKWW  ER +FRGR D+F ARMH++ GDRRF
Subjt:  GKKTSNREHNPSYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASD-VGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRF

Query:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT
        SPWKGSVVDSV+GVFLTQNVSDHLSSSA+MSLAA FP  L ++R    N +               E+   Q+  Q      +  S   +G  E+  +  
Subjt:  SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFP--LRATRNHTENYQG-------------QEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELT

Query:  SANDAISSQDFLGSSIKQSLDETLLSS
         +N  I   +FL  SI Q+L+E +LSS
Subjt:  SANDAISSQDFLGSSIKQSLDETLLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAATATCACTTCCATTCACGAATGAGAAACATGTGAATGGAGTGGCAATGGAACCAACCATTTCAACTCCATTGCAGCAGGAGACTAAAAGAAAGAGGCCACA
GAACAACGAAACAGATTTAACCAGAAGAAAAAAGAAACCCCGAAAAAAAATGTACAGACCAAAAGTGATTGGAGAAGGTAAACGAAGAAAAAGCAAAGGACCAAATACGC
CTGCAAAACACAAACAACAAGTTAGCCCAAAGCCAGAGACGCCAAGACCAGCAACTCCAAATCGTGTTTTACCAAAACCAAGGACCCGAAAACCTCGTGCTTTACCGAAA
CAAAAGACCCAAAAATTTGTAGTTCCTAAGAACAAGTCTGAAAAGAGCACTAAGTTGTTGGAAGATGATAATCGAGTCAATGCTATCGTTAGTTGCAGAGATCTAGTTTT
AGTGGAGAATGAATTGGAAAATGAGAAAGCTTTAGCAGAAGTTGGTGGCACTGAAGCAAAGGAAGCCGAGGAAGATAATTCAAAGAGTATAGTTATTCAAACTGCAGTCG
GGTCGAACGAAAGGCTTTGTGAATGGTTCTGTGGAATATCACATATACCACAGGAATGCAAGGGAAAGAGGAGTTCCTGGGGAAGGCCAACAAAAATCACTGAGAGAAAA
CCATATGGATTGAGGCATGTAAGAGGTAAAGGCGGGACCTCAAGAAACAAACTACTGCCATTCTTCTTCTGTAAGAGAAAAAGGAGTCCAATGGTTAGAAGGTGCAACTT
AGCTTCACTATTTGCAGTTTGTAAACAGTTGCTGAGAAACACACATAAACACGCAGCCATGAGCAAAAAAGCAGAAAATTTTAATGAGAATGGCATTGTTCCCATTGTAG
GGTGGCTATTAAAGGGGCCCAATCAGTTTCGGAATCGCAAGAAAACTCAAGTCAGTTCAAAGTTCAACTGTAATAGCAGAGATGATGTTGAGAGCACAAAACCAAAAGAG
TTTGACAGTCAATCTCCTTCGGTGTTGAAGTCTGATGACAGAGATGCATCTGTGGGAACTACCATAACAGATGTCCACAACGAAGGCACACCAACAAAAGGGTCGGCTCA
AACAGCAATCTCTCCAGCTAGTTTCAAGTTTTCGGATATACGAAGTGAGGGGACTTTGTGGAAGATTGGGAATGTAATTCATACTGAAAAATTAAGTCACAATGCTAAGA
CAACAATGAGATGGCTTGACATCAGACATTTCTTAACTAATTCAAAGTTTCTCATTGATAGAAGTGAAAAGAACCTACCAGAGAAAAGCATTTCAATTCCTAGAATTATA
GCAGGTAGCAGAGGAAGCCAGATCAGGCAGCAGGACTTCACCCTTCCCCAGAGCAAATGGCTCACAGGTGAAACAAGAGCAAATAATAATGGTGTACCAGGAGAAGAAAC
AGATTCTTTCCGTAAGGAAGTTTTGGTTCCATATGCTGCTGATGGTCGTTACAACGAAGCACAGTTCAGAAATATTTCCACTTGTGTAGGCACTAATAACAGCACTGGTT
TATCACACGATAACTCACAGAAAAAAGGAACAGATTCGGGCAACGAGAAGCAAATTATTCCATATGCACGAAAAAGAGGAAAGAAAACCAGCAACCGAGAGCACAATCCA
AGTTATGTTCATGGCATGCAAGGTGCAATAGTTCCCCATCCTAAATCATTGAAGTCAACTAAGAAAAAAGAGATGGGTAGAGTCTACCTTGACCCAAGAGACATTACCAT
ATGGAAGCTACTCACCCAAGATGCAAGTGATGTTGGTAAAGAAGAAGCCGACATCAATAACGAAAAGTGGTGGGCAAATGAAAGGGAGATTTTCCGTGGACGGATAGATG
CTTTTAATGCTAGAATGCATGTAATTTTAGGTGATAGGCGTTTTTCACCATGGAAAGGATCGGTGGTGGATTCTGTGGTTGGCGTTTTTCTTACTCAAAATGTCTCCGAC
CACCTTTCAAGCTCTGCTTACATGTCACTTGCTGCAACGTTCCCCCTCCGGGCAACAAGAAACCATACAGAAAATTATCAAGGCCAGGAGGTTTTCTGCATTCAACAATC
GACTCAAAGAATCAAAGGATATTTTCTGTTGGAGAGTAATAAGGCAACAGGAGATCCTGAAGAAGCTGAAGAATTAACATCTGCGAATGATGCAATTTCATCTCAAGATT
TTCTGGGGTCATCAATAAAACAAAGCCTTGATGAAACACTTCTCTCGTCTACGTGCTTAGAAGATGATTGTGGAACGTGTTTATCTACGAATTTGGATGGCACTGATAAC
ACAGTGCTCCATTCTAATAAATCAACCCTTGTGCAAGAACCTTATAGCAGCAGTCAAAGTAGCACATCATCGTGTGGAAGCAATCAAGAAAATAAAATTTTAGAATCTGA
AGAGTTGGATTGCAAAAACCAAATTCCAATTTCTGAAAACTTAAATGCCAGTGACACAATGCATATGCTGAGATCATTTGGGAAGTGCCATTTATGCTCAAAATGCATTT
GCAAATCAAGGAGGGGACTTGAAAATAATGCTGAGGATTATAAACGTTGTGAAGGAACAATAGAAGTTGATTTACAGTTCACTCAGAGATCTATTGACAGTCTCGGAAAG
TTTCAAAATCAGGAGATTCAACTCGCTGGTGATGTCCGTGCTAGGTGTTGCTTGTGTTCCGACAATAATCAAACAAACAATGGAGGAATCAAGAAAACCACAGAGATCTA
TAATTACTATCAAGAAAAAACGGAAGCTGGTTCTCAGCCCTCGCCCATTGTCGATTTATCTAAGAAATTAGATATTGATGTAGAAAAGGTCCAAAGTCAGGAGAGTGTGA
TTCAACCAAGTGATAATACCAATGACACGGAAGATAGAGAAAAGCAAGGCGCACAAAAAAAATTGGAAGCCAGAGATTCTAATGATTTTAATGATGAAAAAGACAGAAAA
TTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAAATATCACTTCCATTCACGAATGAGAAACATGTGAATGGAGTGGCAATGGAACCAACCATTTCAACTCCATTGCAGCAGGAGACTAAAAGAAAGAGGCCACA
GAACAACGAAACAGATTTAACCAGAAGAAAAAAGAAACCCCGAAAAAAAATGTACAGACCAAAAGTGATTGGAGAAGGTAAACGAAGAAAAAGCAAAGGACCAAATACGC
CTGCAAAACACAAACAACAAGTTAGCCCAAAGCCAGAGACGCCAAGACCAGCAACTCCAAATCGTGTTTTACCAAAACCAAGGACCCGAAAACCTCGTGCTTTACCGAAA
CAAAAGACCCAAAAATTTGTAGTTCCTAAGAACAAGTCTGAAAAGAGCACTAAGTTGTTGGAAGATGATAATCGAGTCAATGCTATCGTTAGTTGCAGAGATCTAGTTTT
AGTGGAGAATGAATTGGAAAATGAGAAAGCTTTAGCAGAAGTTGGTGGCACTGAAGCAAAGGAAGCCGAGGAAGATAATTCAAAGAGTATAGTTATTCAAACTGCAGTCG
GGTCGAACGAAAGGCTTTGTGAATGGTTCTGTGGAATATCACATATACCACAGGAATGCAAGGGAAAGAGGAGTTCCTGGGGAAGGCCAACAAAAATCACTGAGAGAAAA
CCATATGGATTGAGGCATGTAAGAGGTAAAGGCGGGACCTCAAGAAACAAACTACTGCCATTCTTCTTCTGTAAGAGAAAAAGGAGTCCAATGGTTAGAAGGTGCAACTT
AGCTTCACTATTTGCAGTTTGTAAACAGTTGCTGAGAAACACACATAAACACGCAGCCATGAGCAAAAAAGCAGAAAATTTTAATGAGAATGGCATTGTTCCCATTGTAG
GGTGGCTATTAAAGGGGCCCAATCAGTTTCGGAATCGCAAGAAAACTCAAGTCAGTTCAAAGTTCAACTGTAATAGCAGAGATGATGTTGAGAGCACAAAACCAAAAGAG
TTTGACAGTCAATCTCCTTCGGTGTTGAAGTCTGATGACAGAGATGCATCTGTGGGAACTACCATAACAGATGTCCACAACGAAGGCACACCAACAAAAGGGTCGGCTCA
AACAGCAATCTCTCCAGCTAGTTTCAAGTTTTCGGATATACGAAGTGAGGGGACTTTGTGGAAGATTGGGAATGTAATTCATACTGAAAAATTAAGTCACAATGCTAAGA
CAACAATGAGATGGCTTGACATCAGACATTTCTTAACTAATTCAAAGTTTCTCATTGATAGAAGTGAAAAGAACCTACCAGAGAAAAGCATTTCAATTCCTAGAATTATA
GCAGGTAGCAGAGGAAGCCAGATCAGGCAGCAGGACTTCACCCTTCCCCAGAGCAAATGGCTCACAGGTGAAACAAGAGCAAATAATAATGGTGTACCAGGAGAAGAAAC
AGATTCTTTCCGTAAGGAAGTTTTGGTTCCATATGCTGCTGATGGTCGTTACAACGAAGCACAGTTCAGAAATATTTCCACTTGTGTAGGCACTAATAACAGCACTGGTT
TATCACACGATAACTCACAGAAAAAAGGAACAGATTCGGGCAACGAGAAGCAAATTATTCCATATGCACGAAAAAGAGGAAAGAAAACCAGCAACCGAGAGCACAATCCA
AGTTATGTTCATGGCATGCAAGGTGCAATAGTTCCCCATCCTAAATCATTGAAGTCAACTAAGAAAAAAGAGATGGGTAGAGTCTACCTTGACCCAAGAGACATTACCAT
ATGGAAGCTACTCACCCAAGATGCAAGTGATGTTGGTAAAGAAGAAGCCGACATCAATAACGAAAAGTGGTGGGCAAATGAAAGGGAGATTTTCCGTGGACGGATAGATG
CTTTTAATGCTAGAATGCATGTAATTTTAGGTGATAGGCGTTTTTCACCATGGAAAGGATCGGTGGTGGATTCTGTGGTTGGCGTTTTTCTTACTCAAAATGTCTCCGAC
CACCTTTCAAGCTCTGCTTACATGTCACTTGCTGCAACGTTCCCCCTCCGGGCAACAAGAAACCATACAGAAAATTATCAAGGCCAGGAGGTTTTCTGCATTCAACAATC
GACTCAAAGAATCAAAGGATATTTTCTGTTGGAGAGTAATAAGGCAACAGGAGATCCTGAAGAAGCTGAAGAATTAACATCTGCGAATGATGCAATTTCATCTCAAGATT
TTCTGGGGTCATCAATAAAACAAAGCCTTGATGAAACACTTCTCTCGTCTACGTGCTTAGAAGATGATTGTGGAACGTGTTTATCTACGAATTTGGATGGCACTGATAAC
ACAGTGCTCCATTCTAATAAATCAACCCTTGTGCAAGAACCTTATAGCAGCAGTCAAAGTAGCACATCATCGTGTGGAAGCAATCAAGAAAATAAAATTTTAGAATCTGA
AGAGTTGGATTGCAAAAACCAAATTCCAATTTCTGAAAACTTAAATGCCAGTGACACAATGCATATGCTGAGATCATTTGGGAAGTGCCATTTATGCTCAAAATGCATTT
GCAAATCAAGGAGGGGACTTGAAAATAATGCTGAGGATTATAAACGTTGTGAAGGAACAATAGAAGTTGATTTACAGTTCACTCAGAGATCTATTGACAGTCTCGGAAAG
TTTCAAAATCAGGAGATTCAACTCGCTGGTGATGTCCGTGCTAGGTGTTGCTTGTGTTCCGACAATAATCAAACAAACAATGGAGGAATCAAGAAAACCACAGAGATCTA
TAATTACTATCAAGAAAAAACGGAAGCTGGTTCTCAGCCCTCGCCCATTGTCGATTTATCTAAGAAATTAGATATTGATGTAGAAAAGGTCCAAAGTCAGGAGAGTGTGA
TTCAACCAAGTGATAATACCAATGACACGGAAGATAGAGAAAAGCAAGGCGCACAAAAAAAATTGGAAGCCAGAGATTCTAATGATTTTAATGATGAAAAAGACAGAAAA
TTCTAA
Protein sequenceShow/hide protein sequence
MAEISLPFTNEKHVNGVAMEPTISTPLQQETKRKRPQNNETDLTRRKKKPRKKMYRPKVIGEGKRRKSKGPNTPAKHKQQVSPKPETPRPATPNRVLPKPRTRKPRALPK
QKTQKFVVPKNKSEKSTKLLEDDNRVNAIVSCRDLVLVENELENEKALAEVGGTEAKEAEEDNSKSIVIQTAVGSNERLCEWFCGISHIPQECKGKRSSWGRPTKITERK
PYGLRHVRGKGGTSRNKLLPFFFCKRKRSPMVRRCNLASLFAVCKQLLRNTHKHAAMSKKAENFNENGIVPIVGWLLKGPNQFRNRKKTQVSSKFNCNSRDDVESTKPKE
FDSQSPSVLKSDDRDASVGTTITDVHNEGTPTKGSAQTAISPASFKFSDIRSEGTLWKIGNVIHTEKLSHNAKTTMRWLDIRHFLTNSKFLIDRSEKNLPEKSISIPRII
AGSRGSQIRQQDFTLPQSKWLTGETRANNNGVPGEETDSFRKEVLVPYAADGRYNEAQFRNISTCVGTNNSTGLSHDNSQKKGTDSGNEKQIIPYARKRGKKTSNREHNP
SYVHGMQGAIVPHPKSLKSTKKKEMGRVYLDPRDITIWKLLTQDASDVGKEEADINNEKWWANEREIFRGRIDAFNARMHVILGDRRFSPWKGSVVDSVVGVFLTQNVSD
HLSSSAYMSLAATFPLRATRNHTENYQGQEVFCIQQSTQRIKGYFLLESNKATGDPEEAEELTSANDAISSQDFLGSSIKQSLDETLLSSTCLEDDCGTCLSTNLDGTDN
TVLHSNKSTLVQEPYSSSQSSTSSCGSNQENKILESEELDCKNQIPISENLNASDTMHMLRSFGKCHLCSKCICKSRRGLENNAEDYKRCEGTIEVDLQFTQRSIDSLGK
FQNQEIQLAGDVRARCCLCSDNNQTNNGGIKKTTEIYNYYQEKTEAGSQPSPIVDLSKKLDIDVEKVQSQESVIQPSDNTNDTEDREKQGAQKKLEARDSNDFNDEKDRK
F