; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021432 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021432
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein POLLENLESS 3-LIKE 1-like
Genome locationChr05:9070703..9074110
RNA-Seq ExpressionHG10021432
SyntenyHG10021432
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0010277 - chlorophyllide a oxygenase [overall] activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR044961 - Tetratricopeptide repeat protein POLLENLESS 3/SULFUR DEFICIENCY-INDUCED 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445490.1 PREDICTED: uncharacterized protein LOC103488488 [Cucumis melo]0.0e+0080.3Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        M TNS KN FPCKGFSTPPPSWKS+PFR PKTAPFSE KRSSPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GTVFGGKRTKAARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDNVY+AEEYYRKALS ESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKE K FNST HEED++T  TITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TGK+G CVPQ+TAST+WTHDD++MY+NENS D D HWDC E KSIGAVNSSHN+LHCDKWS GCFIENLGK DSCI PIKI+G+RNQ  L R  DES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSLYSSP PAK +V+VPFTQPKNS WEFNNRW SKERRQQRKR RK LF NPS K++SF +GFVVDASSESEGT PTSNYKTKYRSAAPD VELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK
        FTQPRSC+W MN  G+ RKATECFRSL SSSSSRKLSFEPPTSTENIQ   DS+FGRSELSRAVSDE QDL  DW Q+SCGDIEYEEG   + Y  MKIK
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK

Query:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII
        EE   VDQKF+ NS TV GKKSWADMVEEEEE+SD+E+ED+ TEE +SSS  GQVNCF  NWS  SSD+GEFKF+DENLNSNILHQ +H PSSNQVEDI+
Subjt:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII

Query:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE
        KF SLEIKD   DS EVVS RN  VR      QQ MLES DN  +SPLP KDLTT  SC  GQEN LMRRNRLQVFHEI TVHQELE
Subjt:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE

XP_011659062.1 uncharacterized protein LOC105436130 isoform X1 [Cucumis sativus]0.0e+0080.1Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        M TNS KN F CKGFSTPPPSWK +PFR PKTAPFSE KR SPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GTVFGGKRTKAARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDN+Y+AEEYYRKALS ESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNST HEED++T  TITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TGK+G CVPQ+TAST+WT DDE+MY+NENS DDD HWDCYE KSIGAVNSSHN+LHCDKWSEGCFIENLGKTDSCI PIKI+G+RNQ GL R  DES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSL+SSP PAK +V+VPFTQPKNSFWEFNNRW SKER+QQRKR RK LF NPS K++SFD+GF+VD+SSESEGT PTSNYKTKYRSAAPD VELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSF-GRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSM-K
        FTQPRSC W MN   + RKATECFRSL SSSSSRKLSFEPPTSTENIQ   DS+F GR ELSRAVSDE QDL  DW Q+SCGDI+YEEG   + Y  M K
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSF-GRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSM-K

Query:  IKEEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVED
        IKEE I VDQK + NS TV GKKSWADMVEEEEE+SDDE+E+D TEE +SSS   QVNCF  NWS SS D+GEFKF+DENLNSNILHQNH  PSSNQ+ED
Subjt:  IKEEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVED

Query:  IIKFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFD--QQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEIT-VHQE
         IK  SLEIKD   DS EVVSSRN   R PLYFD  QQP LES DN C+SPLP KDLTT  SC  GQEN LMR NRLQVFHEIT VHQE
Subjt:  IIKFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFD--QQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEIT-VHQE

XP_011659066.1 uncharacterized protein LOC105436130 isoform X2 [Cucumis sativus]0.0e+0081.97Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        M TNS KN F CKGFSTPPPSWK +PFR PKTAPFSE KR SPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYKIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKA
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYKIE+GTVFGGKRTKAARSQGKKVQIT+EQEKSRVLGNLAWAFLQLDN+Y+AEEYYRKA
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYKIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKA

Query:  LSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGKAGPCVPQMTAST
        LS ESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNST HEED++T  TITSKN TGK+G CVPQ+TAST
Subjt:  LSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGKAGPCVPQMTAST

Query:  RWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDESFNCCSLYSSPIPAKINVD
        +WT DDE+MY+NENS DDD HWDCYE KSIGAVNSSHN+LHCDKWSEGCFIENLGKTDSCI PIKI+G+RNQ GL R  DESFNCCSL+SSP PAK +V+
Subjt:  RWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDESFNCCSLYSSPIPAKINVD

Query:  VPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVPFTQPRSCSWGMNGGGNPR
        VPFTQPKNSFWEFNNRW SKER+QQRKR RK LF NPS K++SFD+GF+VD+SSESEGT PTSNYKTKYRSAAPD VELEVPFTQPRSC W MN   + R
Subjt:  VPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVPFTQPRSCSWGMNGGGNPR

Query:  KATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSF-GRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSM-KIKEEHITVDQKFKDNSST
        KATECFRSL SSSSSRKLSFEPPTSTENIQ   DS+F GR ELSRAVSDE QDL  DW Q+SCGDI+YEEG   + Y  M KIKEE I VDQK + NS T
Subjt:  KATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSF-GRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSM-KIKEEHITVDQKFKDNSST

Query:  VGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDIIKFDSLEIKDGAKDSGE
        V GKKSWADMVEEEEE+SDDE+E+D TEE +SSS   QVNCF  NWS SS D+GEFKF+DENLNSNILHQNH  PSSNQ+ED IK  SLEIKD   DS E
Subjt:  VGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDIIKFDSLEIKDGAKDSGE

Query:  VVSSRNPAVRRPLYFD--QQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEIT-VHQE
        VVSSRN   R PLYFD  QQP LES DN C+SPLP KDLTT  SC  GQEN LMR NRLQVFHEIT VHQE
Subjt:  VVSSRNPAVRRPLYFD--QQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEIT-VHQE

XP_038894110.1 uncharacterized protein LOC120082846 isoform X1 [Benincasa hispida]0.0e+0088.18Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        MWTNS KNNFPCKGFSTPPPSWKSRPFRS KT+PFSERKRSSPNSANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP+RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAIRSFRHLCPYD+QESIDNVLIELYK                  IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDNVYVAE+YYRKALS ESDNNKKCNLAICLILTNRL EAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKES SFNSTGHEED+ TVT ITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNR-NQDGLLRSVDE
        KN T +AGPCVPQMT STRWTHDDEQMY+NENS+DD+ HWDCYE KS GAVNSSHN+LHCDKWSEGCFIENLGKTDSCILPIK +GNR NQDGLLR VDE
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNR-NQDGLLRSVDE

Query:  SFNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGT-GPTSNYKTKYRSAAPDPVELE
        SFNCCSLYSSPIPAK NV+VPFTQPKNSFWEFNNRWRSKERRQQRKRSRK LFENPSMKDQSFDNGFVVDASSESEGT GPTSNYKTKYRSAAPDP ELE
Subjt:  SFNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGT-GPTSNYKTKYRSAAPDPVELE

Query:  VPFTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIK
        VPFTQPRSCSWGMNGG + RKATECFRSL+SSSSSRKLSFEPPT+TENIQ   DS+FGRSELSRAVSDE QDLAADWK++SCGDI+Y EGA+ Y S+KIK
Subjt:  VPFTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIK

Query:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSS-NQVEDII
        EEH+TVDQKFKDNSSTVGGKKSWADMVEEEEEDSD EKE+DTEE +SSS RGQVNCFD NW SSSSD+GEFKF+DENLNSNILHQ + SPSS NQVEDII
Subjt:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSS-NQVEDII

Query:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEITVHQELEC
         FDSLEIKDGAKDSG+VV  RNPAVRRPLYFDQQPMLEST+NRC+SPLP KDLTT   CNSGQENNLMRRNRLQVFHEITVHQELEC
Subjt:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEITVHQELEC

XP_038894111.1 uncharacterized protein LOC120082846 isoform X2 [Benincasa hispida]0.0e+0084.5Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        MWTNS KNNFPCKGFSTPPPSWKSRPFRS KT+PFSERKRSSPNSANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP+RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAIRSFRHLCPYD+QESIDNVLIELYK                  IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDNVYVAE+YYRKALS ESDNNKKCNLAICLILTNRL EAKSLLQSVRASSG                              GHEED+ TVT ITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNR-NQDGLLRSVDE
        KN T +AGPCVPQMT STRWTHDDEQMY+NENS+DD+ HWDCYE KS GAVNSSHN+LHCDKWSEGCFIENLGKTDSCILPIK +GNR NQDGLLR VDE
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNR-NQDGLLRSVDE

Query:  SFNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGT-GPTSNYKTKYRSAAPDPVELE
        SFNCCSLYSSPIPAK NV+VPFTQPKNSFWEFNNRWRSKERRQQRKRSRK LFENPSMKDQSFDNGFVVDASSESEGT GPTSNYKTKYRSAAPDP ELE
Subjt:  SFNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGT-GPTSNYKTKYRSAAPDPVELE

Query:  VPFTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIK
        VPFTQPRSCSWGMNGG + RKATECFRSL+SSSSSRKLSFEPPT+TENIQ   DS+FGRSELSRAVSDE QDLAADWK++SCGDI+Y EGA+ Y S+KIK
Subjt:  VPFTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIK

Query:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSS-NQVEDII
        EEH+TVDQKFKDNSSTVGGKKSWADMVEEEEEDSD EKE+DTEE +SSS RGQVNCFD NW SSSSD+GEFKF+DENLNSNILHQ + SPSS NQVEDII
Subjt:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSS-NQVEDII

Query:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEITVHQELEC
         FDSLEIKDGAKDSG+VV  RNPAVRRPLYFDQQPMLEST+NRC+SPLP KDLTT   CNSGQENNLMRRNRLQVFHEITVHQELEC
Subjt:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEITVHQELEC

TrEMBL top hitse value%identityAlignment
A0A0A0LVU4 TPR_REGION domain-containing protein3.1e-30781.09Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        MWTN+SKNNFPCKGF TPPPSWKS PFRSPKTAPFSERKRSSPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLI+KDP+RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GT+FGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQL+N+YVAE+YYRKALS E+DNNKKCNLAIC ILTNRLTEAKSLLQSVRASSGGKP EESYAKSFERA HML EKESKSFNSTG+EED+   TTITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TG+ G CVPQ+ ASTRWTHDDEQMY+NENS+D D HWDC + KS+GAVNSSHN+LH DKW EGC IENLGKT SCI PIK++GNRN+D L R V+ES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSL++SP P K NV+VPFTQ KNSFWEFN RWRSKER+QQ+KR+RK LFENPS KDQSFD+GFVVD SSES+ T P SNYKTKYRSAAPD +ELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLA-ADWKQSSCGDIEYEEGAIRYNSMKIKE
        FTQPRSCSWGMNGGGN RK TECFRSLLS SSSRKLSFE PTSTEN QA TDS+ GRS+LSR +SDE QDLA  DWKQ+S GDIEYEEG I  +SMKI E
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLA-ADWKQSSCGDIEYEEGAIRYNSMKIKE

Query:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL
        EH+T+D KFK NS TVGGKKSWADMVEEEEEDSDD+ EDDTEET SSS RGQVNCFD NW SSSSD+ E+KF+DE L
Subjt:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL

A0A1S3BDK6 uncharacterized protein LOC103488457 isoform X12.2e-30080.21Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        MW N+ KNNFPCKGF TPPPSWKSRPFRSP+ APFSERKRSSPN ANK+D+FHVIHKVPAGDSPYVKAKQVQLI+KDP+RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GT+FGGKRTKAARSQGKKVQITIEQEK+RVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQL+NVYVAE+YYRKALS E+DNNKKCNLAIC ILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHML EKESKSFN TG+EED+ T TTITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TG+AG CVPQ  ASTRWTHDDEQMY+NENS+ +D HWDC + KSIGAVNSSHN+LH DKW EGC I+NLGKT S I P KI+GNRN+DGLLR V+ES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSLY+SP P K NV+VPFTQPKNSFWEFNNRWRSKE +QQ+KR+RK LFENPS KDQ+FD+GFVVD SSES+   P SNYK+KYRSAA + +ELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFE-PPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIKE
        FTQPRSCSWGMNGGGN RK  E FRSLLSSSSSRKLSFE P TSTEN QA TDS+ GRS+LSR +SDE QDLA D K++S GDIEYEEG I  ++MKI E
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFE-PPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIKE

Query:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL
        EHIT D KFK NS TVGGKKSWADMVEEEEEDSDD+ EDDTEET+SSS R QVNCFD NW SSSSD+ EFKF+DENL
Subjt:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL

A0A1S3BDQ0 uncharacterized protein LOC1034884880.0e+0080.3Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        M TNS KN FPCKGFSTPPPSWKS+PFR PKTAPFSE KRSSPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GTVFGGKRTKAARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDNVY+AEEYYRKALS ESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKE K FNST HEED++T  TITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TGK+G CVPQ+TAST+WTHDD++MY+NENS D D HWDC E KSIGAVNSSHN+LHCDKWS GCFIENLGK DSCI PIKI+G+RNQ  L R  DES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSLYSSP PAK +V+VPFTQPKNS WEFNNRW SKERRQQRKR RK LF NPS K++SF +GFVVDASSESEGT PTSNYKTKYRSAAPD VELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK
        FTQPRSC+W MN  G+ RKATECFRSL SSSSSRKLSFEPPTSTENIQ   DS+FGRSELSRAVSDE QDL  DW Q+SCGDIEYEEG   + Y  MKIK
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK

Query:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII
        EE   VDQKF+ NS TV GKKSWADMVEEEEE+SD+E+ED+ TEE +SSS  GQVNCF  NWS  SSD+GEFKF+DENLNSNILHQ +H PSSNQVEDI+
Subjt:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII

Query:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE
        KF SLEIKD   DS EVVS RN  VR      QQ MLES DN  +SPLP KDLTT  SC  GQEN LMRRNRLQVFHEI TVHQELE
Subjt:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE

A0A5A7V6P1 Protein POLLENLESS 3-LIKE 1-like2.2e-30080.21Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        MW N+ KNNFPCKGF TPPPSWKSRPFRSP+ APFSERKRSSPN ANK+D+FHVIHKVPAGDSPYVKAKQVQLI+KDP+RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GT+FGGKRTKAARSQGKKVQITIEQEK+RVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQL+NVYVAE+YYRKALS E+DNNKKCNLAIC ILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHML EKESKSFN TG+EED+ T TTITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TG+AG CVPQ  ASTRWTHDDEQMY+NENS+ +D HWDC + KSIGAVNSSHN+LH DKW EGC I+NLGKT S I P KI+GNRN+DGLLR V+ES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSLY+SP P K NV+VPFTQPKNSFWEFNNRWRSKE +QQ+KR+RK LFENPS KDQ+FD+GFVVD SSES+   P SNYK+KYRSAA + +ELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFE-PPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIKE
        FTQPRSCSWGMNGGGN RK  E FRSLLSSSSSRKLSFE P TSTEN QA TDS+ GRS+LSR +SDE QDLA D K++S GDIEYEEG I  ++MKI E
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFE-PPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGAIRYNSMKIKE

Query:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL
        EHIT D KFK NS TVGGKKSWADMVEEEEEDSDD+ EDDTEET+SSS R QVNCFD NW SSSSD+ EFKF+DENL
Subjt:  EHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENL

A0A5A7VD19 Protein POLLENLESS 3-LIKE 1-like0.0e+0080.3Show/hide
Query:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK
        M TNS KN FPCKGFSTPPPSWKS+PFR PKTAPFSE KRSSPN ANK+DLFHVIHKVPAGDSPYVKAKQVQLIDKDP RAVSLFWAAINAGDRVDSALK
Subjt:  MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALK

Query:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW
        DMAVVMKQLDRSDEAIEAI+SFRHLCPYD+QESIDNVLIELYK                  IE+GTVFGGKRTKAARSQGKKVQIT+EQEKSRVLGNLAW
Subjt:  DMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAW

Query:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS
        AFLQLDNVY+AEEYYRKALS ESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKE K FNST HEED++T  TITS
Subjt:  AFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITS

Query:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES
        KN TGK+G CVPQ+TAST+WTHDD++MY+NENS D D HWDC E KSIGAVNSSHN+LHCDKWS GCFIENLGK DSCI PIKI+G+RNQ  L R  DES
Subjt:  KNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSIGAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDES

Query:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP
        FNCCSLYSSP PAK +V+VPFTQPKNS WEFNNRW SKERRQQRKR RK LF NPS K++SF +GFVVDASSESEGT PTSNYKTKYRSAAPD VELEVP
Subjt:  FNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMKDQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVP

Query:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK
        FTQPRSC+W MN  G+ RKATECFRSL SSSSSRKLSFEPPTSTENIQ   DS+FGRSELSRAVSDE QDL  DW Q+SCGDIEYEEG   + Y  MKIK
Subjt:  FTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDESQDLAADWKQSSCGDIEYEEGA--IRYNSMKIK

Query:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII
        EE   VDQKF+ NS TV GKKSWADMVEEEEE+SD+E+ED+ TEE +SSS  GQVNCF  NWS  SSD+GEFKF+DENLNSNILHQ +H PSSNQVEDI+
Subjt:  EEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDD-TEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNSNILHQNHHSPSSNQVEDII

Query:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE
        KF SLEIKD   DS EVVS RN  VR      QQ MLES DN  +SPLP KDLTT  SC  GQEN LMRRNRLQVFHEI TVHQELE
Subjt:  KFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEI-TVHQELE

SwissProt top hitse value%identityAlignment
Q8GXU5 Protein SULFUR DEFICIENCY-INDUCED 12.2e-5549.41Show/hide
Query:  RSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCP
        RS K    +       N    ++LFHVIHKVP GD+PYV+AK  QLI+K+P  A+  FW AIN GDRVDSALKDMAVVMKQLDRS+EAIEAI+SFR  C 
Subjt:  RSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCP

Query:  YDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNK
         ++Q+S+DNVLI+LYK                  I +G  F GK TK ARS GKK Q+T++QE SR+LGNL WA++Q      AE  YRKA   E D NK
Subjt:  YDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNK

Query:  KCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKES
         CNLA+CLI   R  E + +L  V      + +     ++ +RA  +L+E ES
Subjt:  KCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKES

Q8L730 Protein SULFUR DEFICIENCY-INDUCED 22.9e-5251.66Show/hide
Query:  RKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNV
        ++R      +    ++V+HK+P GDSPYV+AK VQL++KD   A+ LFW AI A DRVDSALKDMA++MKQ +R++EAI+AI+SFR LC   AQES+DNV
Subjt:  RKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNV

Query:  LIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLIL
        LI+LYK                  I +G  F GK TK ARS GKK Q+T+E+E SR+LGNL WA++QL +   AE  YRKA   E D NK CNL  CLI 
Subjt:  LIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLIL

Query:  TNRLTEAKSLL
          +  EA+S+L
Subjt:  TNRLTEAKSLL

Q9FKV5 Protein POLLENLESS 3-LIKE 11.6e-6651.54Show/hide
Query:  GFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSD
        GF TPPPSW +   R     P SERKR SP   N+        +V  GDSPYV+AK  QL+ KDP RA+SLFWAAINAGDRVDSALKDM VV+KQL+R D
Subjt:  GFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSD

Query:  EAIEAIRSFRHLCPYDAQESIDNVLIELY------------------KIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEE
        E IEAI+SFR+LCP+++Q+SIDN+L+ELY                   +E+   +GG+   A RS  ++   TIEQEK+R+LGNLAW  LQL N  +AE+
Subjt:  EAIEAIRSFRHLCPYDAQESIDNVLIELY------------------KIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEE

Query:  YYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGK
        YYR ALS E DNNK CNLAICLI   R  EAKSLL+ V+ S G +   E + KSFERA+ MLAE+E     +T  ++ +  +T+  S N + +
Subjt:  YYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGK

Q9SD20 Protein POLLENLESS 3-LIKE 29.7e-6451.7Show/hide
Query:  FRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLC
        FR  K+AP S  K     S  +++ FH IHKVP GDSPYV+AK VQL++KDP RA+ LFW AINAGDRVDSALKDMA+VMKQ +R++EAIEAI+S R  C
Subjt:  FRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLC

Query:  PYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNN
           AQES+DN+L++LYK                  I++G  F GKRTK ARSQGKK Q+++EQE +R+LGNL WA +Q DN   AE+ YR+ALS   DNN
Subjt:  PYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNN

Query:  KKCNLAICLILTNRLTEAKSLLQSVR-ASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEE
        K CNL ICL+   R+ EAK  L+ V+ A   G    +S+ K++ERA  ML +  S+     G ++
Subjt:  KKCNLAICLILTNRLTEAKSLLQSVR-ASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEE

Q9SUC3 Protein POLLENLESS 35.7e-7255.72Show/hide
Query:  FSTPPPSWKSRPFRSPKTAPFSERKR---SSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR
        + TPPP   +R        P +ER+R   S  +S+ + D FH++HKVP+GDSPYV+AK  QLIDKDP RA+SLFW AINAGDRVDSALKDMAVVMKQL R
Subjt:  FSTPPPSWKSRPFRSPKTAPFSERKR---SSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR

Query:  SDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVA
        SDE IEAI+SFR+LC +++Q+SIDN+L+ELYK                  +E+G  FGG+ ++A R QGK V +TIEQEK+R+LGNL W  LQL N  +A
Subjt:  SDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVA

Query:  EEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPM-EESYAKSFERASHMLAEKESK
        E++YR+AL  E D NK CNLAICL+  +R+ EAKSLL  VR S       +E +AKS++RA  MLAE ESK
Subjt:  EEYYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPM-EESYAKSFERASHMLAEKESK

Arabidopsis top hitse value%identityAlignment
AT1G04770.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-5351.66Show/hide
Query:  RKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNV
        ++R      +    ++V+HK+P GDSPYV+AK VQL++KD   A+ LFW AI A DRVDSALKDMA++MKQ +R++EAI+AI+SFR LC   AQES+DNV
Subjt:  RKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCPYDAQESIDNV

Query:  LIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLIL
        LI+LYK                  I +G  F GK TK ARS GKK Q+T+E+E SR+LGNL WA++QL +   AE  YRKA   E D NK CNL  CLI 
Subjt:  LIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLIL

Query:  TNRLTEAKSLL
          +  EA+S+L
Subjt:  TNRLTEAKSLL

AT3G51280.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-6551.7Show/hide
Query:  FRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLC
        FR  K+AP S  K     S  +++ FH IHKVP GDSPYV+AK VQL++KDP RA+ LFW AINAGDRVDSALKDMA+VMKQ +R++EAIEAI+S R  C
Subjt:  FRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLC

Query:  PYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNN
           AQES+DN+L++LYK                  I++G  F GKRTK ARSQGKK Q+++EQE +R+LGNL WA +Q DN   AE+ YR+ALS   DNN
Subjt:  PYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNN

Query:  KKCNLAICLILTNRLTEAKSLLQSVR-ASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEE
        K CNL ICL+   R+ EAK  L+ V+ A   G    +S+ K++ERA  ML +  S+     G ++
Subjt:  KKCNLAICLILTNRLTEAKSLLQSVR-ASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEE

AT4G20900.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.4e-7052.61Show/hide
Query:  FSTPPPSWKSRPFRSPKTAPFSERKR---SSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR
        + TPPP   +R        P +ER+R   S  +S+ + D FH++HKVP+GDSPYV+AK  QLIDKDP RA+SLFW AINAGDRVDSALKDMAVVMKQL R
Subjt:  FSTPPPSWKSRPFRSPKTAPFSERKR---SSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDR

Query:  SDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVA
        SDE IEAI+SFR+LC +++Q+SIDN+L+ELYK                  +E+G  FGG+ ++A R QGK V +TIEQEK+R+LGNL W  LQL N  +A
Subjt:  SDEAIEAIRSFRHLCPYDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVA

Query:  EEYYR----------------KALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPM-EESYAKSFERASHMLAEKESK
        E++YR                +AL  E D NK CNLAICL+  +R+ EAKSLL  VR S       +E +AKS++RA  MLAE ESK
Subjt:  EEYYR----------------KALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPM-EESYAKSFERASHMLAEKESK

AT5G44330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-6751.54Show/hide
Query:  GFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSD
        GF TPPPSW +   R     P SERKR SP   N+        +V  GDSPYV+AK  QL+ KDP RA+SLFWAAINAGDRVDSALKDM VV+KQL+R D
Subjt:  GFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSD

Query:  EAIEAIRSFRHLCPYDAQESIDNVLIELY------------------KIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEE
        E IEAI+SFR+LCP+++Q+SIDN+L+ELY                   +E+   +GG+   A RS  ++   TIEQEK+R+LGNLAW  LQL N  +AE+
Subjt:  EAIEAIRSFRHLCPYDAQESIDNVLIELY------------------KIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEE

Query:  YYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGK
        YYR ALS E DNNK CNLAICLI   R  EAKSLL+ V+ S G +   E + KSFERA+ MLAE+E     +T  ++ +  +T+  S N + +
Subjt:  YYRKALSFESDNNKKCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGK

AT5G48850.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-5649.41Show/hide
Query:  RSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCP
        RS K    +       N    ++LFHVIHKVP GD+PYV+AK  QLI+K+P  A+  FW AIN GDRVDSALKDMAVVMKQLDRS+EAIEAI+SFR  C 
Subjt:  RSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLDRSDEAIEAIRSFRHLCP

Query:  YDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNK
         ++Q+S+DNVLI+LYK                  I +G  F GK TK ARS GKK Q+T++QE SR+LGNL WA++Q      AE  YRKA   E D NK
Subjt:  YDAQESIDNVLIELYK------------------IEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNK

Query:  KCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKES
         CNLA+CLI   R  E + +L  V      + +     ++ +RA  +L+E ES
Subjt:  KCNLAICLILTNRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGACGAACAGCAGCAAGAACAATTTTCCCTGCAAGGGATTTTCGACCCCACCGCCGTCGTGGAAATCGAGGCCGTTCCGGTCACCGAAAACGGCGCCATTCTCAGA
GAGGAAAAGATCGTCTCCTAATTCTGCGAACAAAAATGATCTTTTTCATGTCATTCACAAAGTTCCCGCCGGGGACTCTCCCTATGTTAAAGCTAAACAAGTTCAGTTGA
TAGACAAAGATCCGACTAGGGCTGTTTCTCTGTTTTGGGCGGCGATAAATGCCGGGGATCGAGTGGACAGTGCACTGAAAGACATGGCTGTAGTGATGAAGCAGCTGGAC
CGCTCTGATGAAGCGATTGAGGCGATCAGATCGTTTCGTCATCTCTGCCCTTATGACGCTCAGGAATCTATTGATAATGTCTTGATTGAATTATACAAGATCGAAGAAGG
CACGGTTTTTGGAGGGAAGAGGACGAAGGCTGCAAGATCCCAGGGGAAGAAGGTGCAAATTACTATTGAGCAAGAGAAATCAAGAGTTCTTGGCAACTTGGCCTGGGCTT
TCTTGCAGCTGGACAATGTCTATGTCGCTGAAGAGTATTACCGGAAAGCTTTGTCTTTTGAGTCGGATAACAACAAAAAATGCAATCTTGCGATATGTCTGATCCTTACG
AATCGGCTCACGGAAGCAAAGTCTCTGCTTCAGTCTGTAAGAGCTTCTTCTGGAGGCAAGCCCATGGAAGAGTCATATGCCAAATCATTTGAACGCGCGTCTCACATGCT
GGCTGAAAAAGAATCAAAGTCGTTCAATTCAACAGGACACGAAGAAGATGACAGCACCGTGACCACGATAACCTCAAAGAACATGACTGGTAAAGCTGGGCCTTGTGTTC
CTCAGATGACTGCATCCACAAGGTGGACTCATGATGATGAACAGATGTACATGAATGAAAATAGTCAGGACGACGATCGTCACTGGGACTGCTATGAGAAGAAGTCAATT
GGAGCTGTGAATTCTTCACATAATTTTCTGCATTGTGATAAATGGAGTGAAGGTTGTTTCATTGAAAATCTAGGAAAAACTGACTCCTGCATTCTTCCCATCAAAATAAG
GGGAAACCGAAACCAGGATGGTTTATTGAGATCAGTAGATGAGAGTTTCAACTGCTGCTCATTGTATTCATCTCCGATTCCAGCGAAAATAAATGTCGATGTTCCATTCA
CTCAACCGAAAAACTCCTTTTGGGAATTCAATAATCGATGGCGGTCGAAGGAAAGGAGGCAGCAGCGGAAAAGAAGTAGGAAAGCTTTGTTTGAGAATCCTTCAATGAAG
GATCAAAGTTTTGACAATGGCTTTGTTGTAGATGCTTCTTCTGAATCTGAAGGAACTGGACCGACTTCAAATTACAAGACAAAGTATAGGTCTGCAGCTCCTGATCCGGT
TGAACTGGAAGTTCCATTTACACAGCCAAGGAGTTGTTCATGGGGTATGAATGGAGGAGGAAATCCGAGAAAGGCGACCGAATGCTTCAGAAGTTTGCTCAGCAGTAGTT
CTAGTAGAAAACTTTCATTTGAGCCTCCCACAAGCACTGAAAATATTCAAGCACCAACTGATTCAAGCTTTGGAAGATCTGAACTTTCCAGAGCAGTGAGTGATGAATCT
CAAGATCTTGCTGCAGACTGGAAACAAAGTTCTTGTGGAGATATCGAGTATGAAGAAGGTGCAATTCGATACAACTCGATGAAGATAAAGGAAGAACACATTACTGTTGA
TCAAAAGTTCAAAGATAATTCATCAACAGTTGGTGGGAAGAAGAGTTGGGCAGATATGGTTGAAGAAGAGGAAGAAGACAGTGACGACGAGAAGGAAGACGATACAGAAG
AAACAACATCGTCAAGCAGAAGAGGTCAAGTCAATTGCTTCGACGGAAATTGGAGTAGCAGCAGCAGTGATGATGGCGAGTTCAAGTTCAGTGATGAAAATCTAAATTCC
AACATATTGCACCAGAACCACCATAGTCCAAGCAGCAATCAGGTGGAAGATATAATTAAATTTGATTCACTTGAGATAAAAGATGGAGCTAAAGACTCTGGCGAAGTCGT
TTCGTCAAGAAATCCAGCAGTACGGCGGCCTTTGTATTTTGACCAACAGCCTATGCTAGAGTCGACCGATAACCGCTGCTCCTCGCCACTGCCAATGAAAGATTTGACAA
CTGCGGCCTCTTGTAATTCTGGGCAGGAAAACAACTTGATGAGGAGAAACAGATTGCAAGTATTTCATGAGATAACAGTGCATCAAGAGCTAGAATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGACGAACAGCAGCAAGAACAATTTTCCCTGCAAGGGATTTTCGACCCCACCGCCGTCGTGGAAATCGAGGCCGTTCCGGTCACCGAAAACGGCGCCATTCTCAGA
GAGGAAAAGATCGTCTCCTAATTCTGCGAACAAAAATGATCTTTTTCATGTCATTCACAAAGTTCCCGCCGGGGACTCTCCCTATGTTAAAGCTAAACAAGTTCAGTTGA
TAGACAAAGATCCGACTAGGGCTGTTTCTCTGTTTTGGGCGGCGATAAATGCCGGGGATCGAGTGGACAGTGCACTGAAAGACATGGCTGTAGTGATGAAGCAGCTGGAC
CGCTCTGATGAAGCGATTGAGGCGATCAGATCGTTTCGTCATCTCTGCCCTTATGACGCTCAGGAATCTATTGATAATGTCTTGATTGAATTATACAAGATCGAAGAAGG
CACGGTTTTTGGAGGGAAGAGGACGAAGGCTGCAAGATCCCAGGGGAAGAAGGTGCAAATTACTATTGAGCAAGAGAAATCAAGAGTTCTTGGCAACTTGGCCTGGGCTT
TCTTGCAGCTGGACAATGTCTATGTCGCTGAAGAGTATTACCGGAAAGCTTTGTCTTTTGAGTCGGATAACAACAAAAAATGCAATCTTGCGATATGTCTGATCCTTACG
AATCGGCTCACGGAAGCAAAGTCTCTGCTTCAGTCTGTAAGAGCTTCTTCTGGAGGCAAGCCCATGGAAGAGTCATATGCCAAATCATTTGAACGCGCGTCTCACATGCT
GGCTGAAAAAGAATCAAAGTCGTTCAATTCAACAGGACACGAAGAAGATGACAGCACCGTGACCACGATAACCTCAAAGAACATGACTGGTAAAGCTGGGCCTTGTGTTC
CTCAGATGACTGCATCCACAAGGTGGACTCATGATGATGAACAGATGTACATGAATGAAAATAGTCAGGACGACGATCGTCACTGGGACTGCTATGAGAAGAAGTCAATT
GGAGCTGTGAATTCTTCACATAATTTTCTGCATTGTGATAAATGGAGTGAAGGTTGTTTCATTGAAAATCTAGGAAAAACTGACTCCTGCATTCTTCCCATCAAAATAAG
GGGAAACCGAAACCAGGATGGTTTATTGAGATCAGTAGATGAGAGTTTCAACTGCTGCTCATTGTATTCATCTCCGATTCCAGCGAAAATAAATGTCGATGTTCCATTCA
CTCAACCGAAAAACTCCTTTTGGGAATTCAATAATCGATGGCGGTCGAAGGAAAGGAGGCAGCAGCGGAAAAGAAGTAGGAAAGCTTTGTTTGAGAATCCTTCAATGAAG
GATCAAAGTTTTGACAATGGCTTTGTTGTAGATGCTTCTTCTGAATCTGAAGGAACTGGACCGACTTCAAATTACAAGACAAAGTATAGGTCTGCAGCTCCTGATCCGGT
TGAACTGGAAGTTCCATTTACACAGCCAAGGAGTTGTTCATGGGGTATGAATGGAGGAGGAAATCCGAGAAAGGCGACCGAATGCTTCAGAAGTTTGCTCAGCAGTAGTT
CTAGTAGAAAACTTTCATTTGAGCCTCCCACAAGCACTGAAAATATTCAAGCACCAACTGATTCAAGCTTTGGAAGATCTGAACTTTCCAGAGCAGTGAGTGATGAATCT
CAAGATCTTGCTGCAGACTGGAAACAAAGTTCTTGTGGAGATATCGAGTATGAAGAAGGTGCAATTCGATACAACTCGATGAAGATAAAGGAAGAACACATTACTGTTGA
TCAAAAGTTCAAAGATAATTCATCAACAGTTGGTGGGAAGAAGAGTTGGGCAGATATGGTTGAAGAAGAGGAAGAAGACAGTGACGACGAGAAGGAAGACGATACAGAAG
AAACAACATCGTCAAGCAGAAGAGGTCAAGTCAATTGCTTCGACGGAAATTGGAGTAGCAGCAGCAGTGATGATGGCGAGTTCAAGTTCAGTGATGAAAATCTAAATTCC
AACATATTGCACCAGAACCACCATAGTCCAAGCAGCAATCAGGTGGAAGATATAATTAAATTTGATTCACTTGAGATAAAAGATGGAGCTAAAGACTCTGGCGAAGTCGT
TTCGTCAAGAAATCCAGCAGTACGGCGGCCTTTGTATTTTGACCAACAGCCTATGCTAGAGTCGACCGATAACCGCTGCTCCTCGCCACTGCCAATGAAAGATTTGACAA
CTGCGGCCTCTTGTAATTCTGGGCAGGAAAACAACTTGATGAGGAGAAACAGATTGCAAGTATTTCATGAGATAACAGTGCATCAAGAGCTAGAATGTTAA
Protein sequenceShow/hide protein sequence
MWTNSSKNNFPCKGFSTPPPSWKSRPFRSPKTAPFSERKRSSPNSANKNDLFHVIHKVPAGDSPYVKAKQVQLIDKDPTRAVSLFWAAINAGDRVDSALKDMAVVMKQLD
RSDEAIEAIRSFRHLCPYDAQESIDNVLIELYKIEEGTVFGGKRTKAARSQGKKVQITIEQEKSRVLGNLAWAFLQLDNVYVAEEYYRKALSFESDNNKKCNLAICLILT
NRLTEAKSLLQSVRASSGGKPMEESYAKSFERASHMLAEKESKSFNSTGHEEDDSTVTTITSKNMTGKAGPCVPQMTASTRWTHDDEQMYMNENSQDDDRHWDCYEKKSI
GAVNSSHNFLHCDKWSEGCFIENLGKTDSCILPIKIRGNRNQDGLLRSVDESFNCCSLYSSPIPAKINVDVPFTQPKNSFWEFNNRWRSKERRQQRKRSRKALFENPSMK
DQSFDNGFVVDASSESEGTGPTSNYKTKYRSAAPDPVELEVPFTQPRSCSWGMNGGGNPRKATECFRSLLSSSSSRKLSFEPPTSTENIQAPTDSSFGRSELSRAVSDES
QDLAADWKQSSCGDIEYEEGAIRYNSMKIKEEHITVDQKFKDNSSTVGGKKSWADMVEEEEEDSDDEKEDDTEETTSSSRRGQVNCFDGNWSSSSSDDGEFKFSDENLNS
NILHQNHHSPSSNQVEDIIKFDSLEIKDGAKDSGEVVSSRNPAVRRPLYFDQQPMLESTDNRCSSPLPMKDLTTAASCNSGQENNLMRRNRLQVFHEITVHQELEC