; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027799 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027799
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein conserved in the green lineage and diatoms 27, chloroplastic
Genome locationtig00153055:2694600..2695792
RNA-Seq ExpressionSgr027799
SyntenySgr027799
Gene Ontology termsGO:0009536 - plastid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009631 - CGLD27-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144055.1 uncharacterized protein ycf36 [Momordica charantia]2.1e-13590.57Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        MAGVMY +P  APPR FFPS+PN RP+ISPF NPRPLHLRNS  RISLS+SFRK NNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN
        ATGASFAL +GLP+ WFGT+GVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQ+LARDRLLGSYTVKPVLN
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN

Query:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCG
        RLKYTLV+LAASLFVSIV+LIN+DGGQLLGP FT KSAE DGGRVIPGVYSDESARSFEPDAFCG
Subjt:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCG

XP_038898549.1 uncharacterized protein ycf36 isoform X1 [Benincasa hispida]6.5e-12985.21Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        M GV+Y LPG+APPRTF PS PNSRP ISPF NP PLHLRNS  RISLS+SFRK +NMPPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY-----EETGWYDG-----QIWVKTAQILARDRLL
        ATGASFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY     EETGWYDG     QIWVKTAQ+LARDRLL
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY-----EETGWYDG-----QIWVKTAQILARDRLL

Query:  GSYTVKPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ
        GSYTVKPVL+RLKYTLVSLAASLFVSIV+L+N+DGGQLLGPFFT K A DDG GRVIPGVYSDESARSFEPDAFC  GE DL+Q
Subjt:  GSYTVKPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ

XP_038898550.1 uncharacterized protein ycf36 isoform X2 [Benincasa hispida]9.1e-13186.74Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        M GV+Y LPG+APPRTF PS PNSRP ISPF NP PLHLRNS  RISLS+SFRK +NMPPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY-----EETGWYDGQIWVKTAQILARDRLLGSYTV
        ATGASFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY     EETGWYDGQIWVKTAQ+LARDRLLGSYTV
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEY-----EETGWYDGQIWVKTAQILARDRLLGSYTV

Query:  KPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ
        KPVL+RLKYTLVSLAASLFVSIV+L+N+DGGQLLGPFFT K A DDG GRVIPGVYSDESARSFEPDAFC  GE DL+Q
Subjt:  KPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ

XP_038898551.1 uncharacterized protein ycf36 isoform X3 [Benincasa hispida]9.1e-13186.74Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        M GV+Y LPG+APPRTF PS PNSRP ISPF NP PLHLRNS  RISLS+SFRK +NMPPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDG-----QIWVKTAQILARDRLLGSYTV
        ATGASFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDG     QIWVKTAQ+LARDRLLGSYTV
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDG-----QIWVKTAQILARDRLLGSYTV

Query:  KPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ
        KPVL+RLKYTLVSLAASLFVSIV+L+N+DGGQLLGPFFT K A DDG GRVIPGVYSDESARSFEPDAFC  GE DL+Q
Subjt:  KPVLNRLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ

XP_038898552.1 uncharacterized protein ycf36 isoform X4 [Benincasa hispida]1.3e-13288.32Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        M GV+Y LPG+APPRTF PS PNSRP ISPF NP PLHLRNS  RISLS+SFRK +NMPPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN
        ATGASFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQ+LARDRLLGSYTVKPVL+
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN

Query:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ
        RLKYTLVSLAASLFVSIV+L+N+DGGQLLGPFFT K A DDG GRVIPGVYSDESARSFEPDAFC  GE DL+Q
Subjt:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDG-GRVIPGVYSDESARSFEPDAFC--GEPDLLQ

TrEMBL top hitse value%identityAlignment
A0A1S3BTX6 uncharacterized protein ycf362.7e-12887.59Show/hide
Query:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA
        MY L G APP TF  S+PNSRPLI PFTN RPLHLRNS  RISLS+SFRK +N+PPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLVATGA
Subjt:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA

Query:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY
        SFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQ+LARDRLLGSYTVKPVLNRLKY
Subjt:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY

Query:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAKS-AEDDGGRVIPGVYSDESARSFEPDAFCGEPDL
        TLVSLAASLFVSIV+LIN+DGG+LLGPFFT KS A +DGGRV+PG+YSDESARSFEPDAFCG  +L
Subjt:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAKS-AEDDGGRVIPGVYSDESARSFEPDAFCGEPDL

A0A5D3D958 DUF1230 domain-containing protein2.7e-12887.59Show/hide
Query:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA
        MY L G APP TF  S+PNSRPLI PFTN RPLHLRNS  RISLS+SFRK +N+PPETGCPVPPEQ PINEYQTLSTSFPFSWA+GDIVEYCSRLVATGA
Subjt:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA

Query:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY
        SFAL IGLPVAWFGT+GVESDPLKR LCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQ+LARDRLLGSYTVKPVLNRLKY
Subjt:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY

Query:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAKS-AEDDGGRVIPGVYSDESARSFEPDAFCGEPDL
        TLVSLAASLFVSIV+LIN+DGG+LLGPFFT KS A +DGGRV+PG+YSDESARSFEPDAFCG  +L
Subjt:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAKS-AEDDGGRVIPGVYSDESARSFEPDAFCGEPDL

A0A6J1CQK2 uncharacterized protein ycf361.0e-13590.57Show/hide
Query:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
        MAGVMY +P  APPR FFPS+PN RP+ISPF NPRPLHLRNS  RISLS+SFRK NNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV
Subjt:  MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLV

Query:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN
        ATGASFAL +GLP+ WFGT+GVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQ+LARDRLLGSYTVKPVLN
Subjt:  ATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLN

Query:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCG
        RLKYTLV+LAASLFVSIV+LIN+DGGQLLGP FT KSAE DGGRVIPGVYSDESARSFEPDAFCG
Subjt:  RLKYTLVSLAASLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCG

A0A6J1F7C1 uncharacterized protein ycf36 isoform X24.3e-12686.19Show/hide
Query:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA
        MY LPG+  P TFFPS+PN RPLISPF NPRP HLRNS  RISLS+SFRK NNMP +TGCPVPPEQ PINEYQTLSTSFPFSWA GDIVEYCSRLVATGA
Subjt:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA

Query:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY
        SFAL IGLPVAWFGT+GVESDPL RSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTA++LARDRLLGSYTVKPVL+RLKY
Subjt:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY

Query:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAK-SAEDDGGRVIPGVYSDESARSFEPDAFC--GEPDL
        TLVSLAASLFVSI++LIN+DG QLLGPFFT K +A+D   RVIPGVYSDESARSFEPDAFC  GE DL
Subjt:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAK-SAEDDGGRVIPGVYSDESARSFEPDAFC--GEPDL

A0A6J1FDI1 uncharacterized protein ycf36 isoform X14.3e-12686.19Show/hide
Query:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA
        MY LPG+  P TFFPS+PN RPLISPF NPRP HLRNS  RISLS+SFRK NNMP +TGCPVPPEQ PINEYQTLSTSFPFSWA GDIVEYCSRLVATGA
Subjt:  MYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGA

Query:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY
        SFAL IGLPVAWFGT+GVESDPL RSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTA++LARDRLLGSYTVKPVL+RLKY
Subjt:  SFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKY

Query:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAK-SAEDDGGRVIPGVYSDESARSFEPDAFC--GEPDL
        TLVSLAASLFVSI++LIN+DG QLLGPFFT K +A+D   RVIPGVYSDESARSFEPDAFC  GE DL
Subjt:  TLVSLAASLFVSIVLLINLDGGQLLGPFFTAK-SAEDDGGRVIPGVYSDESARSFEPDAFC--GEPDL

SwissProt top hitse value%identityAlignment
O78501 Uncharacterized protein ycf365.7e-1936.25Show/hide
Query:  CPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVE--SDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS
        CPVP  Q PINEY  L++++ FSWA         ++      F L + L +  F  I +       +  L ++    LF+ +  +R YLG+ Y+  RLL 
Subjt:  CPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVE--SDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS

Query:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI
        + + YEE+ WYDGQ+WVK    L +DRL+  YTV P+L+RLK   +S   +    I LL+
Subjt:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI

P48276 Uncharacterized protein ycf364.7e-2944.83Show/hide
Query:  TGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS
        T CP+P EQQP+NEYQ L+ S  F+W S  +  Y   L  T  S   ++   + ++  + +   P+   +  +  G   + + ++R+YLGW+Y+  RLLS
Subjt:  TGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS

Query:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTL
        ATV YEE+GWYDGQIWVK++++L +DRL+G Y V+PVLNRLK TL
Subjt:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTL

P51273 Uncharacterized protein ycf364.8e-2642.86Show/hide
Query:  CPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSAT
        CPVP EQQP+NEY +L  S+ F W +     Y  ++  T  +   ++  PV       +   PLK          L     ++R+YLGW+YV  RL+SAT
Subjt:  CPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSAT

Query:  VEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLA
        V YEE+GWYDGQIWVK ++IL +DR +G Y V P+LN++K TL  L+
Subjt:  VEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLA

Q1XDL3 Uncharacterized protein ycf365.3e-2539.87Show/hide
Query:  TGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS
        T CPVP EQQP++EY +L  S+ F W +     Y  ++        L++  P+       +   PLK       +  L     ++R+YLGW+YV  RL+S
Subjt:  TGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLS

Query:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVL
        ATV YEE+GWYDGQIWVK ++IL +DR +G Y V P+LN++K TL  L+  +   ++L
Subjt:  ATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVL

Q9FN15 Protein CONSERVED IN THE GREEN LINEAGE AND DIATOMS 27, chloroplastic5.9e-3244.44Show/hide
Query:  PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRL
        P +   VP +Q+P+NEY +L     +SW      E+  RL         V+G+PVA   +     +PL+  L A +  +  V++ V+R+YLGW+YVG+RL
Subjt:  PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRL

Query:  LSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI
        LSA + YEE+GWYDGQ+WVK  ++LARDRLLGSY VKPV+  LK TL+   A L  + VL +
Subjt:  LSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI

Arabidopsis top hitse value%identityAlignment
AT5G11840.1 Protein of unknown function (DUF1230)1.6e-8061.3Show/hide
Query:  SPNSRPLISPFTNPRPLHLRNSFPRISLSYS---FRKENNMP------PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIG
        SP +  L  P  + R LH  N  P+  L  S   F  EN+ P      PET CPVPPEQQPINEYQ+LSTSFPFSWASGD++EY +RL  TGASFA  +G
Subjt:  SPNSRPLISPFTNPRPLHLRNSFPRISLSYS---FRKENNMP------PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIG

Query:  LPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAA
        LPV+WFG+IG E +P+KR L A SSGI  VT+AVVRMYLGWAYVGNRLLSATVEYEETGWYDGQ+WVKT ++LARDRLLGS++VKPVL RLK TLV L  
Subjt:  LPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAA

Query:  SLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCGEP--DLL
           +S++L+INL    +   + T +   D     IPG Y+DE+AR+FEP+AFCGEP  DLL
Subjt:  SLFVSIVLLINLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCGEP--DLL

AT5G67370.1 Protein of unknown function (DUF1230)4.2e-3344.44Show/hide
Query:  PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRL
        P +   VP +Q+P+NEY +L     +SW      E+  RL         V+G+PVA   +     +PL+  L A +  +  V++ V+R+YLGW+YVG+RL
Subjt:  PETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVIGLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRL

Query:  LSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI
        LSA + YEE+GWYDGQ+WVK  ++LARDRLLGSY VKPV+  LK TL+   A L  + VL +
Subjt:  LSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGTCATGTATACACTCCCGGGACAAGCTCCACCGCGGACTTTCTTCCCCTCCTCTCCCAATTCCCGGCCACTGATTTCGCCGTTCACAAACCCTAGACCCCT
CCACCTCCGGAATTCCTTTCCCAGAATATCCCTCTCGTACTCCTTCCGAAAAGAAAACAACATGCCACCAGAAACAGGCTGCCCTGTTCCGCCGGAACAGCAGCCGATTA
ACGAGTATCAGACACTCTCCACCTCATTCCCCTTCTCCTGGGCTTCAGGAGACATCGTTGAGTATTGTTCTCGATTGGTCGCCACCGGCGCTTCCTTCGCGCTCGTTATC
GGGCTTCCCGTTGCTTGGTTCGGCACCATCGGGGTGGAATCGGATCCCCTGAAACGATCTCTTTGCGCTGTTTCGAGTGGGATTTTGTTCGTCACGATTGCGGTTGTGAG
GATGTATCTCGGTTGGGCTTACGTCGGAAACCGCCTCCTCAGTGCGACTGTCGAATACGAAGAGACCGGGTGGTACGACGGCCAGATATGGGTAAAAACCGCCCAAATAT
TAGCCCGCGACCGCCTCCTCGGTTCCTACACTGTGAAGCCGGTGCTGAATAGGCTAAAATACACTCTCGTGAGCCTGGCGGCGTCTCTGTTTGTATCCATTGTTCTGCTC
ATCAACCTTGATGGAGGGCAACTGCTGGGGCCTTTCTTCACCGCCAAATCTGCAGAAGATGACGGCGGCAGAGTCATACCCGGCGTTTATAGCGACGAATCCGCCAGATC
TTTCGAGCCTGATGCGTTCTGTGGGGAACCTGATCTTCTTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGTCATGTATACACTCCCGGGACAAGCTCCACCGCGGACTTTCTTCCCCTCCTCTCCCAATTCCCGGCCACTGATTTCGCCGTTCACAAACCCTAGACCCCT
CCACCTCCGGAATTCCTTTCCCAGAATATCCCTCTCGTACTCCTTCCGAAAAGAAAACAACATGCCACCAGAAACAGGCTGCCCTGTTCCGCCGGAACAGCAGCCGATTA
ACGAGTATCAGACACTCTCCACCTCATTCCCCTTCTCCTGGGCTTCAGGAGACATCGTTGAGTATTGTTCTCGATTGGTCGCCACCGGCGCTTCCTTCGCGCTCGTTATC
GGGCTTCCCGTTGCTTGGTTCGGCACCATCGGGGTGGAATCGGATCCCCTGAAACGATCTCTTTGCGCTGTTTCGAGTGGGATTTTGTTCGTCACGATTGCGGTTGTGAG
GATGTATCTCGGTTGGGCTTACGTCGGAAACCGCCTCCTCAGTGCGACTGTCGAATACGAAGAGACCGGGTGGTACGACGGCCAGATATGGGTAAAAACCGCCCAAATAT
TAGCCCGCGACCGCCTCCTCGGTTCCTACACTGTGAAGCCGGTGCTGAATAGGCTAAAATACACTCTCGTGAGCCTGGCGGCGTCTCTGTTTGTATCCATTGTTCTGCTC
ATCAACCTTGATGGAGGGCAACTGCTGGGGCCTTTCTTCACCGCCAAATCTGCAGAAGATGACGGCGGCAGAGTCATACCCGGCGTTTATAGCGACGAATCCGCCAGATC
TTTCGAGCCTGATGCGTTCTGTGGGGAACCTGATCTTCTTCAATGA
Protein sequenceShow/hide protein sequence
MAGVMYTLPGQAPPRTFFPSSPNSRPLISPFTNPRPLHLRNSFPRISLSYSFRKENNMPPETGCPVPPEQQPINEYQTLSTSFPFSWASGDIVEYCSRLVATGASFALVI
GLPVAWFGTIGVESDPLKRSLCAVSSGILFVTIAVVRMYLGWAYVGNRLLSATVEYEETGWYDGQIWVKTAQILARDRLLGSYTVKPVLNRLKYTLVSLAASLFVSIVLL
INLDGGQLLGPFFTAKSAEDDGGRVIPGVYSDESARSFEPDAFCGEPDLLQ