; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026453 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026453
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description4Fe-4S ferredoxin-type domain-containing protein
Genome locationtig00153031:5540853..5553194
RNA-Seq ExpressionSgr026453
SyntenySgr026453
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016614 - oxidoreductase activity, acting on CH-OH group of donors (molecular function)
GO:0050660 - flavin adenine dinucleotide binding (molecular function)
InterPro domainsIPR000172 - Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 - Glucose-methanol-choline oxidoreductase, C-terminal
IPR017896 - 4Fe-4S ferredoxin-type, iron-sulphur binding domain
IPR017900 - 4Fe-4S ferredoxin, iron-sulphur binding, conserved site
IPR029058 - Alpha/Beta hydrolase fold
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025247.1 uncharacterized protein E6C27_scaffold541G00960 [Cucumis melo var. makuwa]0.0e+0069.71Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D M G  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSM +TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  +  ++    GGSLVNAGVM+PTPVL+RR+PNWPKEWERDW FCE+AA AMLKVQS+PIKFPSAKVL+EIVDEE  G FE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSI+FDLEESLSNS KIQQRG CLACGNC+AGCPYNAKSSTDKNYLLTAIQAGCVVHT CQVQYVVK+S NQEG TS+KRRWSVYLNE+DFI CDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGS APLN YGL REQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        +TTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLD  LPQK+NVFQRITKKLGG+LFI RYRSTSVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA
        VASDPSRGVCNA GQVFD + PASVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVSDILKY+ Q+GIELSAINDNKHS  KTN NRSQRSIVM 
Subjt:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA

Query:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK
        KETM+GYVGGMPCA+FLIMKMN E  KDF QSKESLGECHP LRGKVGGY EF  IEKDNLYIIDGEVNLCDTGCRTPFTQ+M Y LLLAASSG+RYILK
Subjt:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK

Query:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY
        GKKTLNPYLFGLYAW ETT L V IEK+ EN SM D  +L+GELSIS+LELLKSFLSL+G+++GQF+ LL++T +RTYILQIPR+T+KNSTP+G L+N  
Subjt:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY

Query:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI
          +SR EITTEDGI ISC KFS AQYPSRVR GK+ NPV+L+NGYSTESY+LPTEP DLARTLLGEGHD+WLLQSRLHPLNPSNDFTI DVGRFDIPAAI
Subjt:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI

Query:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------
        +KILEMDGSCRK            ++  + G +V+SS                                                               
Subjt:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------

Query:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---
                                     P+VH +  +E                       D     N     E  A  T           SP  +   
Subjt:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---

Query:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT
                PKFRHER+VV+G+GHSDLLIGEKSCKEVFPHI+SH+KLAE EGA+TG+A+KR SR EALSWSEDPHD    F T
Subjt:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT

XP_008462588.1 PREDICTED: uncharacterized protein LOC103500910 isoform X4 [Cucumis melo]0.0e+0069.97Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D M G  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSM +TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  +  ++    GGSLVNAGVM+PTPVL+RR+PNWPKEWERDW FCE+AA AMLKVQS+PIKFPSAKVL+EIVDEE  G FE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSINFDLEESLSNS KIQQRG CLACGNC+AGCPYNAKSSTDKNYLLTAIQAGCVVHT CQVQYVVK+S NQEG TSQKRRWSVYLNE+DFI CDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGS APLN YGL REQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        +TTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLD  LPQK+NVFQRITKKLGG+LFI RYRSTSVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA
        VASDPSRGVCNA GQVFD + PASVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVSDILKY+ Q+GIELSAINDNKHS  KTN NRSQRSIVM 
Subjt:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA

Query:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK
        KETM+GYVGGMPCA+FLIMKMN E  KDF QSKESLGECHPLLRGKVGGY EF  IEKDNLYIIDGEVNLCDTGCRTPFTQ+M Y LLLAASSG+RYILK
Subjt:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK

Query:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY
        GKKTLNPYLFGLYAW ETT L V IEK+ EN SM D  +L+GELSIS+LELLKSFLSL+G+++GQF+ LL++T +RTYILQIPR+T+KNSTP+G L+N  
Subjt:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY

Query:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI
          +SR EITTEDGI ISC KFS AQYPSRVR GK+ NPV+L+NGYSTESY+LPTEP DLARTLLGEGHD+WLLQSRLHPLNPSNDFTI DVGRFDIPAAI
Subjt:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI

Query:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------
        +KILEMDGSCRK            ++  + G +V+SS                                                               
Subjt:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------

Query:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---
                                     P+VH +  +E                       D     N     E  A  T           SP  +   
Subjt:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---

Query:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT
                PKFRHER+VV+G+GHSDLLIGEKSCKEVFPHI+SH+KLAE EGA+TG+A+KR SR EALSWSEDPHD    F T
Subjt:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT

XP_022132813.1 uncharacterized protein LOC111005575 isoform X1 [Momordica charantia]0.0e+0071.57Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D++CG GVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDF TDS K+TSAVR+ENRNLGLSFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  L  ++    GGSLVNAGVMLPTPV +RRNPNWPKEWE DWYFCEAAAAAMLKVQ  P KFPSAKVLEEI DEE  GSFE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SSVNLSINFDLEESLSNS+K+QQRG CLACGNCIAGCPYNAKSSTDKNYLLTA+QAGC VHTA QVQYVVK+  +QEG TS+K RWSVYLNE DF+TCDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VI+SAGVFGTTEILFRSQMRGLKVSEA+GCGFSCNGNAVAYLAGS APLNAYGLG+EQL KKAFHERPGPSISSSYT+SLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        ITTYGWPNGYWFFHGILD+LKQ+LSFKASQAIVLNAMG+DESDGKIMLQRDTDKMSFFPPLDP LPQKINVFQRITKKLGGILFIS YRS SVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM
        VASDPSRGVCNA GQVFDP K PASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAIND+KHSV KTNINR QR IVM
Subjt:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM

Query:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL
         KETMRGYVGGMPC VFL MKMNSEGQKD Y+SKESLGECHPLLRGKVGG+ EF AIEK+NLYIIDGEVNLCDT  RTPFTQ+MNYHLLLAASSGSRYIL
Subjt:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL

Query:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP
        KGKKTLNPYLFGLYAW ETT LHV +EK+ ENSSM D  +L+GELSIS+LE+LKSFLSL+GE+ GQF+ LL++TL+RTYILQIPR+  KNSTPLGCLKNP
Subjt:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP

Query:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA
        YE  SRYEI TEDGIIISC+KFS AQY SRV+G K+L PVLLVNGYS ESY LPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIED+GRFDIPAA
Subjt:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA

Query:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------
        I+KILE+DGSCRK                      +N+ +   +  +S                                                    
Subjt:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------

Query:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------
          P     T               W E   PS  +  Y++  T  P    P +                                               
Subjt:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------

Query:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT
                 P FRHER+VVDGFGHSDLLIGEKSCKEVFPHILSH+KLAEKEGA TGDA+KRYSR+ALSWSEDPHDG   F T
Subjt:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT

XP_022132814.1 uncharacterized protein LOC111005575 isoform X2 [Momordica charantia]0.0e+0071.49Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D++CG GVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDF TDS K+TSAVR+ENRNLGLSFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  L  ++    GGSLVNAGVMLPTPV +RRNPNWPKEWE DWYFCEAAAAAMLKVQ  P KFPSAKVLEEI DEE  GSFE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SSVNLSINFDLEESLSNS+K+QQRG CLACGNCIAGCPYNAKSSTDKNYLLTA+QAGC VHTA QVQYVVK+  +QEG TS+K RWSVYLNE DF+TCDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VI+SAGVFGTTEILFRSQMRGLKVSEA+GCGFSCNGNAVAYLAGS APLNAYGLG+EQL KKAFHERPGPSISSSYT+SLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        ITTYGWPNGYWFFHGILD+LKQ+LSFKASQAIVLNAMG+DESDGKIMLQRDTDKMSFFPPLDP LPQKINVFQRITKKLGGILFIS YRS SVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM
        VASDPSRGVCNA GQVFDP K PASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAIND+KHSV KTNINR QR IVM
Subjt:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM

Query:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL
         KETMRGYVGGMPC VFL MKMNSEGQKD Y+SKESLGECHPLLRGKVGG+ EF AIEK+NLYIIDGEVNLCDT  RTPFTQ+MNYHLLLAASSGSRYIL
Subjt:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL

Query:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP
        KGKKTLNPYLFGLYAW ETT LHV +EK+ ENSSM D  +L+GELSIS+LE+LKSFLSL+GE+ GQF+ LL++TL+RTYILQIPR+  KNSTPLGCLKNP
Subjt:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP

Query:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA
        YE  SRYEI T DGIIISC+KFS AQY SRV+G K+L PVLLVNGYS ESY LPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIED+GRFDIPAA
Subjt:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA

Query:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------
        I+KILE+DGSCRK                      +N+ +   +  +S                                                    
Subjt:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------

Query:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------
          P     T               W E   PS  +  Y++  T  P    P +                                               
Subjt:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------

Query:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT
                 P FRHER+VVDGFGHSDLLIGEKSCKEVFPHILSH+KLAEKEGA TGDA+KRYSR+ALSWSEDPHDG   F T
Subjt:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT

XP_038881939.1 uncharacterized protein LOC120073271 [Benincasa hispida]0.0e+0070.05Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D MCG  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMK+TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  L  ++    GGSLVNAGVMLPTPVL+R++PNWPKEWERDW FCEAAAAAMLKVQS+P+KFPSAKVLEEIVDEE  GSFE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSINFD+EESLS+S KIQQRG CLACGNC+AGCPYNAKSSTDKNYLL AIQAGCVVHT CQVQYVVK+S NQEG+TS++R+WSVYLNE+DFITCDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VILSAGVFGTTEILFRSQMRGLKVSE+LGCGFSCNGNAVAYLAGS APLNAYGL REQLWKK+FHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        ITTYGWPNGYWFFHGILDKLKQ+LSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLDP LPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA
        VASDPSRGVCNA GQVFDP  P SVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVS+ILKYK Q G++LSA NDNKHS+ KT INRSQ SIVM 
Subjt:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA

Query:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK
        KETM+GYVGGMPCA+FLIMKMNSEGQKDF QSK SLGECHPLLRGKVGGY EF AIEKDNLYIIDGEVNLCDTGCRTPFTQ+M YHLLLAASSGSRYILK
Subjt:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK

Query:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY
        GKKTLNPYLFGLYAW E T LHV +EK+ E SSM D  + +GELSIS+LELLKSFLSL+GE++GQF+ LL++T +RTYILQ PR+T+K+STP+G L+N Y
Subjt:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY

Query:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI
          +SR+EITTEDGI + CIKFS AQY SRV+ GK+ NPV+L+NGYSTESY+LPTEPTDL RTLLGEGHD+WLLQSRLHPLNPSNDFTI D+GRFDIPAAI
Subjt:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI

Query:  SKILEMDGSCRK----------------------ANSFINGQNVASSD----------------------------------------------------
        +KILEMDGSCRK                      +NS +   +  +S                                                     
Subjt:  SKILEMDGSCRK----------------------ANSFINGQNVASSD----------------------------------------------------

Query:  ---------------PNVHGYT-WQEQDPPSFGN-VKYQQETSAPKTDSPFVT-----------------------------------------------
                         + G T W E   PS  + +  +  T  P    P +                                                
Subjt:  ---------------PNVHGYT-WQEQDPPSFGN-VKYQQETSAPKTDSPFVT-----------------------------------------------

Query:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRY-SREALSWSEDPHDGISIFFT
                 KFRHER+VVDGFGHSDLLIGEKSCKEVFPHILSH+KLAEKEGA+TGDA+KRY S EALSWSEDPHDG   F T
Subjt:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRY-SREALSWSEDPHDGISIFFT

TrEMBL top hitse value%identityAlignment
A0A1S3CHC0 uncharacterized protein LOC103500910 isoform X40.0e+0069.97Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D M G  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSM +TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  +  ++    GGSLVNAGVM+PTPVL+RR+PNWPKEWERDW FCE+AA AMLKVQS+PIKFPSAKVL+EIVDEE  G FE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSINFDLEESLSNS KIQQRG CLACGNC+AGCPYNAKSSTDKNYLLTAIQAGCVVHT CQVQYVVK+S NQEG TSQKRRWSVYLNE+DFI CDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGS APLN YGL REQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        +TTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLD  LPQK+NVFQRITKKLGG+LFI RYRSTSVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA
        VASDPSRGVCNA GQVFD + PASVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVSDILKY+ Q+GIELSAINDNKHS  KTN NRSQRSIVM 
Subjt:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA

Query:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK
        KETM+GYVGGMPCA+FLIMKMN E  KDF QSKESLGECHPLLRGKVGGY EF  IEKDNLYIIDGEVNLCDTGCRTPFTQ+M Y LLLAASSG+RYILK
Subjt:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK

Query:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY
        GKKTLNPYLFGLYAW ETT L V IEK+ EN SM D  +L+GELSIS+LELLKSFLSL+G+++GQF+ LL++T +RTYILQIPR+T+KNSTP+G L+N  
Subjt:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY

Query:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI
          +SR EITTEDGI ISC KFS AQYPSRVR GK+ NPV+L+NGYSTESY+LPTEP DLARTLLGEGHD+WLLQSRLHPLNPSNDFTI DVGRFDIPAAI
Subjt:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI

Query:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------
        +KILEMDGSCRK            ++  + G +V+SS                                                               
Subjt:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------

Query:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---
                                     P+VH +  +E                       D     N     E  A  T           SP  +   
Subjt:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---

Query:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT
                PKFRHER+VV+G+GHSDLLIGEKSCKEVFPHI+SH+KLAE EGA+TG+A+KR SR EALSWSEDPHD    F T
Subjt:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT

A0A1S3CHT2 uncharacterized protein LOC103500910 isoform X10.0e+0068.63Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D M G  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSM +TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  +  ++    GGSLVNAGVM+PTPVL+RR+PNWPKEWERDW FCE+AA AMLKVQS+PIKFPSAKVL+EIVDEE  G FE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSINFDLEESLSNS KIQQRG CLACGNC+AGCPYNAKSSTDKNYLLTAIQAGCVVHT CQVQYVVK+S NQEG TSQKRRWSVYLNE+DFI CDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQ---------------
        VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGS APLN YGL REQLWKKAFHERPGPSISSSYTSSLGFTIQ               
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQ---------------

Query:  --------SAVLPSAYPNLLFKGITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITK
                SAVLPSAYPNLLFKG+TTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLD  LPQK+NVFQRITK
Subjt:  --------SAVLPSAYPNLLFKGITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITK

Query:  KLGGILFISRYRSTSVHHLGGCNVASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSA
        KLGG+LFI RYRSTSVHHLGGCNVASDPSRGVCNA GQVFD + PASVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVSDILKY+ Q+GIELSA
Subjt:  KLGGILFISRYRSTSVHHLGGCNVASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSA

Query:  INDNKHSVSKTNINRSQRSIVMAKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRT
        INDNKHS  KTN NRSQRSIVM KETM+GYVGGMPCA+FLIMKMN E  KDF QSKESLGECHPLLRGKVGGY EF  IEKDNLYIIDGEVNLCDTGCRT
Subjt:  INDNKHSVSKTNINRSQRSIVMAKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRT

Query:  PFTQFMNYHLLLAASSGSRYILKGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRT
        PFTQ+M Y LLLAASSG+RYILKGKKTLNPYLFGLYAW ETT L V IEK+ EN SM D  +L+GELSIS+LELLKSFLSL+G+++GQF+ LL++T +RT
Subjt:  PFTQFMNYHLLLAASSGSRYILKGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRT

Query:  YILQIPRMTHKNSTPLGCLKNPYECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRL
        YILQIPR+T+KNSTP+G L+N    +SR EITTEDGI ISC KFS AQYPSRVR GK+ NPV+L+NGYSTESY+LPTEP DLARTLLGEGHD+WLLQSRL
Subjt:  YILQIPRMTHKNSTPLGCLKNPYECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRL

Query:  HPLNPSNDFTIEDVGRFDIPAAISKILEMDGSCRK------------ANSFINGQNVASS----------------------------------------
        HPLNPSNDFTI DVGRFDIPAAI+KILEMDGSCRK            ++  + G +V+SS                                        
Subjt:  HPLNPSNDFTIEDVGRFDIPAAISKILEMDGSCRK------------ANSFINGQNVASS----------------------------------------

Query:  ---------------------------------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETS
                                                            P+VH +  +E                       D     N     E  
Subjt:  ---------------------------------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETS

Query:  APKT----------DSPFVT----------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGI
        A  T           SP  +           PKFRHER+VV+G+GHSDLLIGEKSCKEVFPHI+SH+KLAE EGA+TG+A+KR SR EALSWSEDPHD  
Subjt:  APKT----------DSPFVT----------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGI

Query:  SIFFT
          F T
Subjt:  SIFFT

A0A5A7SKF1 4Fe-4S ferredoxin-type domain-containing protein0.0e+0069.71Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D M G  V+NGFDAIV+GSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSM +TSAVR+ENRNLG+SFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  +  ++    GGSLVNAGVM+PTPVL+RR+PNWPKEWERDW FCE+AA AMLKVQS+PIKFPSAKVL+EIVDEE  G FE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SS+NLSI+FDLEESLSNS KIQQRG CLACGNC+AGCPYNAKSSTDKNYLLTAIQAGCVVHT CQVQYVVK+S NQEG TS+KRRWSVYLNE+DFI CDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGS APLN YGL REQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        +TTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMG+D+ DGKIMLQRDTDK+SFFPPLD  LPQK+NVFQRITKKLGG+LFI RYRSTSVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA
        VASDPSRGVCNA GQVFD + PASVHPGLYVCDASLIP SVGVNPSFTITIVSEHVSKHLVSDILKY+ Q+GIELSAINDNKHS  KTN NRSQRSIVM 
Subjt:  VASDPSRGVCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMA

Query:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK
        KETM+GYVGGMPCA+FLIMKMN E  KDF QSKESLGECHP LRGKVGGY EF  IEKDNLYIIDGEVNLCDTGCRTPFTQ+M Y LLLAASSG+RYILK
Subjt:  KETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILK

Query:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY
        GKKTLNPYLFGLYAW ETT L V IEK+ EN SM D  +L+GELSIS+LELLKSFLSL+G+++GQF+ LL++T +RTYILQIPR+T+KNSTP+G L+N  
Subjt:  GKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPY

Query:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI
          +SR EITTEDGI ISC KFS AQYPSRVR GK+ NPV+L+NGYSTESY+LPTEP DLARTLLGEGHD+WLLQSRLHPLNPSNDFTI DVGRFDIPAAI
Subjt:  ECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAI

Query:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------
        +KILEMDGSCRK            ++  + G +V+SS                                                               
Subjt:  SKILEMDGSCRK------------ANSFINGQNVASS---------------------------------------------------------------

Query:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---
                                     P+VH +  +E                       D     N     E  A  T           SP  +   
Subjt:  ----------------------------DPNVHGYTWQEQ----------------------DPPSFGNVKYQQETSAPKT----------DSPFVT---

Query:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT
                PKFRHER+VV+G+GHSDLLIGEKSCKEVFPHI+SH+KLAE EGA+TG+A+KR SR EALSWSEDPHD    F T
Subjt:  -------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSR-EALSWSEDPHDGISIFFT

A0A6J1BU62 uncharacterized protein LOC111005575 isoform X20.0e+0071.49Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D++CG GVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDF TDS K+TSAVR+ENRNLGLSFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  L  ++    GGSLVNAGVMLPTPV +RRNPNWPKEWE DWYFCEAAAAAMLKVQ  P KFPSAKVLEEI DEE  GSFE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SSVNLSINFDLEESLSNS+K+QQRG CLACGNCIAGCPYNAKSSTDKNYLLTA+QAGC VHTA QVQYVVK+  +QEG TS+K RWSVYLNE DF+TCDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VI+SAGVFGTTEILFRSQMRGLKVSEA+GCGFSCNGNAVAYLAGS APLNAYGLG+EQL KKAFHERPGPSISSSYT+SLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        ITTYGWPNGYWFFHGILD+LKQ+LSFKASQAIVLNAMG+DESDGKIMLQRDTDKMSFFPPLDP LPQKINVFQRITKKLGGILFIS YRS SVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM
        VASDPSRGVCNA GQVFDP K PASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAIND+KHSV KTNINR QR IVM
Subjt:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM

Query:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL
         KETMRGYVGGMPC VFL MKMNSEGQKD Y+SKESLGECHPLLRGKVGG+ EF AIEK+NLYIIDGEVNLCDT  RTPFTQ+MNYHLLLAASSGSRYIL
Subjt:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL

Query:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP
        KGKKTLNPYLFGLYAW ETT LHV +EK+ ENSSM D  +L+GELSIS+LE+LKSFLSL+GE+ GQF+ LL++TL+RTYILQIPR+  KNSTPLGCLKNP
Subjt:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP

Query:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA
        YE  SRYEI T DGIIISC+KFS AQY SRV+G K+L PVLLVNGYS ESY LPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIED+GRFDIPAA
Subjt:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA

Query:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------
        I+KILE+DGSCRK                      +N+ +   +  +S                                                    
Subjt:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------

Query:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------
          P     T               W E   PS  +  Y++  T  P    P +                                               
Subjt:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------

Query:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT
                 P FRHER+VVDGFGHSDLLIGEKSCKEVFPHILSH+KLAEKEGA TGDA+KRYSR+ALSWSEDPHDG   F T
Subjt:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT

A0A6J1BXD4 uncharacterized protein LOC111005575 isoform X10.0e+0071.57Show/hide
Query:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH
        ME+LKT D++CG GVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDF TDS K+TSAVR+ENRNLGLSFGPKDALFQ++          
Subjt:  MEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKLTSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIH

Query:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE
                         QN  L  ++    GGSLVNAGVMLPTPV +RRNPNWPKEWE DWYFCEAAAAAMLKVQ  P KFPSAKVLEEI DEE  GSFE
Subjt:  TYIDTGQSLSNRVLKPNQNSQLQDLSS--LGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSVPIKFPSAKVLEEIVDEESGGSFE

Query:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF
        SSVNLSINFDLEESLSNS+K+QQRG CLACGNCIAGCPYNAKSSTDKNYLLTA+QAGC VHTA QVQYVVK+  +QEG TS+K RWSVYLNE DF+TCDF
Subjt:  SSVNLSINFDLEESLSNSLKIQQRG-CLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSVYLNELDFITCDF

Query:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG
        VI+SAGVFGTTEILFRSQMRGLKVSEA+GCGFSCNGNAVAYLAGS APLNAYGLG+EQL KKAFHERPGPSISSSYT+SLGFTIQSAVLPSAYPNLLFKG
Subjt:  VILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLFKG

Query:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN
        ITTYGWPNGYWFFHGILD+LKQ+LSFKASQAIVLNAMG+DESDGKIMLQRDTDKMSFFPPLDP LPQKINVFQRITKKLGGILFIS YRS SVHHLGGCN
Subjt:  ITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCN

Query:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM
        VASDPSRGVCNA GQVFDP K PASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAIND+KHSV KTNINR QR IVM
Subjt:  VASDPSRGVCNAGGQVFDP-KGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVM

Query:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL
         KETMRGYVGGMPC VFL MKMNSEGQKD Y+SKESLGECHPLLRGKVGG+ EF AIEK+NLYIIDGEVNLCDT  RTPFTQ+MNYHLLLAASSGSRYIL
Subjt:  AKETMRGYVGGMPCAVFLIMKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYIL

Query:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP
        KGKKTLNPYLFGLYAW ETT LHV +EK+ ENSSM D  +L+GELSIS+LE+LKSFLSL+GE+ GQF+ LL++TL+RTYILQIPR+  KNSTPLGCLKNP
Subjt:  KGKKTLNPYLFGLYAWTETTKLHVTIEKIGENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNP

Query:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA
        YE  SRYEI TEDGIIISC+KFS AQY SRV+G K+L PVLLVNGYS ESY LPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIED+GRFDIPAA
Subjt:  YECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNPVLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAA

Query:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------
        I+KILE+DGSCRK                      +N+ +   +  +S                                                    
Subjt:  ISKILEMDGSCRK----------------------ANSFINGQNVASSD---------------------------------------------------

Query:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------
          P     T               W E   PS  +  Y++  T  P    P +                                               
Subjt:  --PNVHGYT---------------WQEQDPPSFGNVKYQQE-TSAPKTDSPFVT----------------------------------------------

Query:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT
                 P FRHER+VVDGFGHSDLLIGEKSCKEVFPHILSH+KLAEKEGA TGDA+KRYSR+ALSWSEDPHDG   F T
Subjt:  --------APKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFT

SwissProt top hitse value%identityAlignment
A0A0D3MU35 Protein ORANGE-GREEN, chloroplastic7.3e-9865.74Show/hide
Query:  RHRKSLNPPSRGTSRRSPLICSSSFNDAFGSVPSS-GDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDES
        + R+ L+   R  S  S    SSS +    SV S   D T ++FCIIEGPETVQDF +M+ QEIQ+NIRSRRNKIFL MEEVRRLRIQQRIK++E +  S
Subjt:  RHRKSLNPPSRGTSRRSPLICSSSFNDAFGSVPSS-GDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDES

Query:  DNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIE
          E   E+P+ PS IPFLP ++ + LK  Y T  S I+GII+FGGL+APTLELKLGLGGTSYEDFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E
Subjt:  DNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIE

Query:  ANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
         NNV+QQE KRCKYC GTGYLACARCS++G  +  +P+S      +PL +P T+RC NCSG+GKVMCPTCLCTGM MASEHDPR DPFD
Subjt:  ANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

A0A0D3MU50 Protein ORANGE-ORANGE, chloroplastic2.8e-9765.4Show/hide
Query:  RHRKSLNPPSRGTSRRSPLICSSSFNDAFGSVPSS-GDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDES
        + R+ L+   R  S  S    SSS +    SV S   D T ++FCIIEGPETVQDF +M+ QEIQ+NIRS RNKIFL MEEVRRLRIQQRIK++E +  S
Subjt:  RHRKSLNPPSRGTSRRSPLICSSSFNDAFGSVPSS-GDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDES

Query:  DNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIE
          E   E+P+ PS IPFLP ++ + LK  Y T  S I+GII+FGGL+APTLELKLGLGGTSYEDFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E
Subjt:  DNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIE

Query:  ANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
         NNV+QQE KRCKYC GTGYLACARCS++G  +  +P+S      +PL +P T+RC NCSG+GKVMCPTCLCTGM MASEHDPR DPFD
Subjt:  ANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

A2T1U1 Protein ORANGE, chloroplastic6.8e-9665.71Show/hide
Query:  SRGTSRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEM
        S G +RR     ++  +D+      S D   + FCIIEGPETVQDF +MQ QEIQDNIRSRRNKIFL MEEVRRLRIQQRI+++E  +IDE   E+ +E+
Subjt:  SRGTSRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEM

Query:  PDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQE
        P+ PS IPFLP +T   L+  Y T  S I+GII+FGGL+APTLELKLG+GGTSY+DFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E NNV+QQE
Subjt:  PDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQE

Query:  KKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPF
         KRCKYC GTGYLACARCSS+G  + ++P+S  A  +  +    T+RC NCSGAGKVMCPTCLCTGM MASEHDPR DPF
Subjt:  KKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPF

Q8VYD8 Protein ORANGE-LIKE, chloroplastic2.1e-11376.73Show/hide
Query:  SRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSS
        S RS L CS   N+     P SGD  P+NFCIIEG ETVQDFVQMQ QEIQDNIRSRRNKIFLLMEEVRRLR+QQRIKS + I+E    EA EMP+I SS
Subjt:  SRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSS

Query:  IPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKY
        IPFLP+VTPKTLKQLY TS++ ISGII FGGLIAP LELK+GLGGTSYEDFI ++HLP+QLSQVDPIVASFSGGAVGVIS LMLIE NNV+QQEKKRCKY
Subjt:  IPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKY

Query:  CHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        C GTGYL CARCS+SGVCLS DPI+    +++ ++V TT+RCLNCSGAGKVMCPTCLCTGM+ ASEHDPRFDPFD
Subjt:  CHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

Q9FKF4 Protein ORANGE, chloroplastic3.3e-9871.98Show/hide
Query:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT
        SS D   S FCIIEGPETVQDF +MQ QEIQDNIRSRRNKIFL MEEVRRLRIQQRIK++E  +I+E   E+ +E+P+ PS IPFLP +T   LK  Y T
Subjt:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT

Query:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC
          S I+GII+FGGL+APTLELKLG+GGTSY DFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E NNV+QQE KRCKYC GTGYLACARCSS+G  
Subjt:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC

Query:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        +  +P+S  A  +  L  P T+RC NCSGAGKVMCPTCLCTGM MASEHDPR DPFD
Subjt:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

Arabidopsis top hitse value%identityAlignment
AT2G36145.1 unknown protein1.8e-4373.77Show/hide
Query:  DPEDGVSLGTMKLPSDVDIARFEVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGD-GKTEVLVYIDCLVFPATTSTSPIFRAIRNGRLKDQS
        + EDGVSLGTMKLP D D+ARFE LLFQWANSLCQGANLPLPVPLKVD+I  G RLGFI + D GKT+V VYIDCLVF  TT    +F+A RNGR KD++
Subjt:  DPEDGVSLGTMKLPSDVDIARFEVLLFQWANSLCQGANLPLPVPLKVDKIPSGVRLGFITIGD-GKTEVLVYIDCLVFPATTSTSPIFRAIRNGRLKDQS

Query:  PPGEPRIMRSLLSALKKSVEIS
        PPGE RIMRSLL ALKK+VEI+
Subjt:  PPGEPRIMRSLLSALKKSVEIS

AT5G06130.1 chaperone protein dnaJ-related5.5e-10179.65Show/hide
Query:  MQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLG
        MQ QEIQDNIRSRRNKIFLLMEEVRRLR+QQRIKS + I+E    EA EMP+I SSIPFLP+VTPKTLKQLY TS++ ISGII FGGLIAP LELK+GLG
Subjt:  MQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLG

Query:  GTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLN
        GTSYEDFI ++HLP+QLSQVDPIVASFSGGAVGVIS LMLIE NNV+QQEKKRCKYC GTGYL CARCS+SGVCLS DPI+    +++ ++V TT+RCLN
Subjt:  GTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLN

Query:  CSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        CSGAGKVMCPTCLCTGM+ ASEHDPRFDPFD
Subjt:  CSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

AT5G06130.2 chaperone protein dnaJ-related1.5e-11476.73Show/hide
Query:  SRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSS
        S RS L CS   N+     P SGD  P+NFCIIEG ETVQDFVQMQ QEIQDNIRSRRNKIFLLMEEVRRLR+QQRIKS + I+E    EA EMP+I SS
Subjt:  SRRSPLICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSS

Query:  IPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKY
        IPFLP+VTPKTLKQLY TS++ ISGII FGGLIAP LELK+GLGGTSYEDFI ++HLP+QLSQVDPIVASFSGGAVGVIS LMLIE NNV+QQEKKRCKY
Subjt:  IPFLPHVTPKTLKQLYFTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKY

Query:  CHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        C GTGYL CARCS+SGVCLS DPI+    +++ ++V TT+RCLNCSGAGKVMCPTCLCTGM+ ASEHDPRFDPFD
Subjt:  CHGTGYLACARCSSSGVCLSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

AT5G61670.1 Encodes a close homolog of the Cauliflower OR (Orange) protein. The function of OR is to induce the differentiation of proplastids or other noncolored plastids into chromoplasts for carotenoid accumulation. Both proteins contain a Cysteine-rich zinc finger domain that is highly specific to DnaJ-like molecular chaperons.2.3e-9971.98Show/hide
Query:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT
        SS D   S FCIIEGPETVQDF +MQ QEIQDNIRSRRNKIFL MEEVRRLRIQQRIK++E  +I+E   E+ +E+P+ PS IPFLP +T   LK  Y T
Subjt:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT

Query:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC
          S I+GII+FGGL+APTLELKLG+GGTSY DFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E NNV+QQE KRCKYC GTGYLACARCSS+G  
Subjt:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC

Query:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        +  +P+S  A  +  L  P T+RC NCSGAGKVMCPTCLCTGM MASEHDPR DPFD
Subjt:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD

AT5G61670.2 Encodes a close homolog of the Cauliflower OR (Orange) protein. The function of OR is to induce the differentiation of proplastids or other noncolored plastids into chromoplasts for carotenoid accumulation. Both proteins contain a Cysteine-rich zinc finger domain that is highly specific to DnaJ-like molecular chaperons.2.3e-9971.98Show/hide
Query:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT
        SS D   S FCIIEGPETVQDF +MQ QEIQDNIRSRRNKIFL MEEVRRLRIQQRIK++E  +I+E   E+ +E+P+ PS IPFLP +T   LK  Y T
Subjt:  SSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSE--VIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLYFT

Query:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC
          S I+GII+FGGL+APTLELKLG+GGTSY DFI ++HLPMQLSQVDPIVASFSGGAVGVISALM++E NNV+QQE KRCKYC GTGYLACARCSS+G  
Subjt:  SLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVC

Query:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD
        +  +P+S  A  +  L  P T+RC NCSGAGKVMCPTCLCTGM MASEHDPR DPFD
Subjt:  LSADPISLSATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCCACTACTTATACAACAAAGCAATGCATTCGTCACACAAATGTGAATTCTCGCCCATGTCTTTTGTCTACAATGGTGTCTCATCCGCCACTGGTATTCGTCGG
TTGGAACACTCGACGTCGGCAAATCTCCAAGCTCTCCGAAGTATCACTTCCATGGCACCGGGATCCGAAGCTGGCGCCCAAGTCTTCGGATGAGAACAAAGATATTTTGC
CCTCGGAAGATGACCCTGAAGATGGAGTCTCGCTTGGGACCATGAAATTGCCTTCGGACGTTGACATTGCGAGATTCGAGGTCTTACTCTTCCAGTGGGCCAACAGTCTT
TGCCAGGGAGCTAACCTGCCGCTTCCAGTGCCTTTGAAGGTTGACAAAATACCAAGTGGAGTTAGACTTGGTTTTATCACAATTGGAGATGGAAAGACAGAAGTTCTCGT
GTATATAGATTGCTTAGTTTTCCCTGCTACTACCAGCACTAGTCCAATTTTTCGTGCCATAAGGAATGGACGCTTAAAGGATCAGTCACCTCCTGGTGAACCGAGAATTA
TGAGGAGTCTTTTGAGTGCTTTGAAAAAATCTGTTGAAATTTCTACATCTGATTCAGAAAACTATGCTAACGATTCTGCTAAATCAAACAAACAAAAGAATATGGCCTGC
CCAATGCTGGCCTTTTGGATGGCTTCTACTGATTTGGAAATTAGAATGCACTGCATTTTCTTTCCAGTTTTGGACTATCCACAAATTGTTAATGGAGACATTGGAAATAT
GCTGTCAAGAAGCTCAAATCAAGAGATGGAGAAACTGAAAACTGTAGACGAAATGTGTGGTGCTGGTGTAGATAATGGTTTCGATGCCATTGTTGTGGGGTCAGGATATG
GTGGTTCTGTTGCTGCATGTCGGATGTCTATGGCAGGAATAAAAGTATGTTTACTTGAGAAAGGCCGTAAATGGGAATCTCAGGATTTTGTTACTGACAGCATGAAATTA
ACTTCAGCTGTGAGGGTGGAAAATCGCAATTTAGGCCTAAGCTTTGGTCCAAAAGATGCATTATTCCAGATATATTTGAAGAGTGGCTTTGAAGTTATCATACATACATA
TATAGATACAGGACAATCACTATCTAACAGAGTGCTTAAACCAAATCAAAATTCACAGCTTCAGGATCTAAGTAGTTTGGGAGGTTCACTAGTGAATGCTGGAGTGATGC
TTCCAACTCCAGTTCTCATTAGAAGGAATCCAAACTGGCCAAAAGAATGGGAGAGGGATTGGTATTTCTGTGAAGCAGCTGCTGCGGCCATGTTGAAGGTACAAAGTGTT
CCCATCAAGTTTCCTTCTGCCAAAGTTTTAGAAGAAATTGTTGACGAAGAGAGTGGAGGGAGTTTTGAGTCTTCGGTGAATCTTAGCATTAACTTCGATCTTGAGGAATC
ACTGTCTAATTCGTTGAAGATCCAACAGAGGGGCTGCTTAGCTTGTGGAAATTGCATTGCTGGATGCCCTTATAATGCGAAGAGTTCAACAGACAAAAATTATTTACTGA
CAGCCATCCAGGCAGGATGTGTTGTTCATACTGCATGTCAAGTTCAATATGTCGTTAAAAGTTCATATAACCAAGAAGGCGAAACCTCCCAAAAAAGAAGATGGTCTGTT
TACTTGAATGAGCTTGACTTTATAACCTGTGATTTTGTAATCCTCTCAGCTGGAGTTTTTGGTACAACTGAGATACTCTTTCGGTCTCAAATGAGAGGACTAAAAGTTTC
TGAAGCACTTGGCTGTGGATTTAGCTGTAATGGAAATGCTGTGGCCTATCTTGCTGGGAGTCGAGCACCCTTGAATGCTTATGGGTTAGGTAGAGAGCAGCTTTGGAAGA
AAGCTTTTCATGAACGGCCAGGACCATCTATCTCTTCTTCTTACACTTCTTCATTGGGATTCACAATTCAGAGTGCTGTACTTCCTTCTGCTTATCCTAACCTGCTTTTT
AAAGGGATTACAACTTATGGATGGCCCAATGGCTACTGGTTCTTTCATGGGATTTTAGATAAGTTGAAACAAGTTCTAAGCTTCAAAGCAAGCCAAGCAATTGTTCTGAA
CGCAATGGGTCATGACGAGAGTGATGGGAAAATTATGTTGCAAAGGGACACAGATAAAATGTCTTTTTTTCCACCACTTGATCCTTTTCTTCCACAAAAAATAAATGTAT
TTCAAAGAATCACAAAAAAATTAGGTGGGATCCTTTTCATTTCAAGGTATCGAAGCACATCAGTTCACCATTTAGGTGGGTGCAATGTGGCATCTGATCCTTCTCGTGGT
GTTTGCAATGCCGGTGGTCAGGTTTTTGATCCCAAGGGTCCTGCTTCTGTGCATCCAGGCCTCTATGTTTGTGATGCTTCATTAATTCCATGTTCTGTTGGTGTAAATCC
ATCTTTCACAATCACAATTGTTTCTGAACATGTAAGCAAGCATCTTGTGAGCGATATTCTCAAGTACAAGAGCCAACAGGGCATTGAACTTTCTGCTATCAATGATAATA
AGCATTCTGTCTCCAAAACAAATATAAATAGATCCCAGAGATCAATAGTCATGGCTAAAGAAACCATGAGGGGTTATGTGGGAGGAATGCCTTGTGCTGTTTTTCTCATT
ATGAAGATGAACTCCGAGGGTCAGAAAGATTTCTATCAATCAAAAGAAAGTCTTGGAGAATGTCATCCACTTCTTAGAGGAAAAGTTGGTGGATATGCGGAATTTCCGGC
CATTGAGAAGGATAATTTATATATTATCGATGGGGAAGTAAATTTGTGTGATACAGGTTGTCGAACTCCCTTCACTCAGTTTATGAATTATCATCTTCTCCTTGCGGCTT
CTTCTGGATCAAGATATATTCTGAAGGGGAAAAAGACCTTGAATCCTTATCTCTTTGGTCTATATGCTTGGACAGAGACAACAAAACTGCATGTGACAATTGAAAAAATT
GGTGAAAACAGTTCAATGAAGGACACAGTCGTTTTAAAAGGGGAACTCAGCATCTCAATGTTGGAACTTCTCAAGAGTTTCTTAAGCCTTGAGGGAGAAAGGAAAGGACA
GTTTCTTTGTCTTCTAATAAGGACTCTTCTGAGAACCTATATTTTACAGATACCACGGATGACTCACAAAAACTCAACACCGTTGGGGTGTTTAAAAAACCCCTACGAAT
GCAGTTCTCGTTATGAAATCACAACAGAAGATGGAATTATCATCAGTTGCATAAAATTCAGCTCTGCCCAATATCCATCTAGGGTTCGAGGAGGAAAAGAACTAAATCCA
GTTCTCCTGGTTAATGGCTATTCTACAGAGAGTTACTTTCTGCCAACAGAACCCACTGATTTAGCTAGAACTTTACTTGGAGAAGGGCATGATATTTGGCTATTGCAATC
AAGATTACACCCTCTGAATCCTTCTAATGACTTCACAATAGAAGATGTTGGTAGATTTGACATCCCTGCTGCAATCAGCAAGATCCTAGAAATGGATGGATCATGCAGAA
AGGCTAACTCTTTCATCAATGGTCAAAATGTGGCTTCCTCTGATCCCAATGTCCATGGCTATACTTGGCAAGAACAAGATCCTCCCTCTTTTGGGAACGTCAAATATCAG
CAGGAGACATCGGCTCCTAAAACTGATAGCCCGTTTGTTACCGCGCCAAAATTCAGACATGAAAGGTTGGTTGTGGATGGTTTTGGGCATTCTGATCTATTGATTGGAGA
AAAGTCTTGTAAGGAAGTATTTCCCCACATTCTGTCACATATGAAACTAGCTGAAAAAGAAGGTGCAATGACTGGTGATGCAAGGAAGAGATACAGTAGGGAGGCATTGT
CTTGGAGTGAAGATCCACATGATGGAATCTCAATTTTCTTCACCCTCGATGATCGCCACCGCAAATCCCTCAATCCACCCTCTAGAGGCACTTCCCGACGATCGCCGCTC
ATCTGTTCTTCCTCGTTCAATGATGCGTTTGGTTCAGTTCCCTCCAGTGGTGATAACACTCCCAGTAACTTCTGCATCATAGAAGGACCGGAGACCGTTCAGGATTTTGT
TCAGATGCAATTCCAGGAAATCCAAGACAACATAAGGAGTCGGCGTAATAAAATTTTTCTCCTAATGGAAGAGGTTAGAAGATTACGAATACAACAACGCATAAAGAGTT
CAGAAGTTATTGATGAGAGTGATAATGAAGAGGCATATGAGATGCCTGACATTCCATCATCTATTCCTTTTCTTCCCCATGTGACACCAAAGACGTTGAAGCAGCTATAT
TTCACCAGCTTGTCGTTTATATCGGGAATAATTGTATTTGGTGGCCTCATTGCCCCAACTCTGGAGCTAAAATTGGGATTAGGTGGCACTTCATATGAAGATTTCATCCT
CAACATGCATTTGCCAATGCAATTAAGTCAAGTGGACCCCATTGTGGCGTCCTTTTCAGGTGGAGCGGTAGGTGTCATTTCGGCTTTGATGTTAATTGAAGCTAACAATG
TTGAGCAACAAGAGAAGAAAAGGTGTAAATATTGTCATGGAACTGGATATTTGGCCTGTGCTCGATGTTCTTCAAGTGGCGTATGCTTAAGTGCTGATCCCATTTCACTA
TCTGCCACTTCTAGCCGCCCTCTACGAGTGCCCACAACTCAAAGATGCCTCAATTGCTCCGGTGCAGGAAAGGTAATGTGCCCAACATGTTTGTGTACTGGGATGTTAAT
GGCAAGTGAGCACGACCCACGGTTCGACCCATTCGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCCACTACTTATACAACAAAGCAATGCATTCGTCACACAAATGTGAATTCTCGCCCATGTCTTTTGTCTACAATGGTGTCTCATCCGCCACTGGTATTCGTCGG
TTGGAACACTCGACGTCGGCAAATCTCCAAGCTCTCCGAAGTATCACTTCCATGGCACCGGGATCCGAAGCTGGCGCCCAAGTCTTCGGATGAGAACAAAGATATTTTGC
CCTCGGAAGATGACCCTGAAGATGGAGTCTCGCTTGGGACCATGAAATTGCCTTCGGACGTTGACATTGCGAGATTCGAGGTCTTACTCTTCCAGTGGGCCAACAGTCTT
TGCCAGGGAGCTAACCTGCCGCTTCCAGTGCCTTTGAAGGTTGACAAAATACCAAGTGGAGTTAGACTTGGTTTTATCACAATTGGAGATGGAAAGACAGAAGTTCTCGT
GTATATAGATTGCTTAGTTTTCCCTGCTACTACCAGCACTAGTCCAATTTTTCGTGCCATAAGGAATGGACGCTTAAAGGATCAGTCACCTCCTGGTGAACCGAGAATTA
TGAGGAGTCTTTTGAGTGCTTTGAAAAAATCTGTTGAAATTTCTACATCTGATTCAGAAAACTATGCTAACGATTCTGCTAAATCAAACAAACAAAAGAATATGGCCTGC
CCAATGCTGGCCTTTTGGATGGCTTCTACTGATTTGGAAATTAGAATGCACTGCATTTTCTTTCCAGTTTTGGACTATCCACAAATTGTTAATGGAGACATTGGAAATAT
GCTGTCAAGAAGCTCAAATCAAGAGATGGAGAAACTGAAAACTGTAGACGAAATGTGTGGTGCTGGTGTAGATAATGGTTTCGATGCCATTGTTGTGGGGTCAGGATATG
GTGGTTCTGTTGCTGCATGTCGGATGTCTATGGCAGGAATAAAAGTATGTTTACTTGAGAAAGGCCGTAAATGGGAATCTCAGGATTTTGTTACTGACAGCATGAAATTA
ACTTCAGCTGTGAGGGTGGAAAATCGCAATTTAGGCCTAAGCTTTGGTCCAAAAGATGCATTATTCCAGATATATTTGAAGAGTGGCTTTGAAGTTATCATACATACATA
TATAGATACAGGACAATCACTATCTAACAGAGTGCTTAAACCAAATCAAAATTCACAGCTTCAGGATCTAAGTAGTTTGGGAGGTTCACTAGTGAATGCTGGAGTGATGC
TTCCAACTCCAGTTCTCATTAGAAGGAATCCAAACTGGCCAAAAGAATGGGAGAGGGATTGGTATTTCTGTGAAGCAGCTGCTGCGGCCATGTTGAAGGTACAAAGTGTT
CCCATCAAGTTTCCTTCTGCCAAAGTTTTAGAAGAAATTGTTGACGAAGAGAGTGGAGGGAGTTTTGAGTCTTCGGTGAATCTTAGCATTAACTTCGATCTTGAGGAATC
ACTGTCTAATTCGTTGAAGATCCAACAGAGGGGCTGCTTAGCTTGTGGAAATTGCATTGCTGGATGCCCTTATAATGCGAAGAGTTCAACAGACAAAAATTATTTACTGA
CAGCCATCCAGGCAGGATGTGTTGTTCATACTGCATGTCAAGTTCAATATGTCGTTAAAAGTTCATATAACCAAGAAGGCGAAACCTCCCAAAAAAGAAGATGGTCTGTT
TACTTGAATGAGCTTGACTTTATAACCTGTGATTTTGTAATCCTCTCAGCTGGAGTTTTTGGTACAACTGAGATACTCTTTCGGTCTCAAATGAGAGGACTAAAAGTTTC
TGAAGCACTTGGCTGTGGATTTAGCTGTAATGGAAATGCTGTGGCCTATCTTGCTGGGAGTCGAGCACCCTTGAATGCTTATGGGTTAGGTAGAGAGCAGCTTTGGAAGA
AAGCTTTTCATGAACGGCCAGGACCATCTATCTCTTCTTCTTACACTTCTTCATTGGGATTCACAATTCAGAGTGCTGTACTTCCTTCTGCTTATCCTAACCTGCTTTTT
AAAGGGATTACAACTTATGGATGGCCCAATGGCTACTGGTTCTTTCATGGGATTTTAGATAAGTTGAAACAAGTTCTAAGCTTCAAAGCAAGCCAAGCAATTGTTCTGAA
CGCAATGGGTCATGACGAGAGTGATGGGAAAATTATGTTGCAAAGGGACACAGATAAAATGTCTTTTTTTCCACCACTTGATCCTTTTCTTCCACAAAAAATAAATGTAT
TTCAAAGAATCACAAAAAAATTAGGTGGGATCCTTTTCATTTCAAGGTATCGAAGCACATCAGTTCACCATTTAGGTGGGTGCAATGTGGCATCTGATCCTTCTCGTGGT
GTTTGCAATGCCGGTGGTCAGGTTTTTGATCCCAAGGGTCCTGCTTCTGTGCATCCAGGCCTCTATGTTTGTGATGCTTCATTAATTCCATGTTCTGTTGGTGTAAATCC
ATCTTTCACAATCACAATTGTTTCTGAACATGTAAGCAAGCATCTTGTGAGCGATATTCTCAAGTACAAGAGCCAACAGGGCATTGAACTTTCTGCTATCAATGATAATA
AGCATTCTGTCTCCAAAACAAATATAAATAGATCCCAGAGATCAATAGTCATGGCTAAAGAAACCATGAGGGGTTATGTGGGAGGAATGCCTTGTGCTGTTTTTCTCATT
ATGAAGATGAACTCCGAGGGTCAGAAAGATTTCTATCAATCAAAAGAAAGTCTTGGAGAATGTCATCCACTTCTTAGAGGAAAAGTTGGTGGATATGCGGAATTTCCGGC
CATTGAGAAGGATAATTTATATATTATCGATGGGGAAGTAAATTTGTGTGATACAGGTTGTCGAACTCCCTTCACTCAGTTTATGAATTATCATCTTCTCCTTGCGGCTT
CTTCTGGATCAAGATATATTCTGAAGGGGAAAAAGACCTTGAATCCTTATCTCTTTGGTCTATATGCTTGGACAGAGACAACAAAACTGCATGTGACAATTGAAAAAATT
GGTGAAAACAGTTCAATGAAGGACACAGTCGTTTTAAAAGGGGAACTCAGCATCTCAATGTTGGAACTTCTCAAGAGTTTCTTAAGCCTTGAGGGAGAAAGGAAAGGACA
GTTTCTTTGTCTTCTAATAAGGACTCTTCTGAGAACCTATATTTTACAGATACCACGGATGACTCACAAAAACTCAACACCGTTGGGGTGTTTAAAAAACCCCTACGAAT
GCAGTTCTCGTTATGAAATCACAACAGAAGATGGAATTATCATCAGTTGCATAAAATTCAGCTCTGCCCAATATCCATCTAGGGTTCGAGGAGGAAAAGAACTAAATCCA
GTTCTCCTGGTTAATGGCTATTCTACAGAGAGTTACTTTCTGCCAACAGAACCCACTGATTTAGCTAGAACTTTACTTGGAGAAGGGCATGATATTTGGCTATTGCAATC
AAGATTACACCCTCTGAATCCTTCTAATGACTTCACAATAGAAGATGTTGGTAGATTTGACATCCCTGCTGCAATCAGCAAGATCCTAGAAATGGATGGATCATGCAGAA
AGGCTAACTCTTTCATCAATGGTCAAAATGTGGCTTCCTCTGATCCCAATGTCCATGGCTATACTTGGCAAGAACAAGATCCTCCCTCTTTTGGGAACGTCAAATATCAG
CAGGAGACATCGGCTCCTAAAACTGATAGCCCGTTTGTTACCGCGCCAAAATTCAGACATGAAAGGTTGGTTGTGGATGGTTTTGGGCATTCTGATCTATTGATTGGAGA
AAAGTCTTGTAAGGAAGTATTTCCCCACATTCTGTCACATATGAAACTAGCTGAAAAAGAAGGTGCAATGACTGGTGATGCAAGGAAGAGATACAGTAGGGAGGCATTGT
CTTGGAGTGAAGATCCACATGATGGAATCTCAATTTTCTTCACCCTCGATGATCGCCACCGCAAATCCCTCAATCCACCCTCTAGAGGCACTTCCCGACGATCGCCGCTC
ATCTGTTCTTCCTCGTTCAATGATGCGTTTGGTTCAGTTCCCTCCAGTGGTGATAACACTCCCAGTAACTTCTGCATCATAGAAGGACCGGAGACCGTTCAGGATTTTGT
TCAGATGCAATTCCAGGAAATCCAAGACAACATAAGGAGTCGGCGTAATAAAATTTTTCTCCTAATGGAAGAGGTTAGAAGATTACGAATACAACAACGCATAAAGAGTT
CAGAAGTTATTGATGAGAGTGATAATGAAGAGGCATATGAGATGCCTGACATTCCATCATCTATTCCTTTTCTTCCCCATGTGACACCAAAGACGTTGAAGCAGCTATAT
TTCACCAGCTTGTCGTTTATATCGGGAATAATTGTATTTGGTGGCCTCATTGCCCCAACTCTGGAGCTAAAATTGGGATTAGGTGGCACTTCATATGAAGATTTCATCCT
CAACATGCATTTGCCAATGCAATTAAGTCAAGTGGACCCCATTGTGGCGTCCTTTTCAGGTGGAGCGGTAGGTGTCATTTCGGCTTTGATGTTAATTGAAGCTAACAATG
TTGAGCAACAAGAGAAGAAAAGGTGTAAATATTGTCATGGAACTGGATATTTGGCCTGTGCTCGATGTTCTTCAAGTGGCGTATGCTTAAGTGCTGATCCCATTTCACTA
TCTGCCACTTCTAGCCGCCCTCTACGAGTGCCCACAACTCAAAGATGCCTCAATTGCTCCGGTGCAGGAAAGGTAATGTGCCCAACATGTTTGTGTACTGGGATGTTAAT
GGCAAGTGAGCACGACCCACGGTTCGACCCATTCGACTGA
Protein sequenceShow/hide protein sequence
MDATTYTTKQCIRHTNVNSRPCLLSTMVSHPPLVFVGWNTRRRQISKLSEVSLPWHRDPKLAPKSSDENKDILPSEDDPEDGVSLGTMKLPSDVDIARFEVLLFQWANSL
CQGANLPLPVPLKVDKIPSGVRLGFITIGDGKTEVLVYIDCLVFPATTSTSPIFRAIRNGRLKDQSPPGEPRIMRSLLSALKKSVEISTSDSENYANDSAKSNKQKNMAC
PMLAFWMASTDLEIRMHCIFFPVLDYPQIVNGDIGNMLSRSSNQEMEKLKTVDEMCGAGVDNGFDAIVVGSGYGGSVAACRMSMAGIKVCLLEKGRKWESQDFVTDSMKL
TSAVRVENRNLGLSFGPKDALFQIYLKSGFEVIIHTYIDTGQSLSNRVLKPNQNSQLQDLSSLGGSLVNAGVMLPTPVLIRRNPNWPKEWERDWYFCEAAAAAMLKVQSV
PIKFPSAKVLEEIVDEESGGSFESSVNLSINFDLEESLSNSLKIQQRGCLACGNCIAGCPYNAKSSTDKNYLLTAIQAGCVVHTACQVQYVVKSSYNQEGETSQKRRWSV
YLNELDFITCDFVILSAGVFGTTEILFRSQMRGLKVSEALGCGFSCNGNAVAYLAGSRAPLNAYGLGREQLWKKAFHERPGPSISSSYTSSLGFTIQSAVLPSAYPNLLF
KGITTYGWPNGYWFFHGILDKLKQVLSFKASQAIVLNAMGHDESDGKIMLQRDTDKMSFFPPLDPFLPQKINVFQRITKKLGGILFISRYRSTSVHHLGGCNVASDPSRG
VCNAGGQVFDPKGPASVHPGLYVCDASLIPCSVGVNPSFTITIVSEHVSKHLVSDILKYKSQQGIELSAINDNKHSVSKTNINRSQRSIVMAKETMRGYVGGMPCAVFLI
MKMNSEGQKDFYQSKESLGECHPLLRGKVGGYAEFPAIEKDNLYIIDGEVNLCDTGCRTPFTQFMNYHLLLAASSGSRYILKGKKTLNPYLFGLYAWTETTKLHVTIEKI
GENSSMKDTVVLKGELSISMLELLKSFLSLEGERKGQFLCLLIRTLLRTYILQIPRMTHKNSTPLGCLKNPYECSSRYEITTEDGIIISCIKFSSAQYPSRVRGGKELNP
VLLVNGYSTESYFLPTEPTDLARTLLGEGHDIWLLQSRLHPLNPSNDFTIEDVGRFDIPAAISKILEMDGSCRKANSFINGQNVASSDPNVHGYTWQEQDPPSFGNVKYQ
QETSAPKTDSPFVTAPKFRHERLVVDGFGHSDLLIGEKSCKEVFPHILSHMKLAEKEGAMTGDARKRYSREALSWSEDPHDGISIFFTLDDRHRKSLNPPSRGTSRRSPL
ICSSSFNDAFGSVPSSGDNTPSNFCIIEGPETVQDFVQMQFQEIQDNIRSRRNKIFLLMEEVRRLRIQQRIKSSEVIDESDNEEAYEMPDIPSSIPFLPHVTPKTLKQLY
FTSLSFISGIIVFGGLIAPTLELKLGLGGTSYEDFILNMHLPMQLSQVDPIVASFSGGAVGVISALMLIEANNVEQQEKKRCKYCHGTGYLACARCSSSGVCLSADPISL
SATSSRPLRVPTTQRCLNCSGAGKVMCPTCLCTGMLMASEHDPRFDPFD