CuGenDBv2

Clc03G16320 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc03G16320
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Chlorophyll a-b binding protein, chloroplastic
Genome location: ClcChr03:28706049..28722702
RNA-Seq Expression: Clc03G16320
Synteny: Clc03G16320
Gene Ontology terms:
GO:0006744 - ubiquinone biosynthetic process (biological process)
GO:0018298 - protein-chromophore linkage (biological process)
GO:0009768 - photosynthesis, light harvesting in photosystem I (biological process)
GO:0009416 - response to light stimulus (biological process)
GO:0009522 - photosystem I (cellular component)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0008289 - lipid binding (molecular function)
GO:0016168 - chlorophyll binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001344 - Chlorophyll A-B binding protein, plant and chromista
IPR009327 - Cupin domain of unknown function DUF985
IPR011051 - RmlC-like cupin domain superfamily
IPR012762 - Ubiquinone biosynthesis protein COQ9
IPR013718 - COQ9
IPR014710 - RmlC-like jelly roll fold
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily
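
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field can be split and the span length computed as follows. The function names `parse_location` and `span_length` are illustrative and are not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive interval."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr03:28706049..28722702")
print(seqid, start, end, span_length(start, end))
# prints: ClcChr03 28706049 28722702 16654
```

Under that assumption the gene model spans 16,654 bp of the chromosome, which includes introns as well as the transcribed exons.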


Homology
GenBank top hits (e value, % identity, alignment)
KAF4357475.1 hypothetical protein F8388_002373 [Cannabis sativa] | e value: 3.6e-188 | identity: 68.24%
Query:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQ----HHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEY
        MYRTAAKRL++  ++  NG  R R PP   F       ++ +S  FPN Q PNQ    HH + +   SSDSASS+SS  S+  S ++   H ++ P  +Y
Subjt:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQ----HHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEY

Query:  QEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQ
        QEEQARVL+A+L HV+KLGWTEAAM+AGARD G+SP+I+GS  RKEA LVEFFMDDCLQRLID I++GE LK+LI  ERI KLVR RLE+QAP+I+KW Q
Subjt:  QEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQ

Query:  ALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMG
        ALSIQAQP N+ TSFKQRAMLVDEIWHAAG + SDIDWYV RT+LGGIYS TEIYMLTDSS DF+DTW FL+ R+KDAFD KKT+QEAKYLAE VGAGMG
Subjt:  ALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMG

Query:  NSLQGFVEGGWGRT---KNPKMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGE
                 G+  +       MATAS+IVAKLNL PHPEGGFY ET RD+SV LSKS LPP+YKV+R V++CIYFL+PSG VS LHRIPCAETWHFYLGE
Subjt:  NSLQGFVEGGWGRT---KNPKMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGE

Query:  PLTVLELNEKDGRVKLTCLGSDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASE
        P+T++E+NE+DG VKLTCLG D IGDNQ PQYTVPPNVWFGAFPTKDFNI  DGT+ KAA RD E+HYSLVGC+CAPAFQF+DFELAKRS LV+ FP SE
Subjt:  PLTVLELNEKDGRVKLTCLGSDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASE

Query:  ALISLLT
         LISLLT
Subjt:  ALISLLT

TYK10012.1 ubiquinone biosynthesis protein COQ9-B [Cucumis melo var. makuwa] | e value: 3.5e-151 | identity: 93.09%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T G NGI HRLRLP PSTFLDH SFSTATNS EFPNSQIPNQHHIRQETPTS+DSASSSSSSWSTSTSGEE RSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QAR+LQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQ+LIDLIEA EGLKNLILRER+YKLVRARLE+Q PFISKWAQALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYS TEIYMLTDSSP FQDTWTFLDNRLKDAFDIKKT+QEAKYLAEAVGAGMGNS 
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFV
        QGFV
Subjt:  QGFV

XP_004144042.2 ubiquinone biosynthesis protein COQ9-B, mitochondrial [Cucumis sativus] | e value: 2.3e-150 | identity: 93.77%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T+G N I HRLR P PSTFLDH SFSTATNSQEFPNSQIPNQ+HIRQETPTSSDSASSSSSSWSTS SGEETRSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEG-LKNLILRERIYKLVRARLELQAPFISKWAQAL
        QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEA EG LKNLILRER+YKLVRARLE+Q PFISKWAQAL
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEG-LKNLILRERIYKLVRARLELQAPFISKWAQAL

Query:  SIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS
        SIQA PANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS
Subjt:  SIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS

Query:  LQGFV
         QGFV
Subjt:  LQGFV

XP_008450946.1 PREDICTED: ubiquinone biosynthesis protein COQ9-B, mitochondrial [Cucumis melo] | e value: 4.6e-151 | identity: 93.09%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T G NGI HRLRLP PSTF DH SFSTATNS EFPNSQIPNQHHIRQETPTS+DSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QAR+LQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQ+LIDLIEA EGLKNLILRER+YKLVRARLE+Q PFISKWAQALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYS TEIYMLTDSSP FQDTWTFLDNRLKDAFDIKKT+QEAKYLAEAVGAGMGNS 
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFV
        QGFV
Subjt:  QGFV

XP_038878485.1 ubiquinone biosynthesis protein COQ9-B, mitochondrial [Benincasa hispida] | e value: 3.7e-153 | identity: 94.06%
Query:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEEQ
        MYRTAAKRLIVGVTSG NGI RLRLP PSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPT SDSASSS+SSWSTSTSGEETRSH+NRRPRVEYQEEQ
Subjt:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEEQ

Query:  ARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALSI
        ARVLQAALSHVVKLGWTEAAMIAGARD+ MSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEA EGLKNLILRERIYKLVRARLE+QAPFISKWAQALSI
Subjt:  ARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALSI

Query:  QAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSLQ
        QAQPANLATSFKQRA LVDEIWHA+G DTSD+DWYV RTILGGIYSATEIYMLTDSSPDFQD+WTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGN+ Q
Subjt:  QAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSLQ

Query:  GFV
        GFV
Subjt:  GFV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LZL8 Ubiquinone biosynthesis protein | e value: 1.1e-150 | identity: 93.77%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T+G N I HRLR P PSTFLDH SFSTATNSQEFPNSQIPNQ+HIRQETPTSSDSASSSSSSWSTS SGEETRSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEG-LKNLILRERIYKLVRARLELQAPFISKWAQAL
        QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEA EG LKNLILRER+YKLVRARLE+Q PFISKWAQAL
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEG-LKNLILRERIYKLVRARLELQAPFISKWAQAL

Query:  SIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS
        SIQA PANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS
Subjt:  SIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS

Query:  LQGFV
         QGFV
Subjt:  LQGFV

A0A1S3BQF8 Ubiquinone biosynthesis protein | e value: 2.2e-151 | identity: 93.09%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T G NGI HRLRLP PSTF DH SFSTATNS EFPNSQIPNQHHIRQETPTS+DSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QAR+LQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQ+LIDLIEA EGLKNLILRER+YKLVRARLE+Q PFISKWAQALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYS TEIYMLTDSSP FQDTWTFLDNRLKDAFDIKKT+QEAKYLAEAVGAGMGNS 
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFV
        QGFV
Subjt:  QGFV

A0A5A7UQC6 Ubiquinone biosynthesis protein | e value: 2.2e-151 | identity: 93.09%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T G NGI HRLRLP PSTF DH SFSTATNS EFPNSQIPNQHHIRQETPTS+DSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QAR+LQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQ+LIDLIEA EGLKNLILRER+YKLVRARLE+Q PFISKWAQALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYS TEIYMLTDSSP FQDTWTFLDNRLKDAFDIKKT+QEAKYLAEAVGAGMGNS 
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFV
        QGFV
Subjt:  QGFV

A0A5D3CFB9 Ubiquinone biosynthesis protein | e value: 1.7e-151 | identity: 93.09%
Query:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRLI G+T G NGI HRLRLP PSTFLDH SFSTATNS EFPNSQIPNQHHIRQETPTS+DSASSSSSSWSTSTSGEE RSHENRRPRVEYQEE
Subjt:  MYRTAAKRLIVGVTSGINGI-HRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QAR+LQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQ+LIDLIEA EGLKNLILRER+YKLVRARLE+Q PFISKWAQALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYV RTILGGIYS TEIYMLTDSSP FQDTWTFLDNRLKDAFDIKKT+QEAKYLAEAVGAGMGNS 
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFV
        QGFV
Subjt:  QGFV

A0A7J6EGB7 Ubiquinone biosynthesis protein | e value: 1.7e-188 | identity: 68.24%
Query:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQ----HHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEY
        MYRTAAKRL++  ++  NG  R R PP   F       ++ +S  FPN Q PNQ    HH + +   SSDSASS+SS  S+  S ++   H ++ P  +Y
Subjt:  MYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQ----HHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEY

Query:  QEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQ
        QEEQARVL+A+L HV+KLGWTEAAM+AGARD G+SP+I+GS  RKEA LVEFFMDDCLQRLID I++GE LK+LI  ERI KLVR RLE+QAP+I+KW Q
Subjt:  QEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQ

Query:  ALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMG
        ALSIQAQP N+ TSFKQRAMLVDEIWHAAG + SDIDWYV RT+LGGIYS TEIYMLTDSS DF+DTW FL+ R+KDAFD KKT+QEAKYLAE VGAGMG
Subjt:  ALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMG

Query:  NSLQGFVEGGWGRT---KNPKMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGE
                 G+  +       MATAS+IVAKLNL PHPEGGFY ET RD+SV LSKS LPP+YKV+R V++CIYFL+PSG VS LHRIPCAETWHFYLGE
Subjt:  NSLQGFVEGGWGRT---KNPKMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGE

Query:  PLTVLELNEKDGRVKLTCLGSDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASE
        P+T++E+NE+DG VKLTCLG D IGDNQ PQYTVPPNVWFGAFPTKDFNI  DGT+ KAA RD E+HYSLVGC+CAPAFQF+DFELAKRS LV+ FP SE
Subjt:  PLTVLELNEKDGRVKLTCLGSDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASE

Query:  ALISLLT
         LISLLT
Subjt:  ALISLLT

SwissProt top hits (e value, % identity, alignment)
P10708 Chlorophyll a-b binding protein 7, chloroplastic | e value: 3.6e-66 | identity: 52.44%
Query:  YSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGA
        YS    A+S  +TVC    PDRPLWFPGS+PP WLDG                               RWAML  AGI +PE    +G++   SWY AG 
Subjt:  YSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGA

Query:  REYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYT
        +EYF D TTL +V+L L+GW EGRRWAD++ PG V+ D   P+ K    D+GYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYT
Subjt:  REYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYT

Query:  GQGPLENLAAHVADPGHCNIFSAFS
        G GP++NL AH+ADPGH  IF+AFS
Subjt:  GQGPLENLAAHVADPGHCNIFSAFS

P13869 Chlorophyll a-b binding protein, chloroplastic | e value: 1.5e-64 | identity: 54.37%
Query:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG
        PDRPLWFPGS+PPEWLDG                               RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL V++L L+G
Subjt:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG

Query:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PG V+ D   P+ K    D+GYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFSAFS
        IF+AFS
Subjt:  IFSAFS

Q5RJV0 Ubiquinone biosynthesis protein COQ9, mitochondrial | e value: 1.1e-35 | identity: 37.33%
Query:  TSTSGEETRSHENRRPRVEYQEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRER-
        T  +GEE+  +E+       ++ Q R+L AAL  V   GW+  A+  GA+ + MS +  G F    +ELV  F+  C  +L +L+E  + L  L   E+ 
Subjt:  TSTSGEETRSHENRRPRVEYQEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRER-

Query:  -----IYKLVRARLELQAPFISKWAQALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNR
             +   V ARL +  P+I +W QAL +   P N+ +S K    +VD+IWH AG  ++D+ WY  R +L GIY+ TE+ ML DSSPDF+DTW FL+NR
Subjt:  -----IYKLVRARLELQAPFISKWAQALSIQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNR

Query:  LKDAFDIKKTVQEAKYLAEAVGAGM
        + +A  +  +V++     EAV  G+
Subjt:  LKDAFDIKKTVQEAKYLAEAVGAGM

Q8LCQ4 Photosystem I chlorophyll a/b-binding protein 6, chloroplastic | e value: 1.2e-85 | identity: 58.74%
Query:  MALSISSTALSTFPI------RETFHRGHFPG-KFPNYKLRRNYSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG--------------------
        MA +I+S   ST  +        T  R H         +L R       A   VS+VCEPLPPDRPLWFPGSSPPEWLDG                    
Subjt:  MALSISSTALSTFPI------RETFHRGHFPG-KFPNYKLRRNYSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG--------------------

Query:  -----------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGF
                   RWAMLAV GI++PE  E LG I+NFSWYDAG+REYFAD TTL V Q+ LMGW EGRRWADL+ PGSVD++ K PHK   KPD+GYPGG 
Subjt:  -----------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGF

Query:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSAFSS
        WFD MMWGRGSPEPVMVLRTKEIKNGRLAMLAF+G  FQA YT Q P+ENL AH+ADPGHCN+FSAF+S
Subjt:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSAFSS

Q9SYW8 Photosystem I chlorophyll a/b-binding protein 2, chloroplastic | e value: 1.5e-64 | identity: 54.37%
Query:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG
        PDRP+WFPGS+PPEWLDG                               RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL VV+L L+G
Subjt:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG

Query:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PGSV+ D   P+ K    D+GYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFSAFS
        IF+AF+
Subjt:  IFSAFS

Arabidopsis top hits (e value, % identity, alignment)
AT1G19130.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF985 (InterPro:IPR009327), RmlC-like jelly roll fold (InterPro:IPR014710); Has 1465 Blast hits to 1465 proteins in 584 species: Archae - 10; Bacteria - 1038; Metazoa - 19; Fungi - 43; Plants - 51; Viruses - 0; Other Eukaryotes - 304 (source: NCBI BLink). | e value: 3.9e-76 | identity: 71.12%
Query:  KMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLG
        KM  +SEIV KLNL+ H EGGF++ETFRD SV LS S LPP +KVDR VST IYFL+PSG VS LHRIP AETWHFYLGEPLTV+EL + DG++K TCLG
Subjt:  KMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLG

Query:  SDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASEALISLLT
         D    +Q PQYTVPPNVWFG+FPTKD + S DG + KA  RDSENH+SLVGC+CAPAFQFEDFELAKRSDL+SRFP  E+LI++L+
Subjt:  SDFIGDNQLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASEALISLLT

AT1G19140.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: ubiquinone biosynthetic process; LOCATED IN: mitochondrion; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: COQ9 (InterPro:IPR013718), Ubiquinone biosynthesis protein COQ9 (InterPro:IPR012762); Has 748 Blast hits to 748 proteins in 260 species: Archae - 0; Bacteria - 218; Metazoa - 126; Fungi - 101; Plants - 39; Viruses - 0; Other Eukaryotes - 264 (source: NCBI BLink). | e value: 5.1e-92 | identity: 59.8%
Query:  MYRTAAKRLIVGVTSGINGIHRLRLPP-PSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRL+ G   GI     LRLP   ST +  S  S    S    N++ P    + +++   + S  ++SSS  T   GE  R HE+R+PR E+QEE
Subjt:  MYRTAAKRLIVGVTSGINGIHRLRLPP-PSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QARVL A+L HV +LGWTE AM+AG+RD+G+SPSIVGSF+RKEA LVEFFMD+CLQ L+D I++G  L+NLI  ERI KL+R RLE+Q P++SKW QALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL
        IQA P N+ TSFKQRAMLVDEIWHA G   SD+DWYV RTILGG+YS TEIYMLTD S + +DTW FLD+R+KDAFD+KK++QEAKY AE +GAG+G S+
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSL

Query:  QGFVEG
        QG + G
Subjt:  QGFVEG

AT1G19140.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: ubiquinone biosynthetic process; LOCATED IN: mitochondrion; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: COQ9 (InterPro:IPR013718), Ubiquinone biosynthesis protein COQ9 (InterPro:IPR012762); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). | e value: 7.4e-91 | identity: 59.61%
Query:  MYRTAAKRLIVGVTSGINGIHRLRLPP-PSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE
        MYRTAAKRL+ G   GI     LRLP   ST +  S  S    S    N++ P    + +++   + S  ++SSS  T   GE  R HE+R+PR E+QEE
Subjt:  MYRTAAKRLIVGVTSGINGIHRLRLPP-PSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENRRPRVEYQEE

Query:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS
        QARVL A+L HV +LGWTE AM+AG+RD+G+SPSIVGSF+RKEA LVEFFMD+CLQ L+D I++G  L+NLI  ERI KL+R RLE+Q P++SKW QALS
Subjt:  QARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALS

Query:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSS-PDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS
        IQA P N+ TSFKQRAMLVDEIWHA G   SD+DWYV RTILGG+YS TEIYMLTD S  + +DTW FLD+R+KDAFD+KK++QEAKY AE +GAG+G S
Subjt:  IQAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSS-PDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNS

Query:  LQGFVEG
        +QG + G
Subjt:  LQGFVEG

AT1G19150.1 photosystem I light harvesting complex gene 6 | e value: 8.5e-87 | identity: 58.74%
Query:  MALSISSTALSTFPI------RETFHRGHFPG-KFPNYKLRRNYSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG--------------------
        MA +I+S   ST  +        T  R H         +L R       A   VS+VCEPLPPDRPLWFPGSSPPEWLDG                    
Subjt:  MALSISSTALSTFPI------RETFHRGHFPG-KFPNYKLRRNYSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDG--------------------

Query:  -----------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGF
                   RWAMLAV GI++PE  E LG I+NFSWYDAG+REYFAD TTL V Q+ LMGW EGRRWADL+ PGSVD++ K PHK   KPD+GYPGG 
Subjt:  -----------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGF

Query:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSAFSS
        WFD MMWGRGSPEPVMVLRTKEIKNGRLAMLAF+G  FQA YT Q P+ENL AH+ADPGHCN+FSAF+S
Subjt:  WFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCNIFSAFSS

AT3G61470.1 photosystem I light harvesting complex gene 2 | e value: 1.1e-65 | identity: 54.37%
Query:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG
        PDRP+WFPGS+PPEWLDG                               RWAML  AGI +PE+   +G++   SWY AG +EYF D TTL VV+L L+G
Subjt:  PDRPLWFPGSSPPEWLDG-------------------------------RWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFADPTTLLVVQLALMG

Query:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN
        W EGRRWAD++ PGSV+ D   P+ K    D+GYPGG WFDP+ WG GSP  +  LRTKEIKNGRLAMLA +G WFQ IYTG GP++NL AH+ADPGH  
Subjt:  WVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADPGHCN

Query:  IFSAFS
        IF+AF+
Subjt:  IFSAFS
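
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives for a single block from the query line and that middle line. It is an illustration only: `midline_stats` is a hypothetical helper, and the reported % identity values are computed over each whole alignment, so a per-block count will not reproduce them on its own.

```python
def midline_stats(query: str, midline: str):
    """Return (identities, positives, columns) for one alignment block.

    Assumes BLAST-style midline: residue letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces in the midline are often lost in copied text.
    midline = midline.ljust(len(query))
    identities = sum(
        1 for q, m in zip(query, midline) if q != "-" and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit (query line, then middle line).
print(midline_stats("ALISLLT", " LISLLT"))   # prints: (6, 6, 7)

# Final block of the AT1G19140.1 hit.
print(midline_stats("QGFVEG", "QG + G"))     # prints: (3, 4, 6)
```

Summing the three counts over every block of a hit and dividing identities by columns gives a percent identity for that hit, under the assumption that gap columns count toward the alignment length.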


Sequences
CDS sequence
ATGGCCCTCTCCATTTCTTCCACTGCTCTCTCGACCTTCCCAATCAGAGAAACATTTCACAGAGGACATTTTCCGGGAAAATTCCCAAATTATAAGCTCCGGCGAAATTA
CTCCGATTTTAAAGCTGCAAAGTCCGGCGTGTCCACCGTCTGTGAACCGCTCCCTCCCGACAGACCGCTCTGGTTCCCCGGCAGCAGCCCGCCGGAGTGGCTCGACGGCA
GATGGGCAATGCTAGCCGTGGCCGGAATCCTACTGCCGGAATGGTTCGAAAGCTTAGGATTGATACAAAATTTCTCATGGTACGATGCCGGAGCGCGCGAGTATTTTGCA
GACCCGACGACATTGCTGGTGGTGCAATTGGCATTGATGGGGTGGGTGGAGGGGCGGCGGTGGGCAGACTTGGTCAATCCGGGCAGCGTGGATGTTGACCTAAAGCTGCC
CCACAAAAAGAAGGCAAAGCCAGACATGGGCTACCCGGGCGGGTTTTGGTTTGACCCCATGATGTGGGGCAGGGGCTCGCCGGAGCCGGTCATGGTGCTACGGACTAAAG
AGATCAAGAATGGGCGGTTGGCCATGCTGGCGTTTGTCGGATTGTGGTTTCAAGCTATTTATACTGGCCAAGGGCCGCTGGAGAATCTGGCCGCCCACGTGGCTGATCCT
GGCCATTGCAACATCTTTTCGGCATTTAGTTCGGGTTCGGAGCGGCGGACCGACGGAGACATGTATCGAACGGCGGCGAAGCGTCTGATCGTCGGCGTGACCTCCGGTAT
AAATGGCATTCACCGTCTCAGGTTGCCACCACCTTCCACCTTCCTCGACCATTCTTCCTTCTCCACAGCTACCAATTCACAAGAATTTCCTAATTCTCAAATCCCCAATC
AACACCATATTCGTCAAGAAACCCCTACTTCTTCTGATTCTGCATCTTCTTCATCTTCTTCTTGGTCTACATCAACTTCCGGCGAAGAAACCCGAAGCCATGAGAATCGG
CGGCCGAGAGTTGAGTATCAAGAGGAGCAGGCTCGCGTCCTCCAGGCTGCTCTTTCCCATGTGGTGAAGCTAGGATGGACTGAGGCCGCAATGATTGCTGGTGCAAGGGA
TATTGGCATGTCACCTTCCATTGTTGGATCGTTTGCCAGGAAGGAAGCCGAATTAGTTGAGTTTTTCATGGATGATTGCTTACAGCGGCTCATCGATCTAATTGAGGCAG
GAGAGGGCCTTAAGAATTTGATACTTCGTGAACGTATTTACAAGCTTGTTAGGGCTCGTCTAGAATTGCAAGCTCCCTTCATATCCAAATGGGCTCAGGCTCTCAGTATC
CAGGCACAACCAGCAAATCTAGCAACTAGCTTTAAACAACGGGCGATGCTTGTTGATGAGATATGGCATGCTGCTGGTGGCGACACCTCTGACATTGATTGGTACGTCAA
CCGCACTATTTTGGGAGGAATATACTCAGCTACTGAGATATACATGCTCACTGATAGTTCTCCAGATTTTCAGGATACATGGACTTTCTTGGACAACCGTTTGAAAGATG
CTTTCGATATAAAGAAAACCGTCCAAGAGGCAAAGTATCTGGCAGAAGCTGTAGGTGCTGGAATGGGGAACTCCCTTCAGGGATTTGTTGAAGGAGGTTGGGGAAGAACT
AAGAATCCAAAAATGGCTACTGCATCAGAAATTGTAGCGAAATTGAATTTGAAGCCACATCCAGAAGGCGGTTTTTACTCTGAAACCTTCAGAGATCACTCCGTTCATCT
CTCCAAATCTCACCTCCCACCGGAATACAAGGTTGATCGAGAGGTCAGCACTTGTATATACTTTCTGGTGCCTTCTGGATGTGTGTCTGCTCTTCATCGCATTCCATGTG
CAGAGACTTGGCATTTTTACTTGGGGGAACCTCTTACGGTACTGGAGTTGAATGAAAAGGACGGTCGAGTCAAATTGACTTGTCTTGGGTCTGATTTCATTGGAGACAAT
CAATTACCACAGTATACAGTGCCTCCTAATGTCTGGTTTGGTGCTTTCCCAACCAAAGACTTCAATATTTCTGCTGATGGGACTGTGACTAAAGCTGCTCCAAGGGACTC
TGAGAATCACTACTCCCTTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCAAAACGCTCTGATCTTGTTTCACGGTTTCCAGCTAGTGAAG
CTCTCATCTCATTGCTGACACCAGGAACAAATCCTGGTCCTCTTAGGAAGGAAATATTACGAAAGAATTATTCTTCATGGTGGAGTCCTTTTGCTTTTCTGCAGTCACAC
ATGGCAGGTAGCTTCTCAGATTTCTAA
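
A quick consistency check on a listing like this is to translate the CDS and compare the result with the protein sequence given on the same page. The following is a minimal sketch using the standard genetic code (assumed here; the record does not name a translation table). `translate` is an illustrative helper, not a CuGenDB function, and the two inputs are short fragments copied from the start and the last line of the CDS above.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stop codons as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGCCCTCTCCATTTCTTCC"))        # prints: MALSISS
print(translate("ATGGCAGGTAGCTTCTCAGATTTCTAA"))  # prints: MAGSFSDF*
```

Both fragments agree with the start (`MALSISS...`) and end (`...MAGSFSDF`) of the protein sequence listed for this gene, with the trailing `*` corresponding to the `TAA` stop codon.
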
mRNA sequence
TACGATTTAAGAATTACCATCCATTAGGATAGAAATGAAGCTTAAAATCTCTATATCCACAACAAGACATCATGGGCCATCCATTCCCTAACATATATTTTATAGCAATA
TCCACCAATTCTGTTTTACCTTTTTCGAAGCTCTCCGGCCATGGCCCTCTCCATTTCTTCCACTGCTCTCTCGACCTTCCCAATCAGAGAAACATTTCACAGAGGACATT
TTCCGGGAAAATTCCCAAATTATAAGCTCCGGCGAAATTACTCCGATTTTAAAGCTGCAAAGTCCGGCGTGTCCACCGTCTGTGAACCGCTCCCTCCCGACAGACCGCTC
TGGTTCCCCGGCAGCAGCCCGCCGGAGTGGCTCGACGGCAGATGGGCAATGCTAGCCGTGGCCGGAATCCTACTGCCGGAATGGTTCGAAAGCTTAGGATTGATACAAAA
TTTCTCATGGTACGATGCCGGAGCGCGCGAGTATTTTGCAGACCCGACGACATTGCTGGTGGTGCAATTGGCATTGATGGGGTGGGTGGAGGGGCGGCGGTGGGCAGACT
TGGTCAATCCGGGCAGCGTGGATGTTGACCTAAAGCTGCCCCACAAAAAGAAGGCAAAGCCAGACATGGGCTACCCGGGCGGGTTTTGGTTTGACCCCATGATGTGGGGC
AGGGGCTCGCCGGAGCCGGTCATGGTGCTACGGACTAAAGAGATCAAGAATGGGCGGTTGGCCATGCTGGCGTTTGTCGGATTGTGGTTTCAAGCTATTTATACTGGCCA
AGGGCCGCTGGAGAATCTGGCCGCCCACGTGGCTGATCCTGGCCATTGCAACATCTTTTCGGCATTTAGTTCGGGTTCGGAGCGGCGGACCGACGGAGACATGTATCGAA
CGGCGGCGAAGCGTCTGATCGTCGGCGTGACCTCCGGTATAAATGGCATTCACCGTCTCAGGTTGCCACCACCTTCCACCTTCCTCGACCATTCTTCCTTCTCCACAGCT
ACCAATTCACAAGAATTTCCTAATTCTCAAATCCCCAATCAACACCATATTCGTCAAGAAACCCCTACTTCTTCTGATTCTGCATCTTCTTCATCTTCTTCTTGGTCTAC
ATCAACTTCCGGCGAAGAAACCCGAAGCCATGAGAATCGGCGGCCGAGAGTTGAGTATCAAGAGGAGCAGGCTCGCGTCCTCCAGGCTGCTCTTTCCCATGTGGTGAAGC
TAGGATGGACTGAGGCCGCAATGATTGCTGGTGCAAGGGATATTGGCATGTCACCTTCCATTGTTGGATCGTTTGCCAGGAAGGAAGCCGAATTAGTTGAGTTTTTCATG
GATGATTGCTTACAGCGGCTCATCGATCTAATTGAGGCAGGAGAGGGCCTTAAGAATTTGATACTTCGTGAACGTATTTACAAGCTTGTTAGGGCTCGTCTAGAATTGCA
AGCTCCCTTCATATCCAAATGGGCTCAGGCTCTCAGTATCCAGGCACAACCAGCAAATCTAGCAACTAGCTTTAAACAACGGGCGATGCTTGTTGATGAGATATGGCATG
CTGCTGGTGGCGACACCTCTGACATTGATTGGTACGTCAACCGCACTATTTTGGGAGGAATATACTCAGCTACTGAGATATACATGCTCACTGATAGTTCTCCAGATTTT
CAGGATACATGGACTTTCTTGGACAACCGTTTGAAAGATGCTTTCGATATAAAGAAAACCGTCCAAGAGGCAAAGTATCTGGCAGAAGCTGTAGGTGCTGGAATGGGGAA
CTCCCTTCAGGGATTTGTTGAAGGAGGTTGGGGAAGAACTAAGAATCCAAAAATGGCTACTGCATCAGAAATTGTAGCGAAATTGAATTTGAAGCCACATCCAGAAGGCG
GTTTTTACTCTGAAACCTTCAGAGATCACTCCGTTCATCTCTCCAAATCTCACCTCCCACCGGAATACAAGGTTGATCGAGAGGTCAGCACTTGTATATACTTTCTGGTG
CCTTCTGGATGTGTGTCTGCTCTTCATCGCATTCCATGTGCAGAGACTTGGCATTTTTACTTGGGGGAACCTCTTACGGTACTGGAGTTGAATGAAAAGGACGGTCGAGT
CAAATTGACTTGTCTTGGGTCTGATTTCATTGGAGACAATCAATTACCACAGTATACAGTGCCTCCTAATGTCTGGTTTGGTGCTTTCCCAACCAAAGACTTCAATATTT
CTGCTGATGGGACTGTGACTAAAGCTGCTCCAAGGGACTCTGAGAATCACTACTCCCTTGTGGGCTGCAGCTGTGCACCTGCTTTCCAGTTTGAGGACTTTGAGTTGGCA
AAACGCTCTGATCTTGTTTCACGGTTTCCAGCTAGTGAAGCTCTCATCTCATTGCTGACACCAGGAACAAATCCTGGTCCTCTTAGGAAGGAAATATTACGAAAGAATTA
TTCTTCATGGTGGAGTCCTTTTGCTTTTCTGCAGTCACACATGGCAGGTAGCTTCTCAGATTTCTAAAACGATATCTGCACAAGAAAATGGAAAAAACAATGAGGTTAGC
CTATAAAAGTACTTCTTTTTGGATGAAGTTCTGACATTGGAGCTTTGGGGAGGAGGAGCAACTGCAGCAGGAGTGGTAGCCGTTCCCCCTCCACAGACGAAAATCTTCTT
GAGAAGGAGGGATAAAGTTCTTTTGCCAATAACGATTCAAAGCTTAATTGTCTGTTGAGAAATATATTTGATGGACAATGCTTGCTGACTTCAAATTCCAAGGATGGACC
CGATTGCTTAAGGGAAGATGATGGATTCTCTTGTGACCTTTTTCGGTGTGCTTGATTCAGATTCTCATCTCCAAACGTTCCAATCGCCAGCAACACATGAGGCCAGTTGC
TGATTTCTCCTGGAAAATCCGGGTGGCTGCTCTATTGATGGAAGTTCATGCAGGAAGATGGACAAGGAAGAAAAGGACATGAAAGACTGAGCTATCACGCACGGAGAAGA
TTTGAGGAAGAACATCTCTTTTTATCTGAAAGATAAAATAATATATATGGATAGCCAATGGAAAGAGAATCCTCAACAGCAGCCTGTTTCTTTCTCTCTCTCTCTCTAAT
AAATTAGAGGAATCGCCACCATGCAAAGAAATATGCATTACACCTTTTGGACACGTGAAGCTGTCCATGGTGTGAATTCATATGGGCGGTGGAGCTTTAACCAACTCCAT
ATCGAACCATCGAATGAGGTTTGCTTGTTTACAAGAGGAGCAGGTATACCTACGAGACGCTTCGAGTCTCACAGCACATCTCTAAACAGCAGTCGAAACAACCGAAGCAG
GCAAAAGCGGTGCAGAGTTGTGTCAAGACAATCGCACATTGATTATTCATTTTAGTACAG
Protein sequence
MALSISSTALSTFPIRETFHRGHFPGKFPNYKLRRNYSDFKAAKSGVSTVCEPLPPDRPLWFPGSSPPEWLDGRWAMLAVAGILLPEWFESLGLIQNFSWYDAGAREYFA
DPTTLLVVQLALMGWVEGRRWADLVNPGSVDVDLKLPHKKKAKPDMGYPGGFWFDPMMWGRGSPEPVMVLRTKEIKNGRLAMLAFVGLWFQAIYTGQGPLENLAAHVADP
GHCNIFSAFSSGSERRTDGDMYRTAAKRLIVGVTSGINGIHRLRLPPPSTFLDHSSFSTATNSQEFPNSQIPNQHHIRQETPTSSDSASSSSSSWSTSTSGEETRSHENR
RPRVEYQEEQARVLQAALSHVVKLGWTEAAMIAGARDIGMSPSIVGSFARKEAELVEFFMDDCLQRLIDLIEAGEGLKNLILRERIYKLVRARLELQAPFISKWAQALSI
QAQPANLATSFKQRAMLVDEIWHAAGGDTSDIDWYVNRTILGGIYSATEIYMLTDSSPDFQDTWTFLDNRLKDAFDIKKTVQEAKYLAEAVGAGMGNSLQGFVEGGWGRT
KNPKMATASEIVAKLNLKPHPEGGFYSETFRDHSVHLSKSHLPPEYKVDREVSTCIYFLVPSGCVSALHRIPCAETWHFYLGEPLTVLELNEKDGRVKLTCLGSDFIGDN
QLPQYTVPPNVWFGAFPTKDFNISADGTVTKAAPRDSENHYSLVGCSCAPAFQFEDFELAKRSDLVSRFPASEALISLLTPGTNPGPLRKEILRKNYSSWWSPFAFLQSH
MAGSFSDF