; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001000 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001000
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionclp protease adapter protein ClpF, chloroplastic
Genome locationscaffold36:761875..764074
RNA-Seq ExpressionMS001000
SyntenyMS001000
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009840 - chloroplastic endopeptidase Clp complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR001943 - UVR domain
IPR011722 - Hemimethylated DNA-binding domain
IPR036623 - Hemimethylated DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598627.1 Clp protease adapter protein ClpF, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]7.2e-16288.17Show/hide
Query:  EMVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFF
        EMVQD+SIHAL ALGDPKVYGSTL WRKNFKQTS P  L S   F HQ +RSFSL SQP R +RGNFKVKAGWLF+GGGQGL ARIERSE ANNDILIFF
Subjt:  EMVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFF

Query:  FQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLA
        FQLDLATRVQYALNIEQYEIAQ+LRTKLTEVEEE+IKQQESKRGLTSKSEVQDKGL+IIRLRADLQ AIESENY  AAQLRD+ISKLETESLAASA VLA
Subjt:  FQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLA

Query:  YENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYG
        YE+AEYSFRLGQK RHKIFGYRGV+CGMDPVCCESSSWMEIAQVEKLSRG NQPFYQVLVDVRT PDLLVAYV EENLL PEEPD ERFDHPYNSFLFYG
Subjt:  YENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYG

Query:  VDAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD
        VD+AGDFIP+KQLREKYNRPRHEVP DPQ DDEQRGDD
Subjt:  VDAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD

KAG7029563.1 Clp protease adapter protein ClpF, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-16288.17Show/hide
Query:  EMVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFF
        EMVQD+SIHAL ALGDPKVYGSTL WRKNFKQTS P  L S   F HQ +RSFSL SQP R +RGNFKVKAGWLF+GGGQGL ARIERSE ANNDILIFF
Subjt:  EMVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFF

Query:  FQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLA
        FQLDLATRVQYALNIEQYEIAQ+LRTKLTEVEEE+IKQQESKRGLTSKSEVQDKGL+IIRLRADLQ AIESENY  AAQLRD+ISKLETESLAASA VLA
Subjt:  FQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLA

Query:  YENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYG
        YE+AEYSFRLGQK RHKIFGYRGV+CGMDPVCCESSSWMEIAQVEKLSRG NQPFYQVLVDVRT PDLLVAYV EENLL PEEPD ERFDHPYNSFLFYG
Subjt:  YENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYG

Query:  VDAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD
        VD+AGDFIP+KQLREKYNRPRHEVP DPQ DDEQRGDD
Subjt:  VDAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD

XP_022132156.1 clp protease adapter protein ClpF, chloroplastic [Momordica charantia]3.0e-18498.81Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ
        MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFK KAGWLFKGGGQGLGARIERSE ANNDILIFFFQ
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ

Query:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE
        LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESK GLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRD+ISKLETESLAASATVLAYE
Subjt:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE

Query:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD
        NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD
Subjt:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD

Query:  AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
        AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
Subjt:  AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD

XP_022962595.1 clp protease adapter protein ClpF, chloroplastic [Cucurbita moschata]5.5e-16288.43Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFF
        MVQD+SIHAL ALGDPKVYGSTL WRKNFKQTS P  L S   F HQ +RSFSL SQP RL+RGNFKVKAGWLF+GGGQGL ARIERSE ANNDILIFFF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFF

Query:  QLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAY
        QLDLATRVQYALNIEQYEIAQ+LRTKLTEVEEE+IKQQESKRGLTSKSEVQDKGL+IIRLRADLQ AIESENY  AAQLRD+ISKLETESLAASA VLAY
Subjt:  QLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAY

Query:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV
        E+AEYSFRLGQK RHKIFGYRGV+CGMDPVCCESSSWMEIAQVEKLSRG NQPFYQVLVDVRT PDLLVAYV EENLL PEEPD ERFDHPYNSFLFYGV
Subjt:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV

Query:  DAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD
        D+AGDFIP+KQLREKYNRPRHEVP DPQ DDEQRGDD
Subjt:  DAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD

XP_038886599.1 clp protease adapter protein ClpF, chloroplastic [Benincasa hispida]1.0e-16389.35Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF
        MVQD+SIHAL ALGDPKVYGS L WR+N   FKQTS  QL S  CF +Q +RSFSLTSQP  L+RGNFKVKAGWLFKGGGQ LG RIERSE ANNDILIF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF

Query:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL
        FFQLDLATRVQYALNIEQYEIAQ+LRTKLTEVE EIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQ A+ESENYALAAQLRDEISKLETESLAASA VL
Subjt:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL

Query:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY
        AYE+AEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRT+PDLLVAYV EENLL PEEPDME FDHPYNSFLFY
Subjt:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY

Query:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
        GVD AGDFIPIKQLREKYNRPR+EVP DPQDDEQRGDD
Subjt:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD

TrEMBL top hitse value%identityAlignment
A0A1S3BCC1 uncharacterized protein LOC1034884811.1e-16087.28Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF
        MVQD+SIHA+ ALGDP VYGS L WR+N   FKQTS  QL S  CF HQ +RSFSLTSQP R +RG+FK++AGWLFKGGGQ  G RIERSE ANNDILIF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF

Query:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL
        FFQLDLATRVQYALNIEQYEIAQ+LR KLTEVE EIIKQQESK+GLTSKSEVQDKGLNIIRLRADLQ AIESENYALAAQLRDEISKLETESLAASA VL
Subjt:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL

Query:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY
        AYE+AEYSFRLGQKVRHKIFGY GVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRT+PDLLVAYV EENLLAPEEPD ERFDHPY+SFLFY
Subjt:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY

Query:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
        GVD+AGDFIPIKQLREKYNRPRHEVP DPQDDEQ+GDD
Subjt:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD

A0A5A7VGB7 UvrB/uvrC motif-containing protein isoform 11.5e-16086.98Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF
        MVQD+SIHA+ ALGDP VYGS L WR+N   FKQTS  QL S  CF HQ +RSFSLTSQP R +RG+FK++AGWLFKGGGQ  G RIERSE ANNDILIF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKN---FKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIF

Query:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL
        FFQLDLATRVQYALNIEQYEIAQ+LR KLTEVE EIIKQQESK+GLTSKSEVQDKGLNIIRLRADLQ AIESENYALAAQLRDEISKLETESLAASA VL
Subjt:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL

Query:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY
        AYE+AEYSFRLGQKVRHKIFGY GVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRT+PDLLVAYV EENLLAPEEPD ERFDHPY+SFLFY
Subjt:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY

Query:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
        GVD+AGDFIPIKQLREKYNRPRHE+P DPQDDEQ+GDD
Subjt:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD

A0A6J1BRG5 clp protease adapter protein ClpF, chloroplastic1.5e-18498.81Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ
        MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFK KAGWLFKGGGQGLGARIERSE ANNDILIFFFQ
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ

Query:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE
        LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESK GLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRD+ISKLETESLAASATVLAYE
Subjt:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE

Query:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD
        NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD
Subjt:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD

Query:  AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
        AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD
Subjt:  AAGDFIPIKQLREKYNRPRHEVPSDPQDDEQRGDD

A0A6J1HFJ3 clp protease adapter protein ClpF, chloroplastic2.7e-16288.43Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFF
        MVQD+SIHAL ALGDPKVYGSTL WRKNFKQTS P  L S   F HQ +RSFSL SQP RL+RGNFKVKAGWLF+GGGQGL ARIERSE ANNDILIFFF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKP-QLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFF

Query:  QLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAY
        QLDLATRVQYALNIEQYEIAQ+LRTKLTEVEEE+IKQQESKRGLTSKSEVQDKGL+IIRLRADLQ AIESENY  AAQLRD+ISKLETESLAASA VLAY
Subjt:  QLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAY

Query:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV
        E+AEYSFRLGQK RHKIFGYRGV+CGMDPVCCESSSWMEIAQVEKLSRG NQPFYQVLVDVRT PDLLVAYV EENLL PEEPD ERFDHPYNSFLFYGV
Subjt:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV

Query:  DAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD
        D+AGDFIP+KQLREKYNRPRHEVP DPQ DDEQRGDD
Subjt:  DAAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD

A0A6J1KAU5 clp protease adapter protein ClpF, chloroplastic isoform X21.2e-15785.42Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ
        MVQD+SIH+L +LGDPKVYGS L WR+NFKQTS   L S   F HQ +RSFSL SQP R +RGNFKVKAGWLF+GGGQGL ARIERSE ANNDILIFFFQ
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQ

Query:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE
        LDLATRVQYALNIEQYEIAQ+LRTKLTEVE E+IKQQESKRGLTSKSEVQDKGL+IIRLRADLQ AIESENY  AAQLRD+ISKLETESLAASA VLAYE
Subjt:  LDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYE

Query:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD
        +AEY+FRLGQK +HKIFGYRGV+CGMDPVCCE+SSWMEIAQVEKLSRGSNQPFYQVLVDVRT PDLLVAYV EENLL PEEPD ERFDHPYNSFLFYGVD
Subjt:  NAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVD

Query:  AAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD
         AGDFIP++QLREKYNRPRHEVP D Q DDEQRGDD
Subjt:  AAGDFIPIKQLREKYNRPRHEVPSDPQ-DDEQRGDD

SwissProt top hitse value%identityAlignment
O94952 F-box only protein 211.3e-0729.31Show/hide
Query:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV
        ++ +  + +G  ++HK +GY  V+ G DP C     W+    V  L  G +QPFY VLV+     D    Y  +ENL    EP  +   HP +   ++  
Subjt:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV

Query:  DAAGDFIPIKQLREKY
             +IP  +L  +Y
Subjt:  DAAGDFIPIKQLREKY

Q5R5S1 F-box only protein 211.3e-0729.31Show/hide
Query:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV
        ++ +  + +G  ++HK +GY  V+ G DP C     W+    V  L  G +QPFY VLV+     D    Y  +ENL    EP  +   HP +   ++  
Subjt:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV

Query:  DAAGDFIPIKQLREKY
             +IP  +L  +Y
Subjt:  DAAGDFIPIKQLREKY

Q67Y99 Clp protease adapter protein ClpF, chloroplastic7.4e-11769.07Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLR-SFSLTSQPNRLRRGN-FKVKAGWLFKGGG-QGLGARIERSETANNDILIF
        MVQ  S+  L   G  KV  S L  R N  + S   L    C   Q+LR S S  S    L++ N  +V+A W F+GGG QGL    ERSE+AN DILIF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLR-SFSLTSQPNRLRRGN-FKVKAGWLFKGGG-QGLGARIERSETANNDILIF

Query:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL
        FFQLDLATRVQYA+N+EQY+IAQQLR KLTEVEEE I+ QE KRG ++KSE QDKG++IIRLRADLQNAI+SE+Y LAA+LRDEISKLE ESLA SA  L
Subjt:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL

Query:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY
        A+E AEY+FRLGQK+RHK FGYR VVCGMDP+C ESSSWME A+VEKL RGSNQPFYQVLVDVRT PDLLVAYV E+NLLAPE+PD ERFDHPY SFL+Y
Subjt:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY

Query:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDE
        G D AGDFIP+KQLREKYNRPRHEVP D QD++
Subjt:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDE

Q8VDH1 F-box only protein 219.6e-0829.31Show/hide
Query:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV
        ++ +  + +G  ++HK +GY  V+ G DP C     W+    V  L  G +QPFY VLV+     D    Y  +ENL    EP  +   HP +   ++  
Subjt:  ENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGV

Query:  DAAGDFIPIKQLREKY
             +IP  +L  +Y
Subjt:  DAAGDFIPIKQLREKY

Arabidopsis top hitse value%identityAlignment
AT2G03390.1 uvrB/uvrC motif-containing protein5.2e-11869.07Show/hide
Query:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLR-SFSLTSQPNRLRRGN-FKVKAGWLFKGGG-QGLGARIERSETANNDILIF
        MVQ  S+  L   G  KV  S L  R N  + S   L    C   Q+LR S S  S    L++ N  +V+A W F+GGG QGL    ERSE+AN DILIF
Subjt:  MVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLR-SFSLTSQPNRLRRGN-FKVKAGWLFKGGG-QGLGARIERSETANNDILIF

Query:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL
        FFQLDLATRVQYA+N+EQY+IAQQLR KLTEVEEE I+ QE KRG ++KSE QDKG++IIRLRADLQNAI+SE+Y LAA+LRDEISKLE ESLA SA  L
Subjt:  FFQLDLATRVQYALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVL

Query:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY
        A+E AEY+FRLGQK+RHK FGYR VVCGMDP+C ESSSWME A+VEKL RGSNQPFYQVLVDVRT PDLLVAYV E+NLLAPE+PD ERFDHPY SFL+Y
Subjt:  AYENAEYSFRLGQKVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFY

Query:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDE
        G D AGDFIP+KQLREKYNRPRHEVP D QD++
Subjt:  GVDAAGDFIPIKQLREKYNRPRHEVPSDPQDDE

AT2G03390.2 uvrB/uvrC motif-containing protein4.3e-9678.64Show/hide
Query:  LNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYENAEYSFRLGQ
        +N+EQY+IAQQLR KLTEVEEE I+ QE KRG ++KSE QDKG++IIRLRADLQNAI+SE+Y LAA+LRDEISKLE ESLA SA  LA+E AEY+FRLGQ
Subjt:  LNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYENAEYSFRLGQ

Query:  KVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVDAAGDFIPIKQ
        K+RHK FGYR VVCGMDP+C ESSSWME A+VEKL RGSNQPFYQVLVDVRT PDLLVAYV E+NLLAPE+PD ERFDHPY SFL+YG D AGDFIP+KQ
Subjt:  KVRHKIFGYRGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVDAAGDFIPIKQ

Query:  LREKYNRPRHEVPSDPQDDE
        LREKYNRPRHEVP D QD++
Subjt:  LREKYNRPRHEVPSDPQDDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAATGGTGCAGGATATGTCAATACACGCTCTGGCAGCCTTGGGAGATCCCAAGGTTTATGGATCAACTCTTCCATGGAGGAAGAATTTCAAGCAGACAAGTAAACCCCA
ATTAAATTCTACACATTGCTTTCGGCATCAATACTTGCGAAGCTTTTCCTTAACTAGCCAACCCAATAGATTGAGACGAGGAAATTTCAAGGTTAAAGCTGGATGGTTGT
TTAAAGGGGGTGGACAAGGGTTGGGTGCTCGCATAGAGCGCAGCGAGACTGCTAACAATGATATTTTAATCTTCTTTTTTCAGCTGGACCTGGCAACACGAGTACAGTAC
GCATTGAACATCGAGCAGTATGAAATTGCACAGCAATTGAGAACAAAGCTAACCGAGGTTGAAGAAGAAATCATCAAGCAGCAAGAATCAAAAAGAGGATTAACTTCGAA
GAGTGAAGTACAAGACAAAGGTTTAAATATCATTCGTCTACGTGCAGACTTGCAGAATGCTATTGAAAGTGAAAATTATGCTCTGGCTGCACAACTGCGAGATGAAATTT
CCAAACTGGAAACTGAGTCTCTAGCTGCGTCTGCAACAGTTCTGGCTTATGAAAATGCTGAATATAGCTTTAGATTGGGACAGAAAGTGAGGCACAAAATTTTTGGATAC
AGGGGTGTGGTTTGTGGAATGGATCCTGTGTGCTGTGAATCAAGTTCCTGGATGGAGATAGCACAAGTTGAAAAATTGTCTCGAGGTTCTAATCAGCCATTTTATCAGGT
TTTAGTGGATGTCCGTACAGAACCTGATTTGCTGGTCGCATATGTTCCCGAGGAAAATCTTCTTGCTCCTGAAGAACCAGATATGGAGAGGTTTGATCATCCTTATAATT
CCTTCTTATTCTATGGAGTGGATGCAGCCGGGGATTTCATTCCTATAAAGCAGCTACGGGAGAAGTATAATCGGCCTCGACATGAAGTTCCCTCCGATCCTCAAGACGAC
GAGCAACGCGGTGATGAT
mRNA sequenceShow/hide mRNA sequence
GAAATGGTGCAGGATATGTCAATACACGCTCTGGCAGCCTTGGGAGATCCCAAGGTTTATGGATCAACTCTTCCATGGAGGAAGAATTTCAAGCAGACAAGTAAACCCCA
ATTAAATTCTACACATTGCTTTCGGCATCAATACTTGCGAAGCTTTTCCTTAACTAGCCAACCCAATAGATTGAGACGAGGAAATTTCAAGGTTAAAGCTGGATGGTTGT
TTAAAGGGGGTGGACAAGGGTTGGGTGCTCGCATAGAGCGCAGCGAGACTGCTAACAATGATATTTTAATCTTCTTTTTTCAGCTGGACCTGGCAACACGAGTACAGTAC
GCATTGAACATCGAGCAGTATGAAATTGCACAGCAATTGAGAACAAAGCTAACCGAGGTTGAAGAAGAAATCATCAAGCAGCAAGAATCAAAAAGAGGATTAACTTCGAA
GAGTGAAGTACAAGACAAAGGTTTAAATATCATTCGTCTACGTGCAGACTTGCAGAATGCTATTGAAAGTGAAAATTATGCTCTGGCTGCACAACTGCGAGATGAAATTT
CCAAACTGGAAACTGAGTCTCTAGCTGCGTCTGCAACAGTTCTGGCTTATGAAAATGCTGAATATAGCTTTAGATTGGGACAGAAAGTGAGGCACAAAATTTTTGGATAC
AGGGGTGTGGTTTGTGGAATGGATCCTGTGTGCTGTGAATCAAGTTCCTGGATGGAGATAGCACAAGTTGAAAAATTGTCTCGAGGTTCTAATCAGCCATTTTATCAGGT
TTTAGTGGATGTCCGTACAGAACCTGATTTGCTGGTCGCATATGTTCCCGAGGAAAATCTTCTTGCTCCTGAAGAACCAGATATGGAGAGGTTTGATCATCCTTATAATT
CCTTCTTATTCTATGGAGTGGATGCAGCCGGGGATTTCATTCCTATAAAGCAGCTACGGGAGAAGTATAATCGGCCTCGACATGAAGTTCCCTCCGATCCTCAAGACGAC
GAGCAACGCGGTGATGAT
Protein sequenceShow/hide protein sequence
EMVQDMSIHALAALGDPKVYGSTLPWRKNFKQTSKPQLNSTHCFRHQYLRSFSLTSQPNRLRRGNFKVKAGWLFKGGGQGLGARIERSETANNDILIFFFQLDLATRVQY
ALNIEQYEIAQQLRTKLTEVEEEIIKQQESKRGLTSKSEVQDKGLNIIRLRADLQNAIESENYALAAQLRDEISKLETESLAASATVLAYENAEYSFRLGQKVRHKIFGY
RGVVCGMDPVCCESSSWMEIAQVEKLSRGSNQPFYQVLVDVRTEPDLLVAYVPEENLLAPEEPDMERFDHPYNSFLFYGVDAAGDFIPIKQLREKYNRPRHEVPSDPQDD
EQRGDD