; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh13G003100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh13G003100
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationCmo_Chr13:3338036..3342265
RNA-Seq ExpressionCmoCh13G003100
SyntenyCmoCh13G003100
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR038662 - F1F0 ATP synthase subunit C superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK02449.1 F15O4.13 [Cucumis melo var. makuwa]2.0e-16173.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

TYK11702.1 F15O4.13 [Cucumis melo var. makuwa]2.0e-16173.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]2.0e-16173.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

TYK26105.1 F15O4.13 [Cucumis melo var. makuwa]2.0e-16173.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata]1.7e-17182.44Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIVVQERQVDGRWRGRNNHATIMQPRRMERVHEDRDGGVKLKIPPFCGT
        MSHNGEAVPPVPDLNTTILQAIQGMM+MMMEDRQERRAQQQRQERT                             PRRMERVHEDRDGGVKLKIPPFC T
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIVVQERQVDGRWRGRNNHATIMQPRRMERVHEDRDGGVKLKIPPFCGT

Query:  ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTKSVAEYYQ
        ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHA  WYQHLKSERRRKEE+P+ETWEELKE MRKRY PKHYERNLKTKLQGLRQGTKSVAEYYQ
Subjt:  ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTKSVAEYYQ

Query:  EMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSPKGKFVTA
        EMETMM+RA VREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDM HYALKIEDQLKEEKEHSKRYTSRT+ FSNS TWNKDSF+N+N+SMSPK KFV A
Subjt:  EMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSPKGKFVTA

Query:  KRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEIAAYDDEYVEEGNSISLITRKEL
        KRVEAES N K NEASKTV+EKSSSIQCWKCKGFGHMSKECVN+KVMVIRNGILDSDDE             +YDDEYVEEGNSISLITR+ L
Subjt:  KRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEIAAYDDEYVEEGNSISLITRKEL

TrEMBL top hitse value%identityAlignment
A0A5D3BWE8 F15O4.139.9e-16273.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

A0A5D3C3D3 F15O4.139.9e-16273.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

A0A5D3CK70 F15O4.139.9e-16273.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

A0A5D3DRJ1 F15O4.139.9e-16273.57Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK
        MSH+GE VP V D N  ILQAIQGM+++M E+RQERRAQQQR+ R  QEDEG+      ERQ+ G    R RGRNN A +MQPRRMERVHEDRDGGVKLK
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIV---VQERQVDG----RWRGRNNHATIMQPRRMERVHEDRDGGVKLK

Query:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK
        IPPF GTADSE YLQWERKIEHVFDCNT+SENKKM+LAIAEFTN+AS+WY +LKSERRRKEE+PIETWEELKE MRKR+ PKHYER+LKTKLQ LRQGTK
Subjt:  IPPFCGTADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTK

Query:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP
        SVAEYY+EMET++ RA+++E+EEDTMSRFLGGLN+EIAHLVDRNPP  +EDMYHYA+KIE QLKEEKE SKRY S+T  FS+SNTWNKD F+N+N+SM  
Subjt:  SVAEYYQEMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSP

Query:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE
        +GKFV A++VE ES N K  EASK V+EK+SSIQCWKCKGFGHMSK+C+N++VMV+RNG++DS+DECE++D+QL EE     DDEY+EEG+SISLITR+ 
Subjt:  KGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEI-AAYDDEYVEEGNSISLITRKE

Query:  L
        L
Subjt:  L

A0A6J1EVI6 LOW QUALITY PROTEIN: uncharacterized protein LOC1114384598.1e-17282.44Show/hide
Query:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIVVQERQVDGRWRGRNNHATIMQPRRMERVHEDRDGGVKLKIPPFCGT
        MSHNGEAVPPVPDLNTTILQAIQGMM+MMMEDRQERRAQQQRQERT                             PRRMERVHEDRDGGVKLKIPPFC T
Subjt:  MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIVVQERQVDGRWRGRNNHATIMQPRRMERVHEDRDGGVKLKIPPFCGT

Query:  ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTKSVAEYYQ
        ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHA  WYQHLKSERRRKEE+P+ETWEELKE MRKRY PKHYERNLKTKLQGLRQGTKSVAEYYQ
Subjt:  ADSEAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTKSVAEYYQ

Query:  EMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSPKGKFVTA
        EMETMM+RA VREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDM HYALKIEDQLKEEKEHSKRYTSRT+ FSNS TWNKDSF+N+N+SMSPK KFV A
Subjt:  EMETMMKRAKVREEEEDTMSRFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSPKGKFVTA

Query:  KRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEIAAYDDEYVEEGNSISLITRKEL
        KRVEAES N K NEASKTV+EKSSSIQCWKCKGFGHMSKECVN+KVMVIRNGILDSDDE             +YDDEYVEEGNSISLITR+ L
Subjt:  KRVEAESFNGKMNEASKTVKEKSSSIQCWKCKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEIAAYDDEYVEEGNSISLITRKEL

SwissProt top hitse value%identityAlignment
A4QJA2 ATP synthase subunit c, chloroplastic3.9e-0693.94Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPLIS+ASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG

A8SE66 ATP synthase subunit c, chloroplastic1.8e-0696.97Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPLISSASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG

Q3V547 ATP synthase subunit c, chloroplastic3.9e-0693.94Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPLIS+ASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG

Q4FGF2 ATP synthase subunit c, chloroplastic3.9e-0693.94Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPLIS+ASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG

Q85FN2 ATP synthase subunit c, chloroplastic3.9e-0693.94Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPLIS+ASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG

Arabidopsis top hitse value%identityAlignment
AT4G07350.1 unknown protein9.9e-0528.05Show/hide
Query:  EAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKL
        E YL+WE+ ++  F    +    +   A++  T  A  W+     +R   +E PI  W +LKE++R +YA +   R+ K  +
Subjt:  EAYLQWERKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKL

ATCG00140.1 ATP synthase subunit C family protein3.6e-0790.91Show/hide
Query:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG
        MNPL+S+ASVIAAGLA GLASIGPGVGQGTAAG
Subjt:  MNPLISSASVIAAGLAGGLASIGPGVGQGTAAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACATAATGGAGAAGCAGTACCACCAGTTCCAGATCTTAATACAACCATTCTTCAAGCAATTCAAGGTATGATGAAAATGATGATGGAAGATAGACAAGAAAGAAG
GGCGCAACAACAAAGACAAGAACGAACATTACAAGAAGATGAAGGTATAGTAGTACAAGAAAGACAAGTTGATGGTAGATGGAGAGGAAGAAATAACCATGCAACTATTA
TGCAACCTAGAAGGATGGAAAGAGTACATGAAGATAGAGATGGGGGAGTTAAACTCAAAATCCCGCCCTTTTGTGGAACGGCAGATTCTGAGGCATACTTGCAGTGGGAA
AGAAAGATAGAGCATGTGTTTGATTGCAACACCTATAGTGAAAATAAGAAGATGAGACTTGCTATTGCCGAATTTACCAATCATGCTAGTGATTGGTACCAACATCTCAA
ATCCGAGAGAAGAAGAAAAGAGGAGAATCCAATAGAGACATGGGAAGAACTTAAAGAAGTCATGAGAAAAAGGTATGCTCCAAAACATTATGAAAGAAATTTGAAAACTA
AATTGCAAGGTTTGAGGCAAGGAACAAAAAGTGTGGCGGAATATTATCAAGAGATGGAAACTATGATGAAAAGAGCAAAGGTTAGAGAAGAAGAAGAAGATACCATGTCT
AGATTCCTTGGAGGTTTGAATCGAGAAATTGCTCATCTTGTTGACAGAAATCCACCACCATATCTGGAAGATATGTATCATTATGCTCTCAAAATTGAAGACCAATTGAA
GGAAGAAAAAGAGCATTCAAAAAGGTACACATCACGAACTAGCATCTTTTCAAATTCTAATACTTGGAACAAGGATAGTTTTATGAATAAAAATGATTCGATGTCACCAA
AAGGAAAGTTTGTGACTGCTAAAAGAGTGGAGGCTGAGAGTTTCAATGGTAAAATGAATGAAGCTTCAAAGACGGTAAAAGAGAAGTCTAGTTCTATTCAATGTTGGAAG
TGCAAAGGATTTGGACACATGAGCAAAGAGTGTGTTAATGAAAAAGTTATGGTGATAAGGAATGGTATACTTGATTCAGATGATGAATGTGAGGATCATGATTCACAGCT
TGTGGAAGAAATCGCCGCATATGATGATGAGTATGTTGAAGAAGGTAATTCCATATCTTTGATCACAAGAAAGGAACTTATTATGAATCCACTGATTTCTTCCGCTTCCG
TTATTGCTGCTGGATTGGCCGGCGGGCTTGCTTCTATTGGACCTGGGGTTGGTCAAGGTACTGCTGCAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACATAATGGAGAAGCAGTACCACCAGTTCCAGATCTTAATACAACCATTCTTCAAGCAATTCAAGGTATGATGAAAATGATGATGGAAGATAGACAAGAAAGAAG
GGCGCAACAACAAAGACAAGAACGAACATTACAAGAAGATGAAGGTATAGTAGTACAAGAAAGACAAGTTGATGGTAGATGGAGAGGAAGAAATAACCATGCAACTATTA
TGCAACCTAGAAGGATGGAAAGAGTACATGAAGATAGAGATGGGGGAGTTAAACTCAAAATCCCGCCCTTTTGTGGAACGGCAGATTCTGAGGCATACTTGCAGTGGGAA
AGAAAGATAGAGCATGTGTTTGATTGCAACACCTATAGTGAAAATAAGAAGATGAGACTTGCTATTGCCGAATTTACCAATCATGCTAGTGATTGGTACCAACATCTCAA
ATCCGAGAGAAGAAGAAAAGAGGAGAATCCAATAGAGACATGGGAAGAACTTAAAGAAGTCATGAGAAAAAGGTATGCTCCAAAACATTATGAAAGAAATTTGAAAACTA
AATTGCAAGGTTTGAGGCAAGGAACAAAAAGTGTGGCGGAATATTATCAAGAGATGGAAACTATGATGAAAAGAGCAAAGGTTAGAGAAGAAGAAGAAGATACCATGTCT
AGATTCCTTGGAGGTTTGAATCGAGAAATTGCTCATCTTGTTGACAGAAATCCACCACCATATCTGGAAGATATGTATCATTATGCTCTCAAAATTGAAGACCAATTGAA
GGAAGAAAAAGAGCATTCAAAAAGGTACACATCACGAACTAGCATCTTTTCAAATTCTAATACTTGGAACAAGGATAGTTTTATGAATAAAAATGATTCGATGTCACCAA
AAGGAAAGTTTGTGACTGCTAAAAGAGTGGAGGCTGAGAGTTTCAATGGTAAAATGAATGAAGCTTCAAAGACGGTAAAAGAGAAGTCTAGTTCTATTCAATGTTGGAAG
TGCAAAGGATTTGGACACATGAGCAAAGAGTGTGTTAATGAAAAAGTTATGGTGATAAGGAATGGTATACTTGATTCAGATGATGAATGTGAGGATCATGATTCACAGCT
TGTGGAAGAAATCGCCGCATATGATGATGAGTATGTTGAAGAAGGTAATTCCATATCTTTGATCACAAGAAAGGAACTTATTATGAATCCACTGATTTCTTCCGCTTCCG
TTATTGCTGCTGGATTGGCCGGCGGGCTTGCTTCTATTGGACCTGGGGTTGGTCAAGGTACTGCTGCAGGCTAA
Protein sequenceShow/hide protein sequence
MSHNGEAVPPVPDLNTTILQAIQGMMKMMMEDRQERRAQQQRQERTLQEDEGIVVQERQVDGRWRGRNNHATIMQPRRMERVHEDRDGGVKLKIPPFCGTADSEAYLQWE
RKIEHVFDCNTYSENKKMRLAIAEFTNHASDWYQHLKSERRRKEENPIETWEELKEVMRKRYAPKHYERNLKTKLQGLRQGTKSVAEYYQEMETMMKRAKVREEEEDTMS
RFLGGLNREIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRYTSRTSIFSNSNTWNKDSFMNKNDSMSPKGKFVTAKRVEAESFNGKMNEASKTVKEKSSSIQCWK
CKGFGHMSKECVNEKVMVIRNGILDSDDECEDHDSQLVEEIAAYDDEYVEEGNSISLITRKELIMNPLISSASVIAAGLAGGLASIGPGVGQGTAAG