; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G4801 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G4801
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionMitochondrial transcription termination factor family protein isoform 2
Genome locationctg1227:3851825..3866660
RNA-Seq ExpressionCucsat.G4801
SyntenyCucsat.G4801
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0042646 - plastid nucleoid (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138649.1 uncharacterized protein LOC101218603 isoform X2 [Cucumis sativus]1.70e-15699.54Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQN YPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

XP_008441218.1 PREDICTED: uncharacterized protein LOC103485418 [Cucumis melo]3.15e-15297.22Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS H AKHTIKGVRRQN YPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

XP_011649898.1 uncharacterized protein LOC101218603 isoform X1 [Cucumis sativus]4.83e-15999.54Show/hide
Query:  MNGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSV
        MNGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQN YPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSV
Subjt:  MNGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSV

Query:  EEQDKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY
        EEQDKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY
Subjt:  EEQDKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY

Query:  YVDCKGDCIAKCTRCWVRF
        YVDCKGDCIAKCTRCWVRF
Subjt:  YVDCKGDCIAKCTRCWVRF

XP_023549996.1 uncharacterized protein LOC111808316 [Cucurbita pepo subsp. pepo]5.69e-13085.19Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPIST+DS TR C ST CT TA S+AVCS+VSFS   AK+ I  VRRQ+ YPRKWEI S+ QVESL+LSDEDKKTWEACRQALS FSFS EEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDR+K+VPNI+ VN+ILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLL+KEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

XP_038885056.1 uncharacterized protein LOC120075592 [Benincasa hispida]9.59e-13890.28Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPISTIDSATRFCCSTR  ATAISDA  S+VSFS H AKH IK VRRQ  YP KWEICS+TQVESL LSDED+KTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAF HIHSPYWGEDR+K+VPNIE VNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

TrEMBL top hitse value%identityAlignment
A0A0A0LQN8 Uncharacterized protein8.21e-15799.54Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQN YPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

A0A1S3B2X0 uncharacterized protein LOC1034854181.53e-15297.22Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS H AKHTIKGVRRQN YPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

A0A5A7T0X2 Mitochondrial transcription termination factor family protein isoform 23.33e-13096.89Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCS+VSFS H AKHTIKGVRRQN YPRKWE+CSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPK
        DKMLGKAFGHIHSPYWGEDR+KKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPK
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPK

A0A6J1FK31 uncharacterized protein LOC1114449359.20e-12984.26Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPIST+DS TR C ST C  TA S+AVCS+VSFS   AK+ I  VRRQ+ YPRKWEI S+ QVESL+LSDEDKKTWEACRQALS+FSFS EEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDR+K+VPNI+ VN+ILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLL+KEWGIQGKSLR+LLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

A0A6J1JTP6 uncharacterized protein LOC1114896509.20e-12984.26Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ
        MLGKSL SPIST+DS TR C ST C  TA S AVCS+VSFS   AK+ I  VRRQ+ YPRKWEI S++QVESL+LSD+DKKTWEACRQALS+FSFS EEQ
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQ

Query:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD
        DKMLGKAFGHIHSPYWGEDR+K+VPNI+ VN+ILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLL+KEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVD

Query:  CKGDCIAKCTRCWVRF
        CKGDCIAKCTRCWVRF
Subjt:  CKGDCIAKCTRCWVRF

SwissProt top hitse value%identityAlignment
Q6AUK6 Transcription termination factor MTERF4, chloroplastic9.4e-0430.43Show/hide
Query:  EIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
        +I+   +E+L  +GL    ++++++K P VLG  LE ++K N++ L  E+G++ ++L  ++ + P +LG
Subjt:  EIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG

Q9ZT96 Transcription termination factor MTERF4, chloroplastic1.1e-0430.88Show/hide
Query:  IVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
        I+  ++EYL  LG+     ++L++K P +LG  L+  +K NVQ+L +++ ++  SL +++ + P+++G
Subjt:  IVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG

Arabidopsis top hitse value%identityAlignment
AT4G02990.1 Mitochondrial transcription termination factor family protein7.9e-0630.88Show/hide
Query:  IVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
        I+  ++EYL  LG+     ++L++K P +LG  L+  +K NVQ+L +++ ++  SL +++ + P+++G
Subjt:  IVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG

AT4G09620.1 Mitochondrial transcription termination factor family protein4.7e-6757.27Show/hide
Query:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAV--CSHVSFSCHSAKHTIKGVRRQNSYPR-KWEI-CSSTQVESLILSDEDKKTWEACRQALSVFSFS
        M+G SLASP++T+ SA       +C   +  D V   + +    +S+ H + G  R +SY R KW + CS+TQVE+   S+ED   WE C++ALS F FS
Subjt:  MLGKSLASPISTIDSATRFCCSTRCTATAISDAV--CSHVSFSCHSAKHTIKGVRRQNSYPR-KWEI-CSSTQVESLILSDEDKKTWEACRQALSVFSFS

Query:  VEEQDKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG
        VEE+DK+LGKAFGHIHSPYW E+R K+ P +E +N ILE+LR+LGLS++DL K++KKFPEVLGC+LE+E+K N+ +L+ +WGI GK LRNLLLRNPKVLG
Subjt:  VEEQDKMLGKAFGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLG

Query:  YYVDCKGDCIAKCTRCWVRF
        Y VDCKGDC+A+CTRCWVRF
Subjt:  YYVDCKGDCIAKCTRCWVRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGATGCTAGGAAAGTCGTTGGCGTCCCCCATATCAACCATTGATTCTGCTACTCGTTTCTGCTGTTCTACTCGCTGCACTGCCACAGCCATATCAGATGCTGT
ATGTTCACATGTGAGTTTCTCCTGTCATTCTGCAAAGCATACGATAAAGGGTGTGAGAAGACAAAATTCCTATCCTAGGAAGTGGGAAATCTGTTCATCTACTCAAGTTG
AAAGCTTAATATTAAGTGATGAAGATAAGAAGACATGGGAGGCTTGTCGGCAAGCTCTGTCTGTGTTCAGCTTCAGTGTTGAGGAGCAAGATAAGATGCTTGGAAAGGCG
TTCGGCCACATTCATTCACCCTACTGGGGTGAAGACAGAGAGAAGAAAGTTCCTAATATTGAAATTGTAAATGATATACTGGAATATCTGAGGACGCTTGGCCTTTCTAA
TGACGATCTCTCTAAGCTGCTAAAGAAATTCCCGGAAGTTCTTGGCTGCAATCTTGAGCAGGAGCTGAAAACCAACGTACAATTGTTGGACAAAGAGTGGGGAATTCAAG
GCAAATCACTAAGGAATCTTCTTCTGCGTAATCCCAAGGTATTGGGTTATTATGTTGATTGTAAAGGAGACTGCATAGCAAAATGCACCAGATGCTGGGTTCGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGGGATGCTAGGAAAGTCGTTGGCGTCCCCCATATCAACCATTGATTCTGCTACTCGTTTCTGCTGTTCTACTCGCTGCACTGCCACAGCCATATCAGATGCTGT
ATGTTCACATGTGAGTTTCTCCTGTCATTCTGCAAAGCATACGATAAAGGGTGTGAGAAGACAAAATTCCTATCCTAGGAAGTGGGAAATCTGTTCATCTACTCAAGTTG
AAAGCTTAATATTAAGTGATGAAGATAAGAAGACATGGGAGGCTTGTCGGCAAGCTCTGTCTGTGTTCAGCTTCAGTGTTGAGGAGCAAGATAAGATGCTTGGAAAGGCG
TTCGGCCACATTCATTCACCCTACTGGGGTGAAGACAGAGAGAAGAAAGTTCCTAATATTGAAATTGTAAATGATATACTGGAATATCTGAGGACGCTTGGCCTTTCTAA
TGACGATCTCTCTAAGCTGCTAAAGAAATTCCCGGAAGTTCTTGGCTGCAATCTTGAGCAGGAGCTGAAAACCAACGTACAATTGTTGGACAAAGAGTGGGGAATTCAAG
GCAAATCACTAAGGAATCTTCTTCTGCGTAATCCCAAGGTATTGGGTTATTATGTTGATTGTAAAGGAGACTGCATAGCAAAATGCACCAGATGCTGGGTTCGATTCTAG
Protein sequenceShow/hide protein sequence
MNGMLGKSLASPISTIDSATRFCCSTRCTATAISDAVCSHVSFSCHSAKHTIKGVRRQNSYPRKWEICSSTQVESLILSDEDKKTWEACRQALSVFSFSVEEQDKMLGKA
FGHIHSPYWGEDREKKVPNIEIVNDILEYLRTLGLSNDDLSKLLKKFPEVLGCNLEQELKTNVQLLDKEWGIQGKSLRNLLLRNPKVLGYYVDCKGDCIAKCTRCWVRF