; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021847 (gene) of Chayote v1 genome

Gene IDSed0021847
OrganismSechium edule (Chayote v1)
DescriptionDNA-directed RNA polymerases
Genome locationLG13:20971484..20974467
RNA-Seq ExpressionSed0021847
SyntenySed0021847
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata]1.0e-4154.55Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT          DR + L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV

Query:  NTILLNNNH
        N +LLNN+H
Subjt:  NTILLNNNH

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata]1.0e-4154.81Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT         DR + L D+VATDVN
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN

Query:  TILLNNNH
         +LLNN+H
Subjt:  TILLNNNH

XP_022982461.1 uncharacterized protein LOC111481277 isoform X1 [Cucurbita maxima]1.8e-4154.07Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+R+LGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT+         DR + L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV

Query:  NTILLNNNH
        N +LLNN+H
Subjt:  NTILLNNNH

XP_022982463.1 uncharacterized protein LOC111481277 isoform X2 [Cucurbita maxima]2.3e-4154.33Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+R+LGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT         DR + L D+VATDVN
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN

Query:  TILLNNNH
         +LLNN+H
Subjt:  TILLNNNH

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo]1.6e-4255.02Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTD-------ILDRITSLHDNVATDV
        ITDEDIRDEF HR              HHVIK+GT+VRF                          LEK SVEGSVT+         DR + L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTD-------ILDRITSLHDNVATDV

Query:  NTILLNNNH
        N +LLNN+H
Subjt:  NTILLNNNH

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y9 uncharacterized protein LOC1110167573.3e-4152.15Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAY+AK  DKS+K+ SGVHPYFGVT+KAKL LFSPK +ML        R      IVLGFASA 
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDILDRITS-------LHDNVATDV
        ITDEDIRDEF HR              HHVIK+GT++RF                          LEK S+EGSVTD   + T        L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDILDRITS-------LHDNVATDV

Query:  NTILLNNNH
        N ++LNN+H
Subjt:  NTILLNNNH

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X15.0e-4254.55Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT          DR + L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV

Query:  NTILLNNNH
        N +LLNN+H
Subjt:  NTILLNNNH

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X25.0e-4254.81Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+RELGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT         DR + L D+VATDVN
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN

Query:  TILLNNNH
         +LLNN+H
Subjt:  TILLNNNH

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X18.6e-4254.07Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+R+LGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT+         DR + L D+VATDV
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL-------DRITSLHDNVATDV

Query:  NTILLNNNH
        N +LLNN+H
Subjt:  NTILLNNNH

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X21.1e-4154.33Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV
        L+ ++H  +SKKVSQAV+R+LGAMLLKFDE+F+GVLLAYEA  IDKS+K+ SGVHPYFGVTIKAKL LFSPK +ML        R      IVLGFASAV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADML-------FRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN
        ITDEDIR+EF HR              HHVIK+GT+VRF                          LEK SVEGSVT         DR + L D+VATDVN
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVRF--------------------------LEKTSVEGSVTDIL------DRITSLHDNVATDVN

Query:  TILLNNNH
         +LLNN+H
Subjt:  TILLNNNH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases4.8e-2140.58Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKAD-------MLFRLNQYMFIVLGFASAV
        L+ FIH  +S+ V Q + REL ++L +++E F GVLLAY+A    K +K+ +G+HPYFGV +  +L LF PK         +         IVLGF++AV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKAD-------MLFRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVR
        ITD DIR+EF +R               H +KLGT++R
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVR

AT1G75670.2 DNA-directed RNA polymerases4.8e-2140.58Show/hide
Query:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKAD-------MLFRLNQYMFIVLGFASAV
        L+ FIH  +S+ V Q + REL ++L +++E F GVLLAY+A    K +K+ +G+HPYFGV +  +L LF PK         +         IVLGF++AV
Subjt:  LLTFIH--RSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKAD-------MLFRLNQYMFIVLGFASAV

Query:  ITDEDIRDEFAHRA-------------HHVIKLGTVVR
        ITD DIR+EF +R               H +KLGT++R
Subjt:  ITDEDIRDEFAHRA-------------HHVIKLGTVVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATATGGTTATTAACGTTCATCCATCGAAGTAAGAAAGTTTCGCAGGCGGTGATTCGAGAGCTCGGCGCTATGCTTCTGAAATTTGATGAAAAATTTCAAGGTGT
GCTTCTGGCTTATGAGGCAAAAAGTATTGATAAGAGTTCAAAGTTAACATCCGGAGTGCATCCCTATTTTGGCGTGACAATAAAGGCAAAGCTATTTCTTTTTTCTCCAA
AAGCAGACATGCTTTTTAGGTTGAATCAATACATGTTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGATGAATTCGCGCATAGAGCGCAC
CATGTGATAAAACTTGGGACAGTTGTACGATTTTTGGAAAAGACTTCAGTTGAAGGTTCAGTGACAGATATTTTGGATCGTATCACATCGTTGCACGATAACGTTGCCAC
TGATGTAAATACTATTCTCTTGAACAATAACCACCATCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATATGGTTATTAACGTTCATCCATCGAAGTAAGAAAGTTTCGCAGGCGGTGATTCGAGAGCTCGGCGCTATGCTTCTGAAATTTGATGAAAAATTTCAAGGTGT
GCTTCTGGCTTATGAGGCAAAAAGTATTGATAAGAGTTCAAAGTTAACATCCGGAGTGCATCCCTATTTTGGCGTGACAATAAAGGCAAAGCTATTTCTTTTTTCTCCAA
AAGCAGACATGCTTTTTAGGTTGAATCAATACATGTTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGATGAATTCGCGCATAGAGCGCAC
CATGTGATAAAACTTGGGACAGTTGTACGATTTTTGGAAAAGACTTCAGTTGAAGGTTCAGTGACAGATATTTTGGATCGTATCACATCGTTGCACGATAACGTTGCCAC
TGATGTAAATACTATTCTCTTGAACAATAACCACCATCAGTGA
Protein sequenceShow/hide protein sequence
MPIWLLTFIHRSKKVSQAVIRELGAMLLKFDEKFQGVLLAYEAKSIDKSSKLTSGVHPYFGVTIKAKLFLFSPKADMLFRLNQYMFIVLGFASAVITDEDIRDEFAHRAH
HVIKLGTVVRFLEKTSVEGSVTDILDRITSLHDNVATDVNTILLNNNHHQ