; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037227 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037227
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionEnzymatic polyprotein
Genome locationscaffold7:37004084..37010268
RNA-Seq ExpressionSpg037227
SyntenySpg037227
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025901.1 hypothetical protein E6C27_scaffold34G001890 [Cucumis melo var. makuwa]7.9e-1452.69Show/hide
Query:  QNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAALAQTTSDADFQRILQM---AASSASPASSAQGEDEDPI----TIWTPFLDSRPM
        Q+AT+  +KEWF  NGYLQDID  +  +FLN+KSKLLA LAQ T+DADFQR+L     A++S+   SS   EDE+ +     +  PFLDS+PM
Subjt:  QNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAALAQTTSDADFQRILQM---AASSASPASSAQGEDEDPI----TIWTPFLDSRPM

KAA0034823.1 hypothetical protein E6C27_scaffold213G00570 [Cucumis melo var. makuwa]3.6e-1458.7Show/hide
Query:  QNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAALAQTTSDADFQRILQMA----ASSASPASSAQGEDEDPITIWTPF--LDSRPM
        Q+AT+  +KEWFA N YLQDID ++  EFLNDKSKLL ALAQ T+DADFQR+L  A    +SS SP+SS Q EDE+ I        LDS+PM
Subjt:  QNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAALAQTTSDADFQRILQMA----ASSASPASSAQGEDEDPITIWTPF--LDSRPM

KAA0059031.1 polyprotein [Cucumis melo var. makuwa]3.8e-1648.85Show/hide
Query:  YASNGVFYSTSTPSANLVRPSGGVVQLRTSCLSLFSPGSSIPSSSTYSAAVTPEKRF--VPRPEIKGYFQ----------KSQNATLNKVKEWFAANGYL
        + S+GV  +   PSANL+R SG  VQ+R    SL S   S   S+TYS  VTP+K +     P+ K   +            Q+AT+  +KEWFA NGYL
Subjt:  YASNGVFYSTSTPSANLVRPSGGVVQLRTSCLSLFSPGSSIPSSSTYSAAVTPEKRF--VPRPEIKGYFQ----------KSQNATLNKVKEWFAANGYL

Query:  QDIDIKRTAEFLNDKSKLLAALAQTTSDADF
        QDID ++  EFLNDKSKLLAALAQ T+DADF
Subjt:  QDIDIKRTAEFLNDKSKLLAALAQTTSDADF

TYJ98361.1 hypothetical protein E5676_scaffold232G00950 [Cucumis melo var. makuwa]6.1e-2249.67Show/hide
Query:  PSANLVRPSGGVVQLRTSCLSLFSPGSSIPSS---STYSAAVTPEKRFVPRPEIKGYFQKSQNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAA
        PS NL+ P G V+Q+R+S     SP S+  SS   +TYS AVTP+K+FVPR EIK YFQK        ++  +  NG LQDID ++ A+FLNDK K LAA
Subjt:  PSANLVRPSGGVVQLRTSCLSLFSPGSSIPSS---STYSAAVTPEKRFVPRPEIKGYFQKSQNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAA

Query:  LAQTTSDADFQRILQMA---ASSASPASSAQGEDEDPITIW----TPFLDSRP
        L Q T DADFQR+L  A   +SS+ PA S   EDE+ + +      PFLDS+P
Subjt:  LAQTTSDADFQRILQMA---ASSASPASSAQGEDEDPITIW----TPFLDSRP

XP_022933039.1 uncharacterized protein LOC111439730 [Cucurbita moschata]1.4e-1333.95Show/hide
Query:  VQRKMQRDDKSRLRDMKAFCSQYGCERLEEPNRS-ANKMIRRRNSRRQYNPNPRFKKRQSRDQ----PRQRQAAFKTKG---KTFCFKCRKEGHYANRCP
        +Q K+     +R +++  FC QYGC+ +E P+ S  NK+         Y P   ++ +  + Q     R++    KT G   K  CFKCR+EGHYAN+CP
Subjt:  VQRKMQRDDKSRLRDMKAFCSQYGCERLEEPNRS-ANKMIRRRNSRRQYNPNPRFKKRQSRDQ----PRQRQAAFKTKG---KTFCFKCRKEGHYANRCP

Query:  -NLRINELDIEEE---------YSSIYDSSDQELNEFQFNDTDSEVSSDSQSKESEGVKACKKDCEGQCINVLTLEQEVLFNSIDHIEDKEMKKTILKTL
           +INELDI++E          +    SS+ E+ E Q  ++DS  S++ +S E EG    K+ CEG CINVLT +QE+L   ++ ++D E+++ I + L
Subjt:  -NLRINELDIEEE---------YSSIYDSSDQELNEFQFNDTDSEVSSDSQSKESEGVKACKKDCEGQCINVLTLEQEVLFNSIDHIEDKEMKKTILKTL

Query:  KQSLSSQRTSELREK
        + +++  +  E  E+
Subjt:  KQSLSSQRTSELREK

TrEMBL top hitse value%identityAlignment
A0A5A7U8D3 Enzymatic polyprotein7.3e-1335.1Show/hide
Query:  DKSRLRDMKAFCSQYGCER--LEEPNRS---ANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGKTFCFKCRKEGHYANRCP-NLRINELDIE
        D    +++  FC QYG  +   EE N+    ++K + RRN  +   P  R K   ++ + ++R   + +K  T CFKC ++GHYANRCP   RI  L I+
Subjt:  DKSRLRDMKAFCSQYGCER--LEEPNRS---ANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGKTFCFKCRKEGHYANRCP-NLRINELDIE

Query:  EE--------------YSSIYDSSDQE--LNEFQFNDTDSEVS--SDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTILKT
        EE               SS  +SS +E  +N  +  ++ SE    S S+S + EG   C   C G+C   INV+T +QE LF+ I+ I D++ K+T L  
Subjt:  EE--------------YSSIYDSSDQE--LNEFQFNDTDSEVS--SDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTILKT

Query:  LKQSLSSQ
        L+QSL  Q
Subjt:  LKQSLSSQ

A0A5A7VRE0 Reverse transcriptase4.3e-1333.2Show/hide
Query:  DKSRLRDMKAFCSQYGCERLEEPNRSANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGK--------TFCFKCRKEGHYANRCP-NLRINEL
        D    +++  FC QYG    + P     K  +R +SR+ +      K +     PR+R+  +K KGK        T CFKC ++GHYANRCP   +IN L
Subjt:  DKSRLRDMKAFCSQYGCERLEEPNRSANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGK--------TFCFKCRKEGHYANRCP-NLRINEL

Query:  DIEE--------------EYSSIYDSSDQE--LNEFQFNDT--DSEVSSDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTI
         ++E              E SS  +SS +E  +N  Q  ++  + E  S S S + EG   C   C G+C   INV+T +QE LF+ I+ I D+  K+T 
Subjt:  DIEE--------------EYSSIYDSSDQE--LNEFQFNDT--DSEVSSDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTI

Query:  LKTLKQSLSSQRTSEL--------------REKQEAEPPLQ
        L  LKQSL  Q   +               R K EA+ P+Q
Subjt:  LKTLKQSLSSQRTSEL--------------REKQEAEPPLQ

A0A5D3BI61 Uncharacterized protein2.9e-2249.67Show/hide
Query:  PSANLVRPSGGVVQLRTSCLSLFSPGSSIPSS---STYSAAVTPEKRFVPRPEIKGYFQKSQNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAA
        PS NL+ P G V+Q+R+S     SP S+  SS   +TYS AVTP+K+FVPR EIK YFQK        ++  +  NG LQDID ++ A+FLNDK K LAA
Subjt:  PSANLVRPSGGVVQLRTSCLSLFSPGSSIPSS---STYSAAVTPEKRFVPRPEIKGYFQKSQNATLNKVKEWFAANGYLQDIDIKRTAEFLNDKSKLLAA

Query:  LAQTTSDADFQRILQMA---ASSASPASSAQGEDEDPITIW----TPFLDSRP
        L Q T DADFQR+L  A   +SS+ PA S   EDE+ + +      PFLDS+P
Subjt:  LAQTTSDADFQRILQMA---ASSASPASSAQGEDEDPITIW----TPFLDSRP

A0A5D3DBS1 Reverse transcriptase4.3e-1333.2Show/hide
Query:  DKSRLRDMKAFCSQYGCERLEEPNRSANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGK--------TFCFKCRKEGHYANRCP-NLRINEL
        D    +++  FC QYG    + P     K  +R +SR+ +      K +     PR+R+  +K KGK        T CFKC ++GHYANRCP   +IN L
Subjt:  DKSRLRDMKAFCSQYGCERLEEPNRSANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGK--------TFCFKCRKEGHYANRCP-NLRINEL

Query:  DIEE--------------EYSSIYDSSDQE--LNEFQFNDT--DSEVSSDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTI
         ++E              E SS  +SS +E  +N  Q  ++  + E  S S S + EG   C   C G+C   INV+T +QE LF+ I+ I D+  K+T 
Subjt:  DIEE--------------EYSSIYDSSDQE--LNEFQFNDT--DSEVSSDSQSKESEGVKACKKDCEGQC---INVLTLEQEVLFNSIDHIEDKEMKKTI

Query:  LKTLKQSLSSQRTSEL--------------REKQEAEPPLQ
        L  LKQSL  Q   +               R K EA+ P+Q
Subjt:  LKTLKQSLSSQRTSEL--------------REKQEAEPPLQ

A0A6J1EYM2 uncharacterized protein LOC1114397306.6e-1433.95Show/hide
Query:  VQRKMQRDDKSRLRDMKAFCSQYGCERLEEPNRS-ANKMIRRRNSRRQYNPNPRFKKRQSRDQ----PRQRQAAFKTKG---KTFCFKCRKEGHYANRCP
        +Q K+     +R +++  FC QYGC+ +E P+ S  NK+         Y P   ++ +  + Q     R++    KT G   K  CFKCR+EGHYAN+CP
Subjt:  VQRKMQRDDKSRLRDMKAFCSQYGCERLEEPNRS-ANKMIRRRNSRRQYNPNPRFKKRQSRDQ----PRQRQAAFKTKG---KTFCFKCRKEGHYANRCP

Query:  -NLRINELDIEEE---------YSSIYDSSDQELNEFQFNDTDSEVSSDSQSKESEGVKACKKDCEGQCINVLTLEQEVLFNSIDHIEDKEMKKTILKTL
           +INELDI++E          +    SS+ E+ E Q  ++DS  S++ +S E EG    K+ CEG CINVLT +QE+L   ++ ++D E+++ I + L
Subjt:  -NLRINELDIEEE---------YSSIYDSSDQELNEFQFNDTDSEVSSDSQSKESEGVKACKKDCEGQCINVLTLEQEVLFNSIDHIEDKEMKKTILKTL

Query:  KQSLSSQRTSELREK
        + +++  +  E  E+
Subjt:  KQSLSSQRTSELREK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTCTGTTCAGCGCAAAATGCAAAGGGATGATAAGAGCAGGCTCAGAGATATGAAAGCCTTTTGCTCTCAATATGGTTGTGAGCGTTTGGAGGAACCAAACCGCTC
TGCTAACAAAATGATCCGACGGCGTAATAGCCGAAGGCAATACAATCCAAATCCTCGATTCAAGAAAAGACAATCTCGGGATCAACCCAGACAAAGGCAAGCTGCCTTTA
AGACAAAGGGAAAAACCTTTTGTTTTAAATGTCGAAAGGAAGGACACTATGCAAACAGATGTCCCAACCTTAGAATCAATGAGCTGGATATAGAAGAAGAGTATTCCTCT
ATCTATGATAGCTCTGATCAAGAATTGAATGAATTTCAATTCAACGACACAGATTCAGAGGTCTCTAGCGATAGCCAGAGCAAAGAATCTGAAGGCGTTAAAGCTTGCAA
AAAGGACTGTGAAGGACAATGCATAAATGTCCTTACACTGGAACAAGAAGTTCTCTTTAACTCTATAGACCATATAGAGGATAAAGAAATGAAGAAGACAATTCTGAAAA
CCCTCAAGCAGTCTTTGTCTTCTCAAAGAACTTCTGAGTTAAGAGAAAAGcaagaggcagagcctcctcttcaaggggccttcgcccatcttcttctcctctgcacctcc
tcccatgagtgcagatcaatatgcaatggaattagggtttactccggtaacccgttccagatcaaggcaaggcggtatacgccctatgcctccaatggagtcttctactc
cacctccacgccttcggccaacctcgtccgtccttcaggcggagtcgttcaactgagaacctcctgtctctccctcttcagtccgggatcatcaatcccatcctcttcaa
cctactctgcggcggtaacaccggaaaagaggtttgtacctcgcccagagatcaaaggttattttcaaaaatcccagaacgcaaccctcaacaaagttaaagagtggttt
gcagccaatggatatcttcaggatatcgacatcaagaggactgcagaatttctcaatgacaaatccaagctcctagcagctctagcacagaccacatctgatgcagattt
tcaaagaattctccagatggcggcttcaagcgcgtccccagcttcttctgctcaaggagaagacgaagatccgattacgatctggaccccgttcctcgactcacgaccca
tgtga
mRNA sequenceShow/hide mRNA sequence
ATGTGGTCTGTTCAGCGCAAAATGCAAAGGGATGATAAGAGCAGGCTCAGAGATATGAAAGCCTTTTGCTCTCAATATGGTTGTGAGCGTTTGGAGGAACCAAACCGCTC
TGCTAACAAAATGATCCGACGGCGTAATAGCCGAAGGCAATACAATCCAAATCCTCGATTCAAGAAAAGACAATCTCGGGATCAACCCAGACAAAGGCAAGCTGCCTTTA
AGACAAAGGGAAAAACCTTTTGTTTTAAATGTCGAAAGGAAGGACACTATGCAAACAGATGTCCCAACCTTAGAATCAATGAGCTGGATATAGAAGAAGAGTATTCCTCT
ATCTATGATAGCTCTGATCAAGAATTGAATGAATTTCAATTCAACGACACAGATTCAGAGGTCTCTAGCGATAGCCAGAGCAAAGAATCTGAAGGCGTTAAAGCTTGCAA
AAAGGACTGTGAAGGACAATGCATAAATGTCCTTACACTGGAACAAGAAGTTCTCTTTAACTCTATAGACCATATAGAGGATAAAGAAATGAAGAAGACAATTCTGAAAA
CCCTCAAGCAGTCTTTGTCTTCTCAAAGAACTTCTGAGTTAAGAGAAAAGcaagaggcagagcctcctcttcaaggggccttcgcccatcttcttctcctctgcacctcc
tcccatgagtgcagatcaatatgcaatggaattagggtttactccggtaacccgttccagatcaaggcaaggcggtatacgccctatgcctccaatggagtcttctactc
cacctccacgccttcggccaacctcgtccgtccttcaggcggagtcgttcaactgagaacctcctgtctctccctcttcagtccgggatcatcaatcccatcctcttcaa
cctactctgcggcggtaacaccggaaaagaggtttgtacctcgcccagagatcaaaggttattttcaaaaatcccagaacgcaaccctcaacaaagttaaagagtggttt
gcagccaatggatatcttcaggatatcgacatcaagaggactgcagaatttctcaatgacaaatccaagctcctagcagctctagcacagaccacatctgatgcagattt
tcaaagaattctccagatggcggcttcaagcgcgtccccagcttcttctgctcaaggagaagacgaagatccgattacgatctggaccccgttcctcgactcacgaccca
tgtga
Protein sequenceShow/hide protein sequence
MWSVQRKMQRDDKSRLRDMKAFCSQYGCERLEEPNRSANKMIRRRNSRRQYNPNPRFKKRQSRDQPRQRQAAFKTKGKTFCFKCRKEGHYANRCPNLRINELDIEEEYSS
IYDSSDQELNEFQFNDTDSEVSSDSQSKESEGVKACKKDCEGQCINVLTLEQEVLFNSIDHIEDKEMKKTILKTLKQSLSSQRTSELREKQEAEPPLQGAFAHLLLLCTS
SHECRSICNGIRVYSGNPFQIKARRYTPYASNGVFYSTSTPSANLVRPSGGVVQLRTSCLSLFSPGSSIPSSSTYSAAVTPEKRFVPRPEIKGYFQKSQNATLNKVKEWF
AANGYLQDIDIKRTAEFLNDKSKLLAALAQTTSDADFQRILQMAASSASPASSAQGEDEDPITIWTPFLDSRPM