CuGenDBv2

Spg028856 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg028856
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Gag-protease polyprotein
Genome location: scaffold7:15799799..15803190
RNA-Seq Expression: Spg028856
Synteny: Spg028856
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0008233 - peptidase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia] | 2.2e-09 | 44.44
Query:  VQAAVAGVIAGQQDQAPQNNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFCCK
        +Q  V   ++ Q  Q  QN  ++S EA+ LRDF+K+DPR FDG S DP +AE WLS +ET+FR+M C +  +    +F  K
Subjt:  VQAAVAGVIAGQQDQAPQNNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFCCK

XP_031739508.1 uncharacterized protein LOC116403159 [Cucumis sativus] | 7.5e-10 | 50
Query:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC
        QQAP    + VV  V +A  A  A    G   Q PQ     LS EA+ LRDFRK+DP+ FDG+ +DP  AELWLSS+ET+F +M C
Subjt:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQN-NEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC

XP_031741726.1 uncharacterized protein LOC116403920 [Cucumis sativus] | 1.7e-09 | 50
Query:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC
        QQAP    + VV  V +A  A  A    G   Q PQ     LS EA+ LRDFRK+DP+ FDG+ +DP  AELWLSS+ET+F +M C
Subjt:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC

XP_031742890.1 uncharacterized protein LOC116404512 [Cucumis sativus] | 1.7e-09 | 50
Query:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC
        QQAP    + VV  V +A  A  A    G   Q PQ     LS EA+ LRDFRK+DP+ FDG+ +DP  AELWLSS+ET+F +M C
Subjt:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC

XP_031743557.1 uncharacterized protein LOC116404620 isoform X1 [Cucumis sativus] | 6.4e-09 | 47.67
Query:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC
        QQAP    + V+  V +A  A       G   Q PQ     LS EA+ LRDFRK+DP+ FDG+ +DP  AELWLSS+ET+F +M C
Subjt:  QQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSC

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TYM4 Ty3-gypsy retrotransposon protein | 5.3e-09 | 38.17
Query:  RQGRERVRRLINCTYWWCHGGRCVPMAYQQAPCHRQ--SAVVGAVQSAVQAAVAGVIAGQQDQA----------PQNNEA------LSREARCLRDFRKW
        RQGR+R R       W   G  C       AP  +   +A+    Q  +QAA+A  +A QQ+QA          P   EA      LS EA+ LRDFRK+
Subjt:  RQGRERVRRLINCTYWWCHGGRCVPMAYQQAPCHRQ--SAVVGAVQSAVQAAVAGVIAGQQDQA----------PQNNEA------LSREARCLRDFRKW

Query:  DPRPFDGASKDPKVAELWLSSIETVFRHMSC
        +P+ FDG+  +P   ++WL+SIET+FR+M C
Subjt:  DPRPFDGASKDPKVAELWLSSIETVFRHMSC

A0A5A7V646 Reverse transcriptase | 9.0e-09 | 45.45
Query:  QSAVQAAVAGVIAGQQDQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRK
        Q  +QAA+A  +A QQ+QA          P   EA      LS EA+ LRDFRK++P+ FDG+  +P  A++WL+SIET+FR+M C K
Subjt:  QSAVQAAVAGVIAGQQDQA----------PQNNEA------LSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRK

A0A5D3CI62 Gag-protease polyprotein | 5.3e-09 | 52.54
Query:  EALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFC
        + LS EA+ LRDFRK++P  FDG+ +DP  A+LWLSS+ET+FR+M C +  +F  P  C
Subjt:  EALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFC

A0A5D3E4I1 Gag protease polyprotein | 1.2e-08 | 50.77
Query:  APQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFC
        APQ  ++ LS EA+ LRDFRK++P  FDG+  DP  A++WLSS+ET+FR+M C +  +F  P  C
Subjt:  APQ-NNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFC

A0A6J1DSJ6 uncharacterized protein LOC111023512 | 1.1e-09 | 44.44
Query:  VQAAVAGVIAGQQDQAPQNNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFCCK
        +Q  V   ++ Q  Q  QN  ++S EA+ LRDF+K+DPR FDG S DP +AE WLS +ET+FR+M C +  +    +F  K
Subjt:  VQAAVAGVIAGQQDQAPQNNEALSREARCLRDFRKWDPRPFDGASKDPKVAELWLSSIETVFRHMSCRKTIRFIVPLFCCK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
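
The % identity values in the hit tables above can be cross-checked against the alignment midlines. The following is a minimal Python sketch, assuming the usual BLAST convention that a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and assuming that % identity is identities divided by alignment length. The function name `percent_identity` is hypothetical and not part of CuGenDB. The midline is copied from the A0A5D3CI62 hit.

```python
def percent_identity(midline: str) -> float:
    """Return identities / alignment length as a percentage, rounded to 2 places.

    Assumes a BLAST-style midline: letters are identical residues,
    '+' is a conservative substitution, ' ' is a mismatch or gap.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the A0A5D3CI62 alignment (59 columns; internal spaces are significant).
midline = "+ LS EA+ LRDFRK++P  FDG+ +DP  A+LWLSS+ET+FR+M C +  +F  P  C"
print(len(midline), percent_identity(midline))
```

For this hit the count is 31 identical positions over 59 columns, which agrees with the listed 52.54. Leading or trailing spaces stripped from a midline would change the column count, so the midline should be sliced to the same width as its Query line before counting.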

Sequences
CDS sequence
ATGCTGACGCCGACCACCATCGACCACTACCGTCTGTCAGCGCTGACCACTGCTTCTCGTGTAGCCTCCATTGTCGCCCTTCTCCGTTGCAGATCCACACGTCGCCGTCG
TCGTTCACCGCTCGCCGTCTGTTCTGCCGTCGCCGCCGTGAGCCCCTTTGCTTCTGTCACAGCGCCGCCACCGTCGACAGCCACCCACGAGCCCACGCCTCCACAAGCCT
CCGCGTTGCCGCTGCCTCAGACCGTCGTTGTCGTCGCGAACAGCCCGGCGTCGCCGTCTCCTCGCTTCATTGTGAGTTTTTGGCTAGTGATTCAGAGAATCATTGGAGGT
CACAATATAATTCGTGGCGATAGGGTCACGTATGGCGTTATGGCCAAGAGGCAAGGGCGAGAACGTGTTAGGCGACTGATCAATTGCACTTACTGGTGGTGCCATGGTGG
ACGGTGTGTTCCCATGGCCTACCAGCAAGCACCTTGTCATCGCCAGTCAGCGGTGGTCGGGGCAGTACAGTCTGCAGTACAAGCGGCAGTTGCAGGCGTGATTGCAGGAC
AGCAGGACCAAGCGCCTCAAAATAATGAAGCACTGTCGCGAGAAGCAAGGTGTTTAAGGGACTTTAGGAAGTGGGACCCCCGTCCATTCGATGGAGCATCGAAGGACCCC
AAAGTGGCGGAGCTGTGGCTGTCTTCCATTGAAACCGTCTTTCGTCACATGAGTTGTCGGAAGACCATAAGGTTTATTGTGCCACTTTTCTGTTGCAAGACAATGCCTTG
A
mRNA sequence
ATGCTGACGCCGACCACCATCGACCACTACCGTCTGTCAGCGCTGACCACTGCTTCTCGTGTAGCCTCCATTGTCGCCCTTCTCCGTTGCAGATCCACACGTCGCCGTCG
TCGTTCACCGCTCGCCGTCTGTTCTGCCGTCGCCGCCGTGAGCCCCTTTGCTTCTGTCACAGCGCCGCCACCGTCGACAGCCACCCACGAGCCCACGCCTCCACAAGCCT
CCGCGTTGCCGCTGCCTCAGACCGTCGTTGTCGTCGCGAACAGCCCGGCGTCGCCGTCTCCTCGCTTCATTGTGAGTTTTTGGCTAGTGATTCAGAGAATCATTGGAGGT
CACAATATAATTCGTGGCGATAGGGTCACGTATGGCGTTATGGCCAAGAGGCAAGGGCGAGAACGTGTTAGGCGACTGATCAATTGCACTTACTGGTGGTGCCATGGTGG
ACGGTGTGTTCCCATGGCCTACCAGCAAGCACCTTGTCATCGCCAGTCAGCGGTGGTCGGGGCAGTACAGTCTGCAGTACAAGCGGCAGTTGCAGGCGTGATTGCAGGAC
AGCAGGACCAAGCGCCTCAAAATAATGAAGCACTGTCGCGAGAAGCAAGGTGTTTAAGGGACTTTAGGAAGTGGGACCCCCGTCCATTCGATGGAGCATCGAAGGACCCC
AAAGTGGCGGAGCTGTGGCTGTCTTCCATTGAAACCGTCTTTCGTCACATGAGTTGTCGGAAGACCATAAGGTTTATTGTGCCACTTTTCTGTTGCAAGACAATGCCTTG
A
Protein sequence
MLTPTTIDHYRLSALTTASRVASIVALLRCRSTRRRRRSPLAVCSAVAAVSPFASVTAPPPSTATHEPTPPQASALPLPQTVVVVANSPASPSPRFIVSFWLVIQRIIGG
HNIIRGDRVTYGVMAKRQGRERVRRLINCTYWWCHGGRCVPMAYQQAPCHRQSAVVGAVQSAVQAAVAGVIAGQQDQAPQNNEALSREARCLRDFRKWDPRPFDGASKDP
KVAELWLSSIETVFRHMSCRKTIRFIVPLFCCKTMP
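
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the Spg028856 CDS shown above.
print(translate("ATGCTGACGCCGACCACCATCGACCACTAC"))
print(translate("ACAATGCCTTGA"))
```

The two fragments translate to `MLTPTTIDHY` and `TMP*`, matching the start and end of the listed protein, with the final `TGA` acting as the stop codon. Passing the full wrapped CDS works the same way, since `translate` removes whitespace before reading codons.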