CuGenDBv2

Spg019197 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019197
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Genome location: scaffold1:47535321..47547371
RNA-Seq Expression: Spg019197
Synteny: Spg019197
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] | e value: 3.8e-15 | % identity: 53.33
Query:  NQESNLEALMKEYMARND--------------VAVRNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPV
        N  ++LE + KEYMARND                +RNLEVQ+GQ A ++K RPQG+ P  TE+  R+G EQCKAVTLRSGL Y+GPK PV
Subjt:  NQESNLEALMKEYMARND--------------VAVRNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPV

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | e value: 3.4e-16 | % identity: 43.54
Query:  EQPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEV
        +Q  +  Q   ++N  SNLE +MKEYMAR D  +       RN   Q+G +A E+KNRPQG+ P  TE+P REGKEQCKAVTLRSGL YDGP  P     
Subjt:  EQPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEV

Query:  RKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESSSS
         +IP           +K T K+  N      T E E +++ N+ +SS
Subjt:  RKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESSSS

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] | e value: 5.9e-16 | % identity: 42.67
Query:  VEQPQKNF----QPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP
        +  PQ+ +    Q   V+N  SNLE +MKEYMAR D  +       RN E Q+GQ+A E+KNRPQG+ P  TE+P REGKEQCKAVTLRSGL YD P  P
Subjt:  VEQPQKNF----QPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP

Query:  VNQEVRKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESS
              +IP         +      K    KGNE+       + EK++ +
Subjt:  VNQEVRKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESS

XP_030502440.1 uncharacterized protein LOC115717596 [Cannabis sativa] | e value: 5.0e-15 | % identity: 56.47
Query:  QPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSG
        +PQ+  QP    +Q S+LE+LM++YMA+NDV +       RNLE+Q+GQ+A ++KNRPQGTLP+ TE P R+G+E CKAVTLRSG
Subjt:  QPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSG

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa] | e value: 8.5e-15 | % identity: 47.9
Query:  QPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEVR
        +PQ+  QP    +Q S+LE+LM++YMA+ND  +       RNLEVQ+GQ+A ++KNRPQGTLP+ TE P R+GKE CKAVTLRSG   +     V  +  
Subjt:  QPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEVR

Query:  KIPQEAQEKFSEKKSKITS
        K P   Q++   KK   TS
Subjt:  KIPQEAQEKFSEKKSKITS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DAE9 uncharacterized protein LOC111018514 | e value: 1.8e-15 | % identity: 53.33
Query:  NQESNLEALMKEYMARND--------------VAVRNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPV
        N  ++LE + KEYMARND                +RNLEVQ+GQ A ++K RPQG+ P  TE+  R+G EQCKAVTLRSGL Y+GPK PV
Subjt:  NQESNLEALMKEYMARND--------------VAVRNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPV

A0A6J1DW02 uncharacterized protein LOC111024897 | e value: 1.7e-16 | % identity: 43.54
Query:  EQPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEV
        +Q  +  Q   ++N  SNLE +MKEYMAR D  +       RN   Q+G +A E+KNRPQG+ P  TE+P REGKEQCKAVTLRSGL YDGP  P     
Subjt:  EQPQKNFQPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEV

Query:  RKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESSSS
         +IP           +K T K+  N      T E E +++ N+ +SS
Subjt:  RKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESSSS

A0A6J1DYG0 uncharacterized protein LOC111025764 | e value: 2.8e-16 | % identity: 42.67
Query:  VEQPQKNF----QPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP
        +  PQ+ +    Q   V+N  SNLE +MKEYMAR D  +       RN E Q+GQ+A E+KNRPQG+ P  TE+P REGKEQCKAVTLRSGL YD P  P
Subjt:  VEQPQKNF----QPIQVKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP

Query:  VNQEVRKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESS
              +IP         +      K    KGNE+       + EK++ +
Subjt:  VNQEVRKIPQEAQEKFSEKKSKITSKVIHNKGNEEKTKEHEALKEKNESS

A0A6J1E110 uncharacterized protein LOC111025424 | e value: 7.0e-15 | % identity: 55.95
Query:  VKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP
        V+N  SNLE  MKEYMAR D  +       RN E Q+G +A  +KNRPQG+    TE+P  EGKE CKAVTLRSGL YD P  P
Subjt:  VKNQESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYP

A0A6J1H7K8 uncharacterized protein LOC111461167 | e value: 7.7e-14 | % identity: 43.44
Query:  ESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEVRKIPQEAQEKFSEKK
        E++LE+L+KEYMA+NDV +       RNLEVQ+GQ+A E++NRP G LP+ TE+P REG EQC+A+ LRSG E    +  + +      QE  +    K+
Subjt:  ESNLEALMKEYMARNDVAV-------RNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEVRKIPQEAQEKFSEKK

Query:  SKITSKVIHNKGNEEK--TKEH
          +  +  HNK + E    KEH
Subjt:  SKITSKVIHNKGNEEK--TKEH

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCATTGGAGGCTGGAAATCGCAGATCTTGCTATACAGAAGTTGCTCGCCTAGGACTTGCGGCGCTAGCTCTTGCAATTAGCGGAGGGAGTCGACACGCAGGATCGAG
TAGCAGCGGCGCGAGGTTGCTCAGGCGGCAGGACGATGCGAAGTTTAGAGCGTGGGAAGGAGAAAACATCGAGAGAGAGAGAGAGAGAGCCGGGGAGTTCTCAAGGCTGG
CGGTCCTCCGAAGGGGTGGTTGCTCCAGGGCAAAACGCAGGCGAATCATAAGACTCTCACGGTTAATTGAGCTTAGGTCTTGTCGAGAGTCAATCACGCTAATTGGTGTT
AATTCGGGCCAATTACGGAGTTTTTGGAGCCATCTCGATGTCTTGGACGGAGAGGAGGTGAAGAGACCCAATTGTGCTGAAGCGGGCCTGAATTATGACACCGAAAGCCA
AAGATCGAAGCCCCGAACCGAGAAGAAGCGAGACAAGATCGAGAAAAGAATCAGGAAGTCGAGATTAGAGGCTGGCGTCTCGACGTCGACGCGATTTCGGCCGAAGACAA
AAGTTCGCAGCGTCGAGACGCTGTGCCCTGGCGTCTCGACGCCGACGTATTTTCTATATATATTAGGTCTCGGTTTAGTCTGCCAATTCACGATTAGCGAGTTAATTAGC
ACAGGTATTCTTGGTCAGTTTCCAGGGCCAGACCGCATGAACCGACCCGATCCGCATCGAACCGACCTCCAAAAACCCGAATCAAGCCTAAACCGGGTTGGACTAGGCCA
AACCGACCCCAGTCCAAACCAGACTGCCGAACCGGACCACGAAACTGTTGCAAACCACTCATCATGTAGCCTTAATTCACGCTGCATGAAAACAGAGGAAAGTTGGAATT
TGCCCAGAAATGCGACCGCATTTCTGGAAAAACAGAGGGTTATAGATTGCATTCGTGAATTATCTATGTCGGAGTCGACTCACATAAATGTGCATAACGTCTATACTTGC
ACTAAGGTTGAGCAGCCGCAAAAGAACTTCCAACCAATCCAAGTGAAGAATCAAGAGTCAAATCTAGAGGCTCTTATGAAAGAGTACATGGCAAGAAACGACGTCGCTGT
CAGAAATTTGGAAGTACAAATTGGACAGATTGCTCAAGAAATAAAAAATAGACCACAAGGGACGCTGCCCAACAAAACTGAGATCCCTCACCGAGAAGGAAAAGAGCAGT
GCAAGGCAGTCACCTTGAGAAGTGGACTAGAGTATGATGGTCCGAAATACCCCGTGAATCAAGAAGTAAGGAAAATCCCACAAGAAGCTCAAGAAAAGTTTTCAGAAAAG
AAGTCAAAAATTACCTCAAAAGTTATACATAATAAAGGCAATGAAGAAAAGACCAAGGAGCATGAAGCCTTGAAAGAAAAAAATGAAAGTTCAAGTTCAGCGAAAAAATA
G
mRNA sequence
ATGCCATTGGAGGCTGGAAATCGCAGATCTTGCTATACAGAAGTTGCTCGCCTAGGACTTGCGGCGCTAGCTCTTGCAATTAGCGGAGGGAGTCGACACGCAGGATCGAG
TAGCAGCGGCGCGAGGTTGCTCAGGCGGCAGGACGATGCGAAGTTTAGAGCGTGGGAAGGAGAAAACATCGAGAGAGAGAGAGAGAGAGCCGGGGAGTTCTCAAGGCTGG
CGGTCCTCCGAAGGGGTGGTTGCTCCAGGGCAAAACGCAGGCGAATCATAAGACTCTCACGGTTAATTGAGCTTAGGTCTTGTCGAGAGTCAATCACGCTAATTGGTGTT
AATTCGGGCCAATTACGGAGTTTTTGGAGCCATCTCGATGTCTTGGACGGAGAGGAGGTGAAGAGACCCAATTGTGCTGAAGCGGGCCTGAATTATGACACCGAAAGCCA
AAGATCGAAGCCCCGAACCGAGAAGAAGCGAGACAAGATCGAGAAAAGAATCAGGAAGTCGAGATTAGAGGCTGGCGTCTCGACGTCGACGCGATTTCGGCCGAAGACAA
AAGTTCGCAGCGTCGAGACGCTGTGCCCTGGCGTCTCGACGCCGACGTATTTTCTATATATATTAGGTCTCGGTTTAGTCTGCCAATTCACGATTAGCGAGTTAATTAGC
ACAGGTATTCTTGGTCAGTTTCCAGGGCCAGACCGCATGAACCGACCCGATCCGCATCGAACCGACCTCCAAAAACCCGAATCAAGCCTAAACCGGGTTGGACTAGGCCA
AACCGACCCCAGTCCAAACCAGACTGCCGAACCGGACCACGAAACTGTTGCAAACCACTCATCATGTAGCCTTAATTCACGCTGCATGAAAACAGAGGAAAGTTGGAATT
TGCCCAGAAATGCGACCGCATTTCTGGAAAAACAGAGGGTTATAGATTGCATTCGTGAATTATCTATGTCGGAGTCGACTCACATAAATGTGCATAACGTCTATACTTGC
ACTAAGGTTGAGCAGCCGCAAAAGAACTTCCAACCAATCCAAGTGAAGAATCAAGAGTCAAATCTAGAGGCTCTTATGAAAGAGTACATGGCAAGAAACGACGTCGCTGT
CAGAAATTTGGAAGTACAAATTGGACAGATTGCTCAAGAAATAAAAAATAGACCACAAGGGACGCTGCCCAACAAAACTGAGATCCCTCACCGAGAAGGAAAAGAGCAGT
GCAAGGCAGTCACCTTGAGAAGTGGACTAGAGTATGATGGTCCGAAATACCCCGTGAATCAAGAAGTAAGGAAAATCCCACAAGAAGCTCAAGAAAAGTTTTCAGAAAAG
AAGTCAAAAATTACCTCAAAAGTTATACATAATAAAGGCAATGAAGAAAAGACCAAGGAGCATGAAGCCTTGAAAGAAAAAAATGAAAGTTCAAGTTCAGCGAAAAAATA
G
Protein sequence
MPLEAGNRRSCYTEVARLGLAALALAISGGSRHAGSSSSGARLLRRQDDAKFRAWEGENIERERERAGEFSRLAVLRRGGCSRAKRRRIIRLSRLIELRSCRESITLIGV
NSGQLRSFWSHLDVLDGEEVKRPNCAEAGLNYDTESQRSKPRTEKKRDKIEKRIRKSRLEAGVSTSTRFRPKTKVRSVETLCPGVSTPTYFLYILGLGLVCQFTISELIS
TGILGQFPGPDRMNRPDPHRTDLQKPESSLNRVGLGQTDPSPNQTAEPDHETVANHSSCSLNSRCMKTEESWNLPRNATAFLEKQRVIDCIRELSMSESTHINVHNVYTC
TKVEQPQKNFQPIQVKNQESNLEALMKEYMARNDVAVRNLEVQIGQIAQEIKNRPQGTLPNKTEIPHREGKEQCKAVTLRSGLEYDGPKYPVNQEVRKIPQEAQEKFSEK
KSKITSKVIHNKGNEEKTKEHEALKEKNESSSSAKK
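
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene; the function name `translate` and the short test fragments are illustrative and not part of the database record. Only the first and last codons of the CDS are used here, but the full sequence could be passed in the same way.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
cds_start = "ATGCCATTGGAGGCTGGAAATCGCAGA"
# Last ten codons of the CDS listed above, including the TAG stop codon.
cds_end = "AATGAAAGTTCAAGTTCAGCGAAAAAATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the beginning and the end of the protein sequence listed above if the record is internally consistent.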