; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007901 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007901
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon 17.6
Genome locationscaffold5:23297028..23306555
RNA-Seq ExpressionSpg007901
SyntenySpg007901
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022960431.1 uncharacterized protein LOC111461167 [Cucurbita moschata]2.3e-1748.03Show/hide
Query:  FAKSQGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEP
        +   Q   Q    +  Q  PE+SLE+++KEYMA+ D  IQS QAS+R LE+Q+GQLANEL+ RP GKLP+DTE P+REG EQ QA+ LRSGK +  R+E 
Subjt:  FAKSQGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEP

Query:  SK---------TQDLEKNSDKNVVVEK
         K         T D ++  ++ VV E+
Subjt:  SK---------TQDLEKNSDKNVVVEK

XP_030494874.1 uncharacterized protein LOC115710657 [Cannabis sativa]1.3e-1745.14Show/hide
Query:  RELSMSESSHRNVHNGVRLVHQRSSW--EFAKSQGMPQQNKQAL-------------LQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLAN
        +  + +  +  N +N     H   SW  + A S   P Q +QA               Q +  SSLE++M++YMA+ DA IQS  AS+R LELQ+G LAN
Subjt:  RELSMSESSHRNVHNGVRLVHQRSSW--EFAKSQGMPQQNKQAL-------------LQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLAN

Query:  ELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPSK
        ELKARPQG LP+DTE+PRR+GKEQ  A+ LRSGK L+  +E  K
Subjt:  ELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPSK

XP_030498047.1 uncharacterized protein LOC115713707 [Cannabis sativa]5.1e-1757.55Show/hide
Query:  QGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEER------K
        Q  PQQ+ Q   Q +  SSLE++M++YMA+ DA IQS  AS+R LE+Q+GQLAN LK RPQG LP+DTE+PRR+GKE  +A+TLRSGK LE        K
Subjt:  QGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEER------K

Query:  EPSKTQ
        EPS  Q
Subjt:  EPSKTQ

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]3.9e-1744.16Show/hide
Query:  RELSMSESSHRNVHNGVRLVHQRSSW--EFAKSQGMPQQNKQAL---LQQNPE------------SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQL
        +  + + + + N +N     H   SW  + A S G   Q KQ+      Q P             SSLE++M++YMA+ DA IQS  AS+R LE+Q+GQL
Subjt:  RELSMSESSHRNVHNGVRLVHQRSSW--EFAKSQGMPQQNKQAL---LQQNPE------------SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQL

Query:  ANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEER------KEPSKTQ
        AN+LK RPQG LP+DTE+PRR+GKE  +AVTLRSGK +E        KEPS  Q
Subjt:  ANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEER------KEPSKTQ

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]2.3e-1746.84Show/hide
Query:  IGCIRELSMSESSHRNVHNGVRLVHQRSSWEFAKSQ-------GMPQQNKQALLQQNPE-SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELK
        +  I  L+   +S  N H  +    Q +S   A +Q       G  QQ + +   QN + SSLE++M++YMA+ DA IQS  AS+R LELQ+G LANELK
Subjt:  IGCIRELSMSESSHRNVHNGVRLVHQRSSWEFAKSQ-------GMPQQNKQALLQQNPE-SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELK

Query:  ARPQGKLPADTEHPRREGKEQVQAVTLRSGKPL----EERK---EPSKTQDLEKNSDK
        ARPQG LP+DTE+PRR+GKEQ +++ LRSGK L    EE K   EP+  Q+ EK S K
Subjt:  ARPQGKLPADTEHPRREGKEQVQAVTLRSGKPL----EERK---EPSKTQDLEKNSDK

TrEMBL top hitse value%identityAlignment
A0A6J1DW02 uncharacterized protein LOC1110248971.5e-1456.25Show/hide
Query:  PQQNKQALLQ----QNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPSKT
        PQQ      Q    QN  S+LE MMKEYMARTDA IQS  ASMR    Q+G LANELK RPQG  P  TE PRREGKEQ +AVTLRSG   +    P  T
Subjt:  PQQNKQALLQ----QNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPSKT

Query:  QDLEKNSDKNVV
         D++  S K  V
Subjt:  QDLEKNSDKNVV

A0A6J1DWK1 uncharacterized protein LOC1110250531.7e-1349.12Show/hide
Query:  SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGK--PLEERKEPSKTQDLEKNSDKNVVVEKE
        +SLE +MK+YMA  DA +QS  AS+R LELQ+GQLA +LK+RP G LP+DTE P+R+ KEQ  A+TLRSGK  P      P+ T++  +        E++
Subjt:  SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGK--PLEERKEPSKTQDLEKNSDKNVVVEKE

Query:  LEPAWVRVANTPTQ
         EPA V V   P Q
Subjt:  LEPAWVRVANTPTQ

A0A6J1DYG0 uncharacterized protein LOC1110257642.7e-1646.58Show/hide
Query:  HRNVHNGVRLVHQRSSW-----EFAKSQGMPQQNKQALLQ----------------------QNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQ
        + N +N     H   SW         +QG  QQNKQ  +                       QN  S+LE MMKEYMARTDA IQS  ASMR  E Q+GQ
Subjt:  HRNVHNGVRLVHQRSSW-----EFAKSQGMPQQNKQALLQ----------------------QNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQ

Query:  LANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPS
        LANELK RPQG  P  TE P+REGKEQ +AVTLRSG   +E   P+
Subjt:  LANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPS

A0A6J1GJ68 uncharacterized protein LOC1114543444.0e-1545.16Show/hide
Query:  QQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPS---------KTQDLE
        Q    +S+E+++KEYMA+ D  IQS QAS++ LE+Q+GQLA EL+ RP GKLPADTE P+REGKEQ QA+ LRSGK +    E +         +T D +
Subjt:  QQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEPS---------KTQDLE

Query:  KNSDKNVVVEKELEPAWVRVANTP
        + +D+   V+KE    + ++   P
Subjt:  KNSDKNVVVEKELEPAWVRVANTP

A0A6J1H7K8 uncharacterized protein LOC1114611671.1e-1748.03Show/hide
Query:  FAKSQGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEP
        +   Q   Q    +  Q  PE+SLE+++KEYMA+ D  IQS QAS+R LE+Q+GQLANEL+ RP GKLP+DTE P+REG EQ QA+ LRSGK +  R+E 
Subjt:  FAKSQGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEERKEP

Query:  SK---------TQDLEKNSDKNVVVEK
         K         T D ++  ++ VV E+
Subjt:  SK---------TQDLEKNSDKNVVVEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAACACGCCAGAGCCATCATCATCACGCAAGATTACTCGATCCCAGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCAACGGCAAGAGGGGAACCCCGA
AACACCCATGCACGACACGAGAAGAACGAGACCCACGGGTTTTTCATCGCGACGTGCTGCTCTTGAAGAAGAAGGGAATAAGCAAGATGAAGAAGAAGCCGCCAAGGCAG
CAGGAAGCTCTCGGCAAGAAGGAACTTCAACAGGTAAAAATTATGAACCTCAAGCTAACCCTTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTATAGTGCAAGGAAG
AGGAGTCCTAAGAAAGTTGTGCCCGAAAAGCCGCTTGTAATTGAGCCCCTCAAGGTAGCAAGAATGCCACCGGACGTGTTCGAAGGAATAATCCGCCAAGCTGTGGCAAA
GGCTCTCGTAATTGCTGAAGGTTACAGGGTTGAACAAGAAGCCTTGCAGGATATTGAGGTTGAGAGAGAGAAGGAAAATCGACACATGAGGGAAGAAGATGAAGGTGCAA
GAGAAAGAGATCTTGAAGAAGAAAAGAAAAAGGAAGAGGAAAGATTGGAGGCCGAGATGGCCAAATTAGCTGAAGAAGAAGAGAGAAATTGCGCGATCAAAGGTGTTGTG
CTTGCTGTCAGCGCATCCATCTTGGTAGTCAACACATCGATTTTTGCGCTTACTAAAGTCAATGGGTCAACTTCAAAACTTCCAGAACTGGGTTGGCTGTACTATCCCTG
GGTTGAAATTGCAGTTTGCAGAGGCTGCATAATCTCTGATTCCTCTGGCTCTATCATCAGCCATTTGGATGGGATTTACTGGCGGGAGTTAGAGAAAAAGCTCGAAAGAG
ATTGTTCGGCGGTATTGGGCAAGCTACCTAAGGCACCCTATATCGACTATAGGGTTATAGGTTGCATTCGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCAT
AACGGTGTGAGATTGGTGCATCAGCGATCCTCCTGGGAGTTTGCTAAATCACAGGGAATGCCCCAGCAAAATAAGCAAGCATTGCTCCAGCAAAATCCAGAGAGTTCTCT
GGAGGCAATGATGAAAGAATATATGGCTCGTACTGATGCCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGA
AGGCACGACCTCAAGGGAAACTTCCTGCGGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTGCAGGCAGTAACTCTAAGGAGTGGTAAGCCACTAGAAGAGAGA
AAAGAGCCTAGTAAAACCCAAGATTTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGCCTGCATGGGTGAGAGTGGCCAATACGCCGACTCAATA
TGTCTTCCTTTTTGGAGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACATCTAGGAGATGTAGATGAGTGTTGTTCTTTAG
AGGAGCACTGCATGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTGAAGCTGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTG
AAGCTGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTGAAGCTGGTTCAGTGCGGTTCAACCCGTTTTCTCGCCGGTTCAAAGGGTTTGGACGTGGT
TCGAGGCTATTTTGGGCTGGTTCACGGCATGATAGGGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAACACGCCAGAGCCATCATCATCACGCAAGATTACTCGATCCCAGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCAACGGCAAGAGGGGAACCCCGA
AACACCCATGCACGACACGAGAAGAACGAGACCCACGGGTTTTTCATCGCGACGTGCTGCTCTTGAAGAAGAAGGGAATAAGCAAGATGAAGAAGAAGCCGCCAAGGCAG
CAGGAAGCTCTCGGCAAGAAGGAACTTCAACAGGTAAAAATTATGAACCTCAAGCTAACCCTTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTATAGTGCAAGGAAG
AGGAGTCCTAAGAAAGTTGTGCCCGAAAAGCCGCTTGTAATTGAGCCCCTCAAGGTAGCAAGAATGCCACCGGACGTGTTCGAAGGAATAATCCGCCAAGCTGTGGCAAA
GGCTCTCGTAATTGCTGAAGGTTACAGGGTTGAACAAGAAGCCTTGCAGGATATTGAGGTTGAGAGAGAGAAGGAAAATCGACACATGAGGGAAGAAGATGAAGGTGCAA
GAGAAAGAGATCTTGAAGAAGAAAAGAAAAAGGAAGAGGAAAGATTGGAGGCCGAGATGGCCAAATTAGCTGAAGAAGAAGAGAGAAATTGCGCGATCAAAGGTGTTGTG
CTTGCTGTCAGCGCATCCATCTTGGTAGTCAACACATCGATTTTTGCGCTTACTAAAGTCAATGGGTCAACTTCAAAACTTCCAGAACTGGGTTGGCTGTACTATCCCTG
GGTTGAAATTGCAGTTTGCAGAGGCTGCATAATCTCTGATTCCTCTGGCTCTATCATCAGCCATTTGGATGGGATTTACTGGCGGGAGTTAGAGAAAAAGCTCGAAAGAG
ATTGTTCGGCGGTATTGGGCAAGCTACCTAAGGCACCCTATATCGACTATAGGGTTATAGGTTGCATTCGTGAATTATCTATGTCGGAGTCGAGTCACAGAAATGTGCAT
AACGGTGTGAGATTGGTGCATCAGCGATCCTCCTGGGAGTTTGCTAAATCACAGGGAATGCCCCAGCAAAATAAGCAAGCATTGCTCCAGCAAAATCCAGAGAGTTCTCT
GGAGGCAATGATGAAAGAATATATGGCTCGTACTGATGCCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGTCAGCTAGCTAATGAGCTGA
AGGCACGACCTCAAGGGAAACTTCCTGCGGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTGCAGGCAGTAACTCTAAGGAGTGGTAAGCCACTAGAAGAGAGA
AAAGAGCCTAGTAAAACCCAAGATTTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGCCTGCATGGGTGAGAGTGGCCAATACGCCGACTCAATA
TGTCTTCCTTTTTGGAGACAAGACCGAGTGGGAGGCTGGGGACATGACAACACAAGAAGGAATTCACTCCTTCCCACATCTAGGAGATGTAGATGAGTGTTGTTCTTTAG
AGGAGCACTGCATGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTGAAGCTGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTG
AAGCTGGTTCAGTTCGGTTCAAGCAATTTTGAGCTGTTTCAAGGAGTTTTGAAGCTGGTTCAGTGCGGTTCAACCCGTTTTCTCGCCGGTTCAAAGGGTTTGGACGTGGT
TCGAGGCTATTTTGGGCTGGTTCACGGCATGATAGGGCTGTAA
Protein sequenceShow/hide protein sequence
MNNTPEPSSSRKITRSQSNPTAHEAEASVQRQEGNPETPMHDTRRTRPTGFSSRRAALEEEGNKQDEEEAAKAAGSSRQEGTSTGKNYEPQANPSSSCRNKPFVTYSARK
RSPKKVVPEKPLVIEPLKVARMPPDVFEGIIRQAVAKALVIAEGYRVEQEALQDIEVEREKENRHMREEDEGARERDLEEEKKKEEERLEAEMAKLAEEEERNCAIKGVV
LAVSASILVVNTSIFALTKVNGSTSKLPELGWLYYPWVEIAVCRGCIISDSSGSIISHLDGIYWRELEKKLERDCSAVLGKLPKAPYIDYRVIGCIRELSMSESSHRNVH
NGVRLVHQRSSWEFAKSQGMPQQNKQALLQQNPESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQLANELKARPQGKLPADTEHPRREGKEQVQAVTLRSGKPLEER
KEPSKTQDLEKNSDKNVVVEKELEPAWVRVANTPTQYVFLFGDKTEWEAGDMTTQEGIHSFPHLGDVDECCSLEEHCMVQFGSSNFELFQGVLKLVQFGSSNFELFQGVL
KLVQFGSSNFELFQGVLKLVQCGSTRFLAGSKGLDVVRGYFGLVHGMIGL