; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032457 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032457
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein FAM91A1
Genome locationscaffold2:30634389..30643884
RNA-Seq ExpressionSpg032457
SyntenySpg032457
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74312.1 hypothetical protein VITISV_037520 [Vitis vinifera]4.5e-2132.97Show/hide
Query:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDEN-DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQ
        +WN  FRR L + EI     L+  +  V L   + D   W L   G++S KSFF +++  S  +    +  +W  K P KVK L W + +  +NT+D LQ
Subjt:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDEN-DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQ

Query:  RKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHAYQLTGGAQVRNWEI
         +    +L P  C +C +  ES+DHLFLHC      W+ + NL+G+ +  P+ IE+ L+     L ++ +L  G Q R +++
Subjt:  RKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHAYQLTGGAQVRNWEI

ONI36148.1 hypothetical protein PRUPE_1G572100 [Prunus persica]2.2e-2036.94Show/hide
Query:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR
        W+  FRR L ERE +  + L++ ++ + L   + D   W LE  G+++ KSF   + N       P  SL+W+ KSP KVKV +W +    +NT D++QR
Subjt:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR

Query:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL
        K     LSP  C +C   EES+DHLFLHC F+ + W  +   +G  +  PK   ++L
Subjt:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL

RVX22527.1 hypothetical protein CK203_012735 [Vitis vinifera]2.9e-2033.53Show/hide
Query:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR
        +WN  FRR L + EI     L+  +  V     +D   W L   G +S KSFFL+++  S  I    +  +W  K P KVK+L W + Y  +NT+D L+ 
Subjt:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR

Query:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHA
        +    +L P  C +C    ES+DHLFLHC      W+ + NL+G+ +  P+ +E+ L+     L ++
Subjt:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHA

TYK07901.1 DUF21 domain-containing protein [Cucumis melo var. makuwa]1.0e-2041.94Show/hide
Query:  EISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCR
        E+ +   +++ ++  +  D+ D + W L   G+++ KS FL +T  SP I  P    IW+ K  K+VK  +WS+ YRSLNT   +Q+ F+   LSPS+C 
Subjt:  EISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCR

Query:  MCIKAEESLDHLFLHCQFAGAAWN
        +C K EE+LDHLFLHC F   AWN
Subjt:  MCIKAEESLDHLFLHCQFAGAAWN

TYK31299.1 protein FAM91A1 [Cucumis melo var. makuwa]3.2e-2749.61Show/hide
Query:  DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGA
        D + WKLE  G +STK  F  MT  + K N  T +LIW+ K  KKVK  +WS+ YRSLN  + LQRKF + SLSPS+C +C+K  E+ DHLFLHC FA  
Subjt:  DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGA

Query:  AWNFVANLLGISFCKPKKIEEWLLEGL
         WN + +L  +  C PKKI++ + +GL
Subjt:  AWNFVANLLGISFCKPKKIEEWLLEGL

TrEMBL top hitse value%identityAlignment
A0A251RJG1 zf-RVT domain-containing protein1.1e-2036.94Show/hide
Query:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR
        W+  FRR L ERE +  + L++ ++ + L   + D   W LE  G+++ KSF   + N       P  SL+W+ KSP KVKV +W +    +NT D++QR
Subjt:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR

Query:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL
        K     LSP  C +C   EES+DHLFLHC F+ + W  +   +G  +  PK   ++L
Subjt:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL

A0A5D3C7R2 DUF21 domain-containing protein4.9e-2141.94Show/hide
Query:  EISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCR
        E+ +   +++ ++  +  D+ D + W L   G+++ KS FL +T  SP I  P    IW+ K  K+VK  +WS+ YRSLNT   +Q+ F+   LSPS+C 
Subjt:  EISSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCR

Query:  MCIKAEESLDHLFLHCQFAGAAWN
        +C K EE+LDHLFLHC F   AWN
Subjt:  MCIKAEESLDHLFLHCQFAGAAWN

A0A5D3E632 Protein FAM91A11.6e-2749.61Show/hide
Query:  DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGA
        D + WKLE  G +STK  F  MT  + K N  T +LIW+ K  KKVK  +WS+ YRSLN  + LQRKF + SLSPS+C +C+K  E+ DHLFLHC FA  
Subjt:  DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGA

Query:  AWNFVANLLGISFCKPKKIEEWLLEGL
         WN + +L  +  C PKKI++ + +GL
Subjt:  AWNFVANLLGISFCKPKKIEEWLLEGL

A5B978 Reverse transcriptase domain-containing protein2.2e-2132.97Show/hide
Query:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDEN-DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQ
        +WN  FRR L + EI     L+  +  V L   + D   W L   G++S KSFF +++  S  +    +  +W  K P KVK L W + +  +NT+D LQ
Subjt:  TWNLAFRRGLFEREISSWLVLVDKIKDVSLIDEN-DLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQ

Query:  RKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHAYQLTGGAQVRNWEI
         +    +L P  C +C +  ES+DHLFLHC      W+ + NL+G+ +  P+ IE+ L+     L ++ +L  G Q R +++
Subjt:  RKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHAYQLTGGAQVRNWEI

M5XJT6 zf-RVT domain-containing protein (Fragment)1.1e-2036.94Show/hide
Query:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR
        W+  FRR L ERE +  + L++ ++ + L   + D   W LE  G+++ KSF   + N       P  SL+W+ KSP KVKV +W +    +NT D++QR
Subjt:  WNLAFRRGLFEREISSWLVLVDKIKDVSL-IDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQR

Query:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL
        K     LSP  C +C   EES+DHLFLHC F+ + W  +   +G  +  PK   ++L
Subjt:  KFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAWNFVANLLGISFCKPKKIEEWL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25270.1 Ribonuclease H-like superfamily protein4.1e-0431.82Show/hide
Query:  IWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAW
        IW+ K+  K+K  +W +L  +L T D L+R+      +   C  C + +E+  HLF  C +A   W
Subjt:  IWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHLFLHCQFAGAAW

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-0430.48Show/hide
Query:  ENDLICWKLEGFGA---YSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLS-PSVCRMCIKAEESLDHLFLH
        E+DL  WK+ G  A   +S+ + ++ +     K++      IW      K   + W  +   L T D    K  SW L  PS+C +C   +E+  HLF  
Subjt:  ENDLICWKLEGFGA---YSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLS-PSVCRMCIKAEESLDHLFLH

Query:  CQFAG
        C FAG
Subjt:  CQFAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGATTCTTCCTCGGCTGGAAAAGAGACGGAACCTACCACCGTCCTCTCACCACGATCAACAACCGTATGCTTGTTGTCGGTTGAACAAGATACGAAAGTATTGAA
AAATGATGTGGGCGAGATTAAGAAAATTTTGGAGATGATTTGCGAAAAAATGGGCGGCAGATCGGACCAGCAAGTTTTTGAACCGAGAGCTCACAAAAATATGGAGAGGA
AACATCTAGATCATCAAGGGGAAGACTTTAAACCAAGACAATGGCAAGAAAGACAAGCTGTGGAGCAGAAATTCACTCAAGAAACAAATTCTGCACCAAGATTCTATCAA
GAACGTCGCACGGGGCAGCAAGAATTCAAACAAAATCCTTTATTCAGAAGGCATTCCGAGTGGTTGGAGGATAGCTCTAGTGAGGAAGATTATGATTATCAAGAAACGCA
AAGGGAAGGCCAACAAGCAGCAATTCGCCACGCGCTCTGCATCAAAGTGGATAAAGATGAGCTTGATTTGTGGGCCACCAAGTTTGGATGCCACTCTAATGAGCTTCCTT
TTAATTACCTCGGTTTTCAACTAGGAGAGGCCACTATAGCTGATTGTTGGTGCATTGCCTCTCAAACGTGGAACCTAGCCTTTAGAAGAGGCCTTTTCGAAAGGGAAATC
AGCAGCTGGCTGGTCTTGGTGGATAAAATCAAAGATGTCTCCTTGATAGATGAGAATGACCTTATTTGCTGGAAATTGGAGGGGTTTGGAGCCTATTCTACAAAATCTTT
CTTCCTCTCAATGACTAATGCTTCTCCTAAAATCAACCAGCCCACTAGTAGTCTCATTTGGAGACACAAGAGCCCAAAGAAAGTGAAAGTTTTGATGTGGTCCATTCTGT
ACAGAAGCTTAAATACTGATGATATCTTACAAAGGAAATTTAGAAGTTGGTCGCTATCTCCATCTGTTTGCAGGATGTGTATCAAGGCAGAAGAAAGCTTAGATCATTTG
TTCCTTCATTGTCAGTTTGCGGGGGCTGCTTGGAATTTTGTGGCGAATTTGTTGGGTATTTCGTTTTGTAAGCCTAAGAAGATTGAGGAGTGGCTTCTTGAAGGGCTGAA
CGCCCTATCACATGCTTACCAACTCACTGGAGGTGCACAAGTGAGAAATTGGGAGATCTCTACTCCTACTGATGGTAGAAGGCACCCAGTCAGTGGAGCCAAGTCTCACC
CCTCAAGGCACCTATGTGTGCATAGTACTCTAACAACTAACTCACTGGTGGGGCGTCCATGGGTTTTTAGTCGAAACTTTACTTCCACAAATAGTAGATGGCACCTAGTC
TGCGAAGCCGAGTCCCGCCCATTTAGCTCCTCATCTCAGTCTATAAACATCTCAATCCTAGGTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGATTCTTCCTCGGCTGGAAAAGAGACGGAACCTACCACCGTCCTCTCACCACGATCAACAACCGTATGCTTGTTGTCGGTTGAACAAGATACGAAAGTATTGAA
AAATGATGTGGGCGAGATTAAGAAAATTTTGGAGATGATTTGCGAAAAAATGGGCGGCAGATCGGACCAGCAAGTTTTTGAACCGAGAGCTCACAAAAATATGGAGAGGA
AACATCTAGATCATCAAGGGGAAGACTTTAAACCAAGACAATGGCAAGAAAGACAAGCTGTGGAGCAGAAATTCACTCAAGAAACAAATTCTGCACCAAGATTCTATCAA
GAACGTCGCACGGGGCAGCAAGAATTCAAACAAAATCCTTTATTCAGAAGGCATTCCGAGTGGTTGGAGGATAGCTCTAGTGAGGAAGATTATGATTATCAAGAAACGCA
AAGGGAAGGCCAACAAGCAGCAATTCGCCACGCGCTCTGCATCAAAGTGGATAAAGATGAGCTTGATTTGTGGGCCACCAAGTTTGGATGCCACTCTAATGAGCTTCCTT
TTAATTACCTCGGTTTTCAACTAGGAGAGGCCACTATAGCTGATTGTTGGTGCATTGCCTCTCAAACGTGGAACCTAGCCTTTAGAAGAGGCCTTTTCGAAAGGGAAATC
AGCAGCTGGCTGGTCTTGGTGGATAAAATCAAAGATGTCTCCTTGATAGATGAGAATGACCTTATTTGCTGGAAATTGGAGGGGTTTGGAGCCTATTCTACAAAATCTTT
CTTCCTCTCAATGACTAATGCTTCTCCTAAAATCAACCAGCCCACTAGTAGTCTCATTTGGAGACACAAGAGCCCAAAGAAAGTGAAAGTTTTGATGTGGTCCATTCTGT
ACAGAAGCTTAAATACTGATGATATCTTACAAAGGAAATTTAGAAGTTGGTCGCTATCTCCATCTGTTTGCAGGATGTGTATCAAGGCAGAAGAAAGCTTAGATCATTTG
TTCCTTCATTGTCAGTTTGCGGGGGCTGCTTGGAATTTTGTGGCGAATTTGTTGGGTATTTCGTTTTGTAAGCCTAAGAAGATTGAGGAGTGGCTTCTTGAAGGGCTGAA
CGCCCTATCACATGCTTACCAACTCACTGGAGGTGCACAAGTGAGAAATTGGGAGATCTCTACTCCTACTGATGGTAGAAGGCACCCAGTCAGTGGAGCCAAGTCTCACC
CCTCAAGGCACCTATGTGTGCATAGTACTCTAACAACTAACTCACTGGTGGGGCGTCCATGGGTTTTTAGTCGAAACTTTACTTCCACAAATAGTAGATGGCACCTAGTC
TGCGAAGCCGAGTCCCGCCCATTTAGCTCCTCATCTCAGTCTATAAACATCTCAATCCTAGGTGTTTGA
Protein sequenceShow/hide protein sequence
MGDSSSAGKETEPTTVLSPRSTTVCLLSVEQDTKVLKNDVGEIKKILEMICEKMGGRSDQQVFEPRAHKNMERKHLDHQGEDFKPRQWQERQAVEQKFTQETNSAPRFYQ
ERRTGQQEFKQNPLFRRHSEWLEDSSSEEDYDYQETQREGQQAAIRHALCIKVDKDELDLWATKFGCHSNELPFNYLGFQLGEATIADCWCIASQTWNLAFRRGLFEREI
SSWLVLVDKIKDVSLIDENDLICWKLEGFGAYSTKSFFLSMTNASPKINQPTSSLIWRHKSPKKVKVLMWSILYRSLNTDDILQRKFRSWSLSPSVCRMCIKAEESLDHL
FLHCQFAGAAWNFVANLLGISFCKPKKIEEWLLEGLNALSHAYQLTGGAQVRNWEISTPTDGRRHPVSGAKSHPSRHLCVHSTLTTNSLVGRPWVFSRNFTSTNSRWHLV
CEAESRPFSSSSQSINISILGV