CuGenDBv2

Spg021686 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021686
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: scaffold2:17408107..17414074
RNA-Seq Expression: Spg021686
Synteny: Spg021686
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa], e value: 2.6e-46, % identity: 45.29
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    +E  +LW +V++SIHG   F+W T G+ G SLRSPW++I++ W   +SLA F LGNG R+ F  D W+G+ P   +F  LFRIA  P  S     
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE
            FSW ++FR+ L++EE  E Q L+ +L+  ++ +  D R WS+E  G F+ KSL  HL  +S M K +F A+ +S  P+RINIL WIM+F  + SSE
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE

Query:  VLQRRIPSHVLSPSICPLCSSAS
        +LQ++ P +V SPSICPLC  AS
Subjt:  VLQRRIPSHVLSPSICPLCSSAS

TYK06564.1 hypothetical protein E5676_scaffold453G00250 [Cucumis melo var. makuwa], e value: 3.0e-42, % identity: 47.98
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    EE +SLW +VV+SI+G    +WHT G+ G SL+SPWI IS V  + E+L  FKLGNG+R+ FW D W+ +  F   F  LFR++ +PN S+  LW
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFK
        D  T SWSI FR+L KEEE  E QQL+ +L++ +++D +D R WSL+ S  F+ KS  NHLA SS +++  +K
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFK

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa], e value: 6.5e-45, % identity: 44.39
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    +E  +LW +V++SIHG   F+W T G+ G SLRS W++I++ W   +SLA F LGNG R+ F  D W+G+ P   +F  LF IA  P        
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE
            FSW ++FR+ L++EE  E Q L+ +L+  ++ +  D R WS+E  G F+ KSL  HL  +S M K +F A+ +S  P+RINIL WIM+F  +NSSE
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE

Query:  VLQRRIPSHVLSPSICPLCSSAS
        +LQ++ P +V SPSICPLC  AS
Subjt:  VLQRRIPSHVLSPSICPLCSSAS

TYK22579.1 hypothetical protein E5676_scaffold584G00340 [Cucumis melo var. makuwa], e value: 2.3e-42, % identity: 47.8
Query:  EEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETF
        +E  ++W +V++SIHG   F+WHT  + G SLRSPW+SIS+ W + E LA FKLGNG RV FW D W  ++P   +   LF+IA LP  S+   WD +T 
Subjt:  EEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETF

Query:  SWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINIL
        SW+I FR+LLK+EE  + Q L+ +L+  R+++L D RIWSL+  G ++VKSL  HL+ S  + K  +KALWK+   +R NIL
Subjt:  SWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINIL

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida], e value: 7.9e-51, % identity: 44.49
Query:  SGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETFSWSISFRQLLKEEEFLELQQLMGI
        +G+ G SL+SPW+SISK W + E LA FKLGNGSR+ FW D W G  PF + F SLFRI +LPN S+ D WD    SWSI+FR+ LK+EE    Q L+  
Subjt:  SGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETFSWSISFRQLLKEEEFLELQQLMGI

Query:  LNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSEVLQRRIPSHVLSPSICPLCSSASCRGVVI
        +     S   D R+WS+  +  +TVKSL NHL   S + K +F  +WK+K P+R+NIL WIM+FG LN +EVLQ++ P+  LSP++CP C   S   + +
Subjt:  LNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSEVLQRRIPSHVLSPSICPLCSSASCRGVVI

Query:  KVKNNYCGFAPVEILCQSNRKKAIVHEVNYEDPQLL
             Y  +   ++LC  N    + ++      QLL
Subjt:  KVKNNYCGFAPVEILCQSNRKKAIVHEVNYEDPQLL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2Y0 zf-RVT domain-containing protein, e value: 1.3e-46, % identity: 45.29
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    +E  +LW +V++SIHG   F+W T G+ G SLRSPW++I++ W   +SLA F LGNG R+ F  D W+G+ P   +F  LFRIA  P  S     
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE
            FSW ++FR+ L++EE  E Q L+ +L+  ++ +  D R WS+E  G F+ KSL  HL  +S M K +F A+ +S  P+RINIL WIM+F  + SSE
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE

Query:  VLQRRIPSHVLSPSICPLCSSAS
        +LQ++ P +V SPSICPLC  AS
Subjt:  VLQRRIPSHVLSPSICPLCSSAS

A0A5D3BQW7 Uncharacterized protein, e value: 2.7e-41, % identity: 46.11
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    EE DSLWCKV +SIHG + FNWH +G+ G  LRSPW+S+S+ W + ++L  F+LGN  RV FW D W  ++PF    P LF IA L N  + D W
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKE--EEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIM
        D    SW I F  LLK+     + +        +   S+  DSR+ SL  SG FTVKS+ +HL+ S  + K +FKA+WKS  P+R+NIL WIM
Subjt:  DGETFSWSISFRQLLKE--EEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIM

A0A5D3C5F2 Uncharacterized protein, e value: 1.5e-42, % identity: 47.98
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    EE +SLW +VV+SI+G    +WHT G+ G SL+SPWI IS V  + E+L  FKLGNG+R+ FW D W+ +  F   F  LFR++ +PN S+  LW
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFK
        D  T SWSI FR+L KEEE  E QQL+ +L++ +++D +D R WSL+ S  F+ KS  NHLA SS +++  +K
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFK

A0A5D3DE60 zf-RVT domain-containing protein, e value: 3.1e-45, % identity: 44.39
Query:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW
        G    +E  +LW +V++SIHG   F+W T G+ G SLRS W++I++ W   +SLA F LGNG R+ F  D W+G+ P   +F  LF IA  P        
Subjt:  GLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLW

Query:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE
            FSW ++FR+ L++EE  E Q L+ +L+  ++ +  D R WS+E  G F+ KSL  HL  +S M K +F A+ +S  P+RINIL WIM+F  +NSSE
Subjt:  DGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSE

Query:  VLQRRIPSHVLSPSICPLCSSAS
        +LQ++ P +V SPSICPLC  AS
Subjt:  VLQRRIPSHVLSPSICPLCSSAS

A0A5D3DH32 Uncharacterized protein, e value: 1.1e-42, % identity: 47.8
Query:  EEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETF
        +E  ++W +V++SIHG   F+WHT  + G SLRSPW+SIS+ W + E LA FKLGNG RV FW D W  ++P   +   LF+IA LP  S+   WD +T 
Subjt:  EEPDSLWCKVVKSIHGHNRFNWHTSGRVGLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETF

Query:  SWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINIL
        SW+I FR+LLK+EE  + Q L+ +L+  R+++L D RIWSL+  G ++VKSL  HL+ S  + K  +KALWK+   +R NIL
Subjt:  SWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRIWSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINIL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein, e value: 1.8e-05, % identity: 35.59
Query:  QKSKLNWLKLGDENTSFFHRYLAAKKRKNLISELISSDGASLVSFREIEHEILTFFSSL
        QKS++ WL+ GD NT FFH+ + A + KNLI  L   D   + +  +++  I+ +++ L
Subjt:  QKSKLNWLKLGDENTSFFHRYLAAKKRKNLISELISSDGASLVSFREIEHEILTFFSSL
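
The % identity reported for a hit can be checked against its alignment block. The sketch below does this for the AT1G43760.1 block above. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that identity is taken over the full length of the displayed block. The function name `percent_identity` is illustrative and not part of any CuGenDB tool.

```python
# Query and midline rows copied from the AT1G43760.1 alignment block.
QUERY = "QKSKLNWLKLGDENTSFFHRYLAAKKRKNLISELISSDGASLVSFREIEHEILTFFSSL"
MIDLINE = "QKS++ WL+ GD NT FFH+ + A + KNLI  L   D   + +  +++  I+ +++ L"


def percent_identity(query: str, midline: str) -> float:
    """Percentage of aligned columns whose midline shows a letter.

    Assumption: the query row (including any '-' gap characters) spans the
    whole alignment, so its length is used as the alignment length.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


print(f"{percent_identity(QUERY, MIDLINE):.2f}")  # prints 35.59
```

For this block the midline has 21 identical positions over 59 aligned columns, which reproduces the listed 35.59. The longer hits are displayed in several blocks, so the counts would need to be summed across blocks before dividing.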


Sequences
CDS sequence
ATGGAATTTTATTTGTCTGATGAAAGGAATTTAATCCAGAAAAGTAAGCTTAATTGGTTAAAGCTTGGAGACGAGAATACTAGCTTTTTTCATCGATATTTGGCAGCCAA
GAAGAGGAAGAATCTGATTTCAGAATTAATTTCCAGCGATGGGGCTTCTTTAGTTTCTTTTAGAGAAATTGAACATGAAATTCTGACTTTTTTCTCTTCTTTATATCAGA
AAATCCCAGTGGGGTTGGAGATATTCGAGGAGCCGGATTCGTTGTGGTGCAAAGTTGTTAAGAGTATTCATGGGCATAATCGTTTTAATTGGCATACTTCCGGAAGGGTT
GGCTTGAGTCTTCGAAGTCCTTGGATTAGTATATCTAAGGTGTGGAATCAAAGTGAGAGTTTGGCTATTTTTAAACTTGGAAATGGCTCTCGAGTAGTTTTTTGGCATGA
CTTTTGGATTGGAGATCTTCCGTTTTATCTAAAATTTCCAAGTTTATTTCGAATTGCTTCACTTCCAAACGCTTCTATTAATGATCTTTGGGATGGGGAGACTTTTTCAT
GGAGTATTTCTTTCCGTCAGCTTCTTAAAGAGGAAGAATTTCTTGAGCTTCAACAGCTCATGGGCATTTTAAATGATGCCAGAATTTCAGATCTTTTGGATTCCCGCATT
TGGTCTTTAGAGAGATCGGGTTTGTTCACGGTTAAATCTCTTTACAATCATTTGGCTGCTTCTTCAAATATGCACAAGGATGTGTTTAAAGCTCTTTGGAAGTCTAAATG
TCCAAAGCGTATTAACATTCTTTGTTGGATCATGATCTTTGGTTCTTTAAACAGCTCGGAGGTTCTTCAAAGGCGGATCCCGTCTCATGTTTTATCTCCTTCCATTTGCC
CTTTATGTTCAAGTGCCAGTTGTAGGGGGGTTGTTATTAAAGTTAAGAATAACTATTGTGGTTTTGCTCCGGTAGAGATTTTGTGTCAAAGCAACCGGAAGAAAGCCATT
GTTCATGAAGTCAACTATGAAGATCCCCAGTTGCTCGAAACGAGAAAAGTGGATGTCCATGGTAGATTCTCTAGTGAAACAACTAGGGTTTTCTTCGAGGAAGATGAATC
AGGAGATGGGTCGTATACAGTAGACAAGACTAGAGTTATGAGCTCTTTTGAACACTTACTTGAGGTGGGACCGATCACAGAAGCAAAGGTAGAAGACCTAGGTTCTCTTT
TTGTAGAAGAAGATGTGGAGGAAGATCCAACCTTGGACAAAGAGAACATGGCTCTAGTTCTCTCAGAGAAAGACGTCAAAGAACCGATGAAAACCATGTCGATAAAGCAA
GATGGAAAGGATAGGAAAGTGAGGATAGACCTAAGGGAGAAAGGTCAGGAAGGCGAGGAGGGTAGCAGTTCTACAGGCATAAGGGGTCTAGTCGTCCAATTTTTAAGGAA
ATGGTTGAGGGGCATGGGAGATATTGAAAAAAGGAGGGTAGTTACAGATTTGCTAACTAGTGTCAATCCAAATGTTGCTTTGAGATTCAGGAGACCAAGCTTGAGATAA
mRNA sequence
ATGGAATTTTATTTGTCTGATGAAAGGAATTTAATCCAGAAAAGTAAGCTTAATTGGTTAAAGCTTGGAGACGAGAATACTAGCTTTTTTCATCGATATTTGGCAGCCAA
GAAGAGGAAGAATCTGATTTCAGAATTAATTTCCAGCGATGGGGCTTCTTTAGTTTCTTTTAGAGAAATTGAACATGAAATTCTGACTTTTTTCTCTTCTTTATATCAGA
AAATCCCAGTGGGGTTGGAGATATTCGAGGAGCCGGATTCGTTGTGGTGCAAAGTTGTTAAGAGTATTCATGGGCATAATCGTTTTAATTGGCATACTTCCGGAAGGGTT
GGCTTGAGTCTTCGAAGTCCTTGGATTAGTATATCTAAGGTGTGGAATCAAAGTGAGAGTTTGGCTATTTTTAAACTTGGAAATGGCTCTCGAGTAGTTTTTTGGCATGA
CTTTTGGATTGGAGATCTTCCGTTTTATCTAAAATTTCCAAGTTTATTTCGAATTGCTTCACTTCCAAACGCTTCTATTAATGATCTTTGGGATGGGGAGACTTTTTCAT
GGAGTATTTCTTTCCGTCAGCTTCTTAAAGAGGAAGAATTTCTTGAGCTTCAACAGCTCATGGGCATTTTAAATGATGCCAGAATTTCAGATCTTTTGGATTCCCGCATT
TGGTCTTTAGAGAGATCGGGTTTGTTCACGGTTAAATCTCTTTACAATCATTTGGCTGCTTCTTCAAATATGCACAAGGATGTGTTTAAAGCTCTTTGGAAGTCTAAATG
TCCAAAGCGTATTAACATTCTTTGTTGGATCATGATCTTTGGTTCTTTAAACAGCTCGGAGGTTCTTCAAAGGCGGATCCCGTCTCATGTTTTATCTCCTTCCATTTGCC
CTTTATGTTCAAGTGCCAGTTGTAGGGGGGTTGTTATTAAAGTTAAGAATAACTATTGTGGTTTTGCTCCGGTAGAGATTTTGTGTCAAAGCAACCGGAAGAAAGCCATT
GTTCATGAAGTCAACTATGAAGATCCCCAGTTGCTCGAAACGAGAAAAGTGGATGTCCATGGTAGATTCTCTAGTGAAACAACTAGGGTTTTCTTCGAGGAAGATGAATC
AGGAGATGGGTCGTATACAGTAGACAAGACTAGAGTTATGAGCTCTTTTGAACACTTACTTGAGGTGGGACCGATCACAGAAGCAAAGGTAGAAGACCTAGGTTCTCTTT
TTGTAGAAGAAGATGTGGAGGAAGATCCAACCTTGGACAAAGAGAACATGGCTCTAGTTCTCTCAGAGAAAGACGTCAAAGAACCGATGAAAACCATGTCGATAAAGCAA
GATGGAAAGGATAGGAAAGTGAGGATAGACCTAAGGGAGAAAGGTCAGGAAGGCGAGGAGGGTAGCAGTTCTACAGGCATAAGGGGTCTAGTCGTCCAATTTTTAAGGAA
ATGGTTGAGGGGCATGGGAGATATTGAAAAAAGGAGGGTAGTTACAGATTTGCTAACTAGTGTCAATCCAAATGTTGCTTTGAGATTCAGGAGACCAAGCTTGAGATAA
Protein sequence
MEFYLSDERNLIQKSKLNWLKLGDENTSFFHRYLAAKKRKNLISELISSDGASLVSFREIEHEILTFFSSLYQKIPVGLEIFEEPDSLWCKVVKSIHGHNRFNWHTSGRV
GLSLRSPWISISKVWNQSESLAIFKLGNGSRVVFWHDFWIGDLPFYLKFPSLFRIASLPNASINDLWDGETFSWSISFRQLLKEEEFLELQQLMGILNDARISDLLDSRI
WSLERSGLFTVKSLYNHLAASSNMHKDVFKALWKSKCPKRINILCWIMIFGSLNSSEVLQRRIPSHVLSPSICPLCSSASCRGVVIKVKNNYCGFAPVEILCQSNRKKAI
VHEVNYEDPQLLETRKVDVHGRFSSETTRVFFEEDESGDGSYTVDKTRVMSSFEHLLEVGPITEAKVEDLGSLFVEEDVEEDPTLDKENMALVLSEKDVKEPMKTMSIKQ
DGKDRKVRIDLREKGQEGEEGSSSTGIRGLVVQFLRKWLRGMGDIEKRRVVTDLLTSVNPNVALRFRRPSLR
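
The relationship between the CDS and protein sequences above can be spot-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The record itself does not state which table was used, and the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 15 nucleotides of the CDS are included to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in the compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed in this record.
CDS_START = "ATGGAATTTTATTTGTCTGATGAAAGGAAT"
CDS_END = "CCAAGCTTGAGATAA"

print(translate(CDS_START))  # MEFYLSDERN
print(translate(CDS_END))    # PSLR
```

Both fragments agree with the listed protein, which begins MEFYLSDERN and ends PSLR, with the final TAA codon read as the stop.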