; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036429 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036429
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Tf2-2 polyprotein
Genome locationchr3:46360339..46361482
RNA-Seq ExpressionLag0036429
SyntenyLag0036429
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG51266.1 hypothetical protein EZV62_023790 [Acer yangbiense]6.2e-4547.86Show/hide
Query:  STKETTSSKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGC
        +++E   SK+K+ EP  F GSR  K LENFLWDME+YFKA    D  QV ITSMYL  DAKL WR R  +++SAGRP I+ WES+ KELK+QFL CN   
Subjt:  STKETTSSKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGC

Query:  VTREVLKKLKLTKMIQEYVKEFSSLMLDINNMFEEDK-------------TSLTRTEL-------------------------VSMKINGNAVSSMMDTS
        + RE LKKLK T  +++YVKEFSSLMLDI NM EEDK             T L R  +                         V++++NG A+ +M+DT 
Subjt:  VTREVLKKLKLTKMIQEYVKEFSSLMLDINNMFEEDK-------------TSLTRTEL-------------------------VSMKINGNAVSSMMDTS

Query:  TTNNFVAARMIDILGLEIAQSDACITAVNSTTQP
         TNNFVA R  D LGL + +S   I AVNS   P
Subjt:  TTNNFVAARMIDILGLEIAQSDACITAVNSTTQP

XP_009767233.1 PREDICTED: uncharacterized protein LOC104218438, partial [Nicotiana sylvestris]1.1e-3663.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

XP_009779304.1 PREDICTED: uncharacterized protein LOC104228518, partial [Nicotiana sylvestris]1.1e-3663.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

XP_009780550.1 PREDICTED: uncharacterized protein LOC104229593 [Nicotiana sylvestris]1.8e-3663.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

XP_016435143.1 PREDICTED: uncharacterized protein LOC107761437 [Nicotiana tabacum]3.1e-3663.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R +++VSAGRPKI   E + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NMFEEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

TrEMBL top hitse value%identityAlignment
A0A1S3X604 uncharacterized protein LOC1077614371.5e-3663.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R +++VSAGRPKI   E + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NMFEEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

A0A1U7VLK2 uncharacterized protein LOC1042184385.1e-3763.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

A0A1U7WX92 uncharacterized protein LOC1042285185.1e-3763.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

A0A1U7X259 uncharacterized protein LOC1042295938.7e-3763.08Show/hide
Query:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK
        +KV+I EP  F G+R AK LENFLWDME+YF AA   DE +V IT+MYLV DAKL WR R  ++VSAGRPKI  WE + KELKDQF   N G + R+ LK
Subjt:  SKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGCVTREVLK

Query:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK
        KLK T  +++YVK+FSSLMLDI+NM EEDK
Subjt:  KLKLTKMIQEYVKEFSSLMLDINNMFEEDK

A0A5C7H4V1 Retrotrans_gag domain-containing protein3.0e-4547.86Show/hide
Query:  STKETTSSKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGC
        +++E   SK+K+ EP  F GSR  K LENFLWDME+YFKA    D  QV ITSMYL  DAKL WR R  +++SAGRP I+ WES+ KELK+QFL CN   
Subjt:  STKETTSSKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKELKDQFLSCNIGC

Query:  VTREVLKKLKLTKMIQEYVKEFSSLMLDINNMFEEDK-------------TSLTRTEL-------------------------VSMKINGNAVSSMMDTS
        + RE LKKLK T  +++YVKEFSSLMLDI NM EEDK             T L R  +                         V++++NG A+ +M+DT 
Subjt:  VTREVLKKLKLTKMIQEYVKEFSSLMLDINNMFEEDK-------------TSLTRTEL-------------------------VSMKINGNAVSSMMDTS

Query:  TTNNFVAARMIDILGLEIAQSDACITAVNSTTQP
         TNNFVA R  D LGL + +S   I AVNS   P
Subjt:  TTNNFVAARMIDILGLEIAQSDACITAVNSTTQP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGCGCATCCAAGCATTGGTGCGTTGAAGGGCTTCATGAATGTGACGCTCTTGCGGGGACTTCCACTAAGGAAACCACGAGTTCAAAGGTGAAGATCCTTGAGCC
TTCATCTTTTAAGGGTTCCCGGGGTGCCAAGGGATTAGAAAATTTCCTTTGGGATATGGAAGAGTACTTTAAGGCTGCGATGACCTCCGATGAGGTGCAAGTCGGTATAA
CAAGTATGTACCTTGTTAGAGATGCCAAGTTAAGGTGGAGAATCCGAACTCTAAACAATGTAAGTGCTGGTAGGCCAAAGATTATTCAATGGGAATCAATAAATAAGGAA
TTGAAGGATCAGTTCCTTTCTTGCAACATAGGATGTGTTACTAGGGAGGTCCTGAAGAAGCTCAAGCTTACCAAGATGATTCAGGAGTATGTCAAGGAATTTAGTTCCTT
GATGTTGGACATCAACAATATGTTTGAGGAAGACAAAACTAGTCTAACCCGGACTGAGCTTGTCTCTATGAAGATAAATGGGAATGCAGTGTCATCCATGATGGATACGA
GCACTACTAACAATTTTGTGGCAGCAAGGATGATTGACATATTGGGTCTTGAGATAGCTCAAAGCGATGCTTGCATTACGGCTGTTAACTCCACGACCCAACCAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGCGCATCCAAGCATTGGTGCGTTGAAGGGCTTCATGAATGTGACGCTCTTGCGGGGACTTCCACTAAGGAAACCACGAGTTCAAAGGTGAAGATCCTTGAGCC
TTCATCTTTTAAGGGTTCCCGGGGTGCCAAGGGATTAGAAAATTTCCTTTGGGATATGGAAGAGTACTTTAAGGCTGCGATGACCTCCGATGAGGTGCAAGTCGGTATAA
CAAGTATGTACCTTGTTAGAGATGCCAAGTTAAGGTGGAGAATCCGAACTCTAAACAATGTAAGTGCTGGTAGGCCAAAGATTATTCAATGGGAATCAATAAATAAGGAA
TTGAAGGATCAGTTCCTTTCTTGCAACATAGGATGTGTTACTAGGGAGGTCCTGAAGAAGCTCAAGCTTACCAAGATGATTCAGGAGTATGTCAAGGAATTTAGTTCCTT
GATGTTGGACATCAACAATATGTTTGAGGAAGACAAAACTAGTCTAACCCGGACTGAGCTTGTCTCTATGAAGATAAATGGGAATGCAGTGTCATCCATGATGGATACGA
GCACTACTAACAATTTTGTGGCAGCAAGGATGATTGACATATTGGGTCTTGAGATAGCTCAAAGCGATGCTTGCATTACGGCTGTTAACTCCACGACCCAACCAACTTAA
Protein sequenceShow/hide protein sequence
MEGASKHWCVEGLHECDALAGTSTKETTSSKVKILEPSSFKGSRGAKGLENFLWDMEEYFKAAMTSDEVQVGITSMYLVRDAKLRWRIRTLNNVSAGRPKIIQWESINKE
LKDQFLSCNIGCVTREVLKKLKLTKMIQEYVKEFSSLMLDINNMFEEDKTSLTRTELVSMKINGNAVSSMMDTSTTNNFVAARMIDILGLEIAQSDACITAVNSTTQPT