; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034801 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034801
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:10958532..10960894
RNA-Seq ExpressionLag0034801
SyntenyLag0034801
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0140097 - catalytic activity, acting on DNA (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045287.1 uncharacterized protein E6C27_scaffold316G00450 [Cucumis melo var. makuwa]1.1e-1942.19Show/hide
Query:  LSPSQIPNEFSLLVETCGLQLCKISSPSPKETKQSKI--DLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVC
        L  S +  E  + ++T G++ CK          +SKI      IK+LWS  +IG  F+E+ G+SGG+L MWDES++SV E +KG + LS KC T+C K C
Subjt:  LSPSQIPNEFSLLVETCGLQLCKISSPSPKETKQSKI--DLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVC

Query:  WVTNVYGPNDYKERRFLWPELRSLSYYC
        W++NVYGP  ++ER+ +W EL   +  C
Subjt:  WVTNVYGPNDYKERRFLWPELRSLSYYC

KAA0063088.1 uncharacterized protein E6C27_scaffold623G00050 [Cucumis melo var. makuwa]1.9e-3261.54Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP
        +E+K+ + D+ FIKSLWSSK+ GW   E +G SGG+L +WD SKL V+E LKGGY+LS   +T+C K CW+TNVYGPND+KERR +WPEL SLS YCT  
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP

Query:  WCIA
        WCI+
Subjt:  WCIA

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]1.5e-2962.89Show/hide
Query:  KIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDPWCI
        +ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGY+LS   +T C K CW+TNVYGP DY+ERRF+W  L SLS YCT  WCI
Subjt:  KIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDPWCI

XP_010269625.1 PREDICTED: uncharacterized protein LOC104606223 [Nelumbo nucifera]4.7e-1537.14Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD
        +E+K   +D ++++S W S+ +GW+   ++G SGG++ +W E  + V+E L G +++S KC  +     WV TNVYGPN Y+ER  +W EL ++      
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD

Query:  PWCIA
        PWC++
Subjt:  PWCIA

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]6.3e-2860.19Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP
        +ETK+ +I+  FIKSLWSSKE+G  FVEA GKSGGLL +WD+SK+ V    K  ++LS KC T+  K+CW+TNVYGP DY+ERR LW EL SL+    DP
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP

Query:  WCI
        WCI
Subjt:  WCI

TrEMBL top hitse value%identityAlignment
A0A1U8B190 uncharacterized protein LOC1046062232.3e-1537.14Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD
        +E+K   +D ++++S W S+ +GW+   ++G SGG++ +W E  + V+E L G +++S KC  +     WV TNVYGPN Y+ER  +W EL ++      
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD

Query:  PWCIA
        PWC++
Subjt:  PWCIA

A0A438EG68 Transposon TX1 uncharacterized 149 kDa protein8.6e-1533.1Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD
        +ETK  ++  + +KS+   + +GW  ++A G +GG+L+MWD+  L  LEF  G +++S +         WV + +YGP+  +ERR LW EL ++   C D
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWV-TNVYGPNDYKERRFLWPELRSLSYYCTD

Query:  PWCIADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESF
        PWCIA    FN      F  + ++G  +S  M     + + F
Subjt:  PWCIADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESF

A0A5A7TTX5 Uncharacterized protein5.2e-2042.19Show/hide
Query:  LSPSQIPNEFSLLVETCGLQLCKISSPSPKETKQSKI--DLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVC
        L  S +  E  + ++T G++ CK          +SKI      IK+LWS  +IG  F+E+ G+SGG+L MWDES++SV E +KG + LS KC T+C K C
Subjt:  LSPSQIPNEFSLLVETCGLQLCKISSPSPKETKQSKI--DLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVC

Query:  WVTNVYGPNDYKERRFLWPELRSLSYYC
        W++NVYGP  ++ER+ +W EL   +  C
Subjt:  WVTNVYGPNDYKERRFLWPELRSLSYYC

A0A5A7V639 Uncharacterized protein9.2e-3361.54Show/hide
Query:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP
        +E+K+ + D+ FIKSLWSSK+ GW   E +G SGG+L +WD SKL V+E LKGGY+LS   +T+C K CW+TNVYGPND+KERR +WPEL SLS YCT  
Subjt:  KETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDP

Query:  WCIA
        WCI+
Subjt:  WCIA

A0A5D3BHE3 Uncharacterized protein7.3e-3062.89Show/hide
Query:  KIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDPWCI
        +ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGY+LS   +T C K CW+TNVYGP DY+ERRF+W  L SLS YCT  WCI
Subjt:  KIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVLEFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDPWCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30470.1 SIT4 phosphatase-associated family protein2.5e-0644.78Show/hide
Query:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG
        + DP+RF  GT  ++  Q+ HG S+    E+VEG   S G LL LL+VSS     +T +GK QP LG
Subjt:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG

AT1G30470.2 SIT4 phosphatase-associated family protein2.5e-0644.78Show/hide
Query:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG
        + DP+RF  GT  ++  Q+ HG S+    E+VEG   S G LL LL+VSS     +T +GK QP LG
Subjt:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG

AT1G30470.3 SIT4 phosphatase-associated family protein2.5e-0644.78Show/hide
Query:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG
        + DP+RF  GT  ++  Q+ HG S+    E+VEG   S G LL LL+VSS     +T +GK QP LG
Subjt:  IADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSS-----VTIFGKFQPFLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCATGATTCTTGGAAGGAGGTTAAGCTTGTCCTTGAGGATTTCTTCAAATCTTCAGTCTTGATCAATCCTTTTATGGATGATAAAGCTTTGATTCAGGTGGCTGA
TTGTAGTTTGGATCCATCTGTGAATGGTAAGTGGAAACTATTCGAGAACCTTCATTTGAAATTGGAATTTTGGTCCTCTGATTTTCATTCCCAGCCAAAATTTATAATAA
GTTATGGAGGATGGATTGCAATAAAAAGATTACCTTTGGATTTGTGGCATGGGACTCCTTTGAAGCTATTGGAAAAAACCTTGGTGGGTTGCATTGGAAAAGTGGGTAAT
CAGCCAGCAGCTTCCGAAAGTGTTATTAATAGCATTGAAAATGAGTGTATCCAGCAGGCAGCTTTAAAGACTTACTCTCGGAAAAAGGGGTCTCGGTTATTGGTTGAAAA
GGCCAATATTAATGCTGATCATTTGGAATCTGAATGTACTAATATGATTATTTCAAATAAGGCATTGGGATCCTCAAAAAACAGTGGTGAAAATAGTCTTTCAAAGTCCA
AGGCATTTATTGAATCGTCTGTGCAATTTCCAGGGGTGAAAAATCAATTTGTCAAAGGAATTGTTTGTTCTTCCAGCCCTAAAGTTCATTCTTCTATAGATTCTGATGAT
GAGTCTTCGGTTAGTGTGAGTAGTGATGATTCTGAGTCTTTGATTGCTGAAGAAGATTGGGAGGATGTTGGTTTTGGCAATCAAATTCAAGATACCTTGTTGTCTCCTTC
TCAAATCCCTAATGAGTTCTCTTTACTAGTGGAAACTTGTGGACTTCAATTGTGCAAGATTTCATCTCCATCACCGAAAGAAACCAAACAGTCGAAAATTGATTTAAAAT
TCATTAAATCTTTATGGAGTTCAAAGGAAATTGGATGGACTTTTGTGGAAGCTTATGGGAAATCAGGAGGTCTTCTTATTATGTGGGATGAGAGCAAATTATCAGTGCTG
GAATTCTTAAAGGGTGGTTATACTCTTTCAACTAAATGTCTTACTCTTTGTATAAAAGTTTGTTGGGTCACCAATGTTTATGGTCCGAATGACTACAAGGAAAGGAGATT
CTTATGGCCTGAATTGCGTTCCCTCTCTTACTATTGCACGGATCCATGGTGTATTGCAGATCCTAGGAGGTTTAATACTGGAACAACTTTTGTATTCACCGGCCAAATAG
CCCATGGATATTCTGTTTCAGCAACTATGGAGTCGGTGGAAGGTTGGTCGGAGAGTTTTGGTAGGTTGCTCAAGCTTCTGGATGTTTCTTCAGTTACTATATTTGGAAAG
TTTCAGCCATTTCTTGGATACTCTGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCATGATTCTTGGAAGGAGGTTAAGCTTGTCCTTGAGGATTTCTTCAAATCTTCAGTCTTGATCAATCCTTTTATGGATGATAAAGCTTTGATTCAGGTGGCTGA
TTGTAGTTTGGATCCATCTGTGAATGGTAAGTGGAAACTATTCGAGAACCTTCATTTGAAATTGGAATTTTGGTCCTCTGATTTTCATTCCCAGCCAAAATTTATAATAA
GTTATGGAGGATGGATTGCAATAAAAAGATTACCTTTGGATTTGTGGCATGGGACTCCTTTGAAGCTATTGGAAAAAACCTTGGTGGGTTGCATTGGAAAAGTGGGTAAT
CAGCCAGCAGCTTCCGAAAGTGTTATTAATAGCATTGAAAATGAGTGTATCCAGCAGGCAGCTTTAAAGACTTACTCTCGGAAAAAGGGGTCTCGGTTATTGGTTGAAAA
GGCCAATATTAATGCTGATCATTTGGAATCTGAATGTACTAATATGATTATTTCAAATAAGGCATTGGGATCCTCAAAAAACAGTGGTGAAAATAGTCTTTCAAAGTCCA
AGGCATTTATTGAATCGTCTGTGCAATTTCCAGGGGTGAAAAATCAATTTGTCAAAGGAATTGTTTGTTCTTCCAGCCCTAAAGTTCATTCTTCTATAGATTCTGATGAT
GAGTCTTCGGTTAGTGTGAGTAGTGATGATTCTGAGTCTTTGATTGCTGAAGAAGATTGGGAGGATGTTGGTTTTGGCAATCAAATTCAAGATACCTTGTTGTCTCCTTC
TCAAATCCCTAATGAGTTCTCTTTACTAGTGGAAACTTGTGGACTTCAATTGTGCAAGATTTCATCTCCATCACCGAAAGAAACCAAACAGTCGAAAATTGATTTAAAAT
TCATTAAATCTTTATGGAGTTCAAAGGAAATTGGATGGACTTTTGTGGAAGCTTATGGGAAATCAGGAGGTCTTCTTATTATGTGGGATGAGAGCAAATTATCAGTGCTG
GAATTCTTAAAGGGTGGTTATACTCTTTCAACTAAATGTCTTACTCTTTGTATAAAAGTTTGTTGGGTCACCAATGTTTATGGTCCGAATGACTACAAGGAAAGGAGATT
CTTATGGCCTGAATTGCGTTCCCTCTCTTACTATTGCACGGATCCATGGTGTATTGCAGATCCTAGGAGGTTTAATACTGGAACAACTTTTGTATTCACCGGCCAAATAG
CCCATGGATATTCTGTTTCAGCAACTATGGAGTCGGTGGAAGGTTGGTCGGAGAGTTTTGGTAGGTTGCTCAAGCTTCTGGATGTTTCTTCAGTTACTATATTTGGAAAG
TTTCAGCCATTTCTTGGATACTCTGTCTGA
Protein sequenceShow/hide protein sequence
MAHDSWKEVKLVLEDFFKSSVLINPFMDDKALIQVADCSLDPSVNGKWKLFENLHLKLEFWSSDFHSQPKFIISYGGWIAIKRLPLDLWHGTPLKLLEKTLVGCIGKVGN
QPAASESVINSIENECIQQAALKTYSRKKGSRLLVEKANINADHLESECTNMIISNKALGSSKNSGENSLSKSKAFIESSVQFPGVKNQFVKGIVCSSSPKVHSSIDSDD
ESSVSVSSDDSESLIAEEDWEDVGFGNQIQDTLLSPSQIPNEFSLLVETCGLQLCKISSPSPKETKQSKIDLKFIKSLWSSKEIGWTFVEAYGKSGGLLIMWDESKLSVL
EFLKGGYTLSTKCLTLCIKVCWVTNVYGPNDYKERRFLWPELRSLSYYCTDPWCIADPRRFNTGTTFVFTGQIAHGYSVSATMESVEGWSESFGRLLKLLDVSSVTIFGK
FQPFLGYSV