CuGenDBv2

Lag0017356 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017356
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:2670798..2671550
RNA-Seq Expression: Lag0017356
Synteny: Lag0017356
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR005135 - Endonuclease/exonuclease/phosphatase
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
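
The genome location in this record can be cross-checked against the sequence lengths listed further down. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not say so); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end = parse_location("chr5:2670798..2671550")
print(chrom, span_length(start, end))
```

Under that assumption the span is 753 bp, which is 251 codons and so is consistent with the 250-residue protein sequence plus a stop codon.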


Homology
GenBank top hits (e-value, % identity, alignment)
KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa] (e-value: 1.0e-51; identity: 46.44%)
Query:  GLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDMS-WRFTGFYGDP
        GLGNP A+ +L+  +RK SPS++F+SETK     A+ ++ ++ F   F V   G SGGL+L WN+  ++SVKSFS GH+D ++K P ++ WRFTGFYG+P
Subjt:  GLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDMS-WRFTGFYGDP

Query:  DPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFLINHDMHRSF
          + R  SWQLL RL    +LPWI GGDFNEI+  NEK+GG  ++   I DF++ LD   L+D GF     TW   +   + V+ERL+R+  N + H  F
Subjt:  DPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFLINHDMHRSF

Query:  QSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF
         S+KV N  F  SDHRPI A LE+     +R+  KKK F
Subjt:  QSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF

KAG4109659.1 hypothetical protein ERO13_1Z049785v2, partial [Gossypium hirsutum] (e-value: 1.4e-48; identity: 41.49%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW
        M+++CWN RG+GNP A+R LK  +  N P I+F+ ETK N NK D ++ +   + C AV+  G SGGLV+ W +S  + ++++S  H+D++I  E D   
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW

Query:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL
        RFTGFYG+ DPNKR  SW +L R+    N  WI+GGDFN ++ E EKEG  +KA   +DDFR  +D   ++D        TW   K  T++VKERL+RFL
Subjt:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL

Query:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRR
        ++ ++  SF  ++ + +  +SSDH  I    E   P+   R
Subjt:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRR

KAG6649980.1 hypothetical protein CIPAW_06G011600 [Carya illinoinensis] (e-value: 1.2e-49; identity: 41.56%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEP--DMS
        M  L WN RGLGNPR I  L   ++  SP ++F+ ETKC+  K ++++++ GFD CFAV N G SGGL L WN+S ++ V +F+  H+   +K P  D S
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEP--DMS

Query:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF
        W  TGFYG P+  KR+++WQ+L+ L     +PW+  GDFNEI C +EK G A +  K + DFR+TL    L D G+   K+TW   +  ++  KERL+R 
Subjt:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF

Query:  LINHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRK
        L N      F+   V++L+  SSDH+P+L TL++   +  +++
Subjt:  LINHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRK

XP_028075737.1 uncharacterized protein LOC114277953 [Camellia sinensis] (e-value: 1.5e-50; identity: 42.17%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPD--MS
        MK+LCWN RGLGNPR +R L+  ++K  P+++F+ ETK +    + ++ +LG   CF V   G SGGL L W     L +KSFS+GHVD++I       S
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPD--MS

Query:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF
        W FTGFYG+P  + R+ SW+LL RL D  +LPW+  GDFNEI+  +EK G A ++++ +D FR+ L +  L D GF  A  TW  G+     ++ERL+R 
Subjt:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF

Query:  LINHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF
        ++N     SF   +V +L   SSDH PIL  LE    + + R   +K +
Subjt:  LINHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF

XP_038697213.1 uncharacterized protein LOC119994946 [Tripterygium wilfordii] (e-value: 6.2e-49; identity: 41.39%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPD-MSW
        MK++ WN RGLG+PR + +L   ++ +SP ILF+SET+ + + ++ L+L+L F+FCF V   G  GGL++ WN S  L+V SFS  H+DTVI   D +SW
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPD-MSW

Query:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL
        RFTG YG P    R   W LL RL    +LPW+ GGDFNEI+  +E  G   +A+  + +FR+ L +  L D G+S    TW   ++ T+ V+ERL+R +
Subjt:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL

Query:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRKAK
         +    + F    +++L+  +SDH P+L T+ S GP  S R  +
Subjt:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRRKAK

TrEMBL top hits (e-value, % identity, alignment)
A0A5B6WSX4 Reverse transcriptase (e-value: 1.5e-48; identity: 43.67%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDM--S
        MK LCWNVRGLG+ RA+R L+  I+++ P ++F+ ETK N  +  +++   GFDF   V   GS GGL L W    D+ +KSFSK H+D +IKE ++   
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDM--S

Query:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF
        W+FTGFYG P    +N  W LL+RL    + PW++ GDFNEI+   EK GG  +  K ++ FRETL +  L D GFS   +TW RG    + ++ERL+R 
Subjt:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF

Query:  LINHDMHRSFQSIKVQNLSFNSSDHRPIL
        + N      F S  +Q+L ++ SDH P+L
Subjt:  LINHDMHRSFQSIKVQNLSFNSSDHRPIL

A0A5B6WXI9 Reverse transcriptase (e-value: 1.1e-48; identity: 41.48%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDM--S
        MK +CWNVRGLG+PRA+R L+   ++ +P I+F+ ETK N  + +S++    F   F +   GS GGL L W +   ++++S+SK H+D ++ E  +   
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDM--S

Query:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF
        WRFTGFYG P   +RN  W LL+RL+   + PW++ GDFNEI+   EK GG  + +K ++ FR+TL+   L D G+S  ++TW RG    + ++ERL+R 
Subjt:  WRFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRF

Query:  LINHDMHRSFQSIKVQNLSFNSSDHRPIL
        + N +    F    +Q L F+SSDH PIL
Subjt:  LINHDMHRSFQSIKVQNLSFNSSDHRPIL

A0A5D2GPG4 Endo/exonuclease/phosphatase domain-containing protein (e-value: 1.3e-47; identity: 40.25%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW
        M+++CWN RG+GNP  +  LK  +  N P I+F+ ETK N NK D ++ +   + C AV+  G SGGLV+ W +S  + ++++S  H+D++I  E D   
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW

Query:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL
        RFTGFYG+ DPNKR  SW +L R++   N  WI+GGDFN ++ E EKEG  +KA   ++DFR  +D   ++D        TW   K  T++VKERL+RFL
Subjt:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL

Query:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRR
        ++ +   SF  ++ + +  +SSDH  I    E   P+   R
Subjt:  INHDMHRSFQSIKVQNLSFNSSDHRPILATLESHGPKASRR

A0A7J6DZ24 CCHC-type domain-containing protein (e-value: 4.9e-52; identity: 46.44%)
Query:  GLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDMS-WRFTGFYGDP
        GLGNP A+ +L+  +RK SPS++F+SETK     A+ ++ ++ F   F V   G SGGL+L WN+  ++SVKSFS GH+D ++K P ++ WRFTGFYG+P
Subjt:  GLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDMS-WRFTGFYGDP

Query:  DPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFLINHDMHRSF
          + R  SWQLL RL    +LPWI GGDFNEI+  NEK+GG  ++   I DF++ LD   L+D GF     TW   +   + V+ERL+R+  N + H  F
Subjt:  DPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFLINHDMHRSF

Query:  QSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF
         S+KV N  F  SDHRPI A LE+     +R+  KKK F
Subjt:  QSIKVQNLSFNSSDHRPILATLESHGPKASRRKAKKKKF

A0A803PPS5 Uncharacterized protein (e-value: 9.6e-48; identity: 42.98%)
Query:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW
        MK L WNV+G+GNP  +RSLK+ + + +P ++FISE++   +KA+ L++ LGF  CF V   G SG L+L W+     +++SFS  H+D+ IK E D  W
Subjt:  MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIK-EPDMSW

Query:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL
        RFTGFYGDPDPN+R  SW+LL R+    + PW++ GDFNEI+ +  K GG  K    +++FR+ L++ CL +  F  +K+TW  G+   + + ERL+R  
Subjt:  RFTGFYGDPDPNKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFL

Query:  INHDMHRSFQSIKVQNLSFNSSDHRPIL
         N +    F + KV +L   +SDH P+L
Subjt:  INHDMHRSFQSIKVQNLSFNSSDHRPIL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAACTACTATGCTGGAACGTTCGGGGATTGGGGAATCCTCGAGCGATCCGCAGCCTAAAGAATGAGATAAGGAAAAATTCCCCAAGCATCTTATTCATTTCTGAAAC
CAAATGCAACGATAACAAGGCAGATTCGTTGAAGCTCGAGTTGGGCTTTGACTTTTGTTTCGCGGTTAGCAATAGGGGCAGTAGTGGAGGGTTAGTCCTATACTGGAACA
ACTCAACTGATTTATCAGTTAAGTCTTTTTCTAAGGGGCATGTCGACACTGTTATTAAAGAGCCTGATATGAGCTGGCGGTTTACGGGTTTCTATGGGGATCCAGACCCT
AACAAAAGAAACCAGTCCTGGCAGCTTTTGGAGAGACTGAACGATAACGGCAACCTCCCTTGGATCTTGGGAGGAGATTTCAATGAAATTATTTGCGAGAATGAGAAGGA
AGGTGGAGCCCAAAAGGCAAAAAAGGGCATAGATGATTTCAGAGAGACTTTGGACAATTTTTGCCTGTTGGATCCCGGGTTTTCAAGGGCCAAGCACACTTGGAGGAGAG
GCAAGACAGCAACCTCTAAAGTCAAAGAGAGACTTAATAGATTCCTGATAAATCATGATATGCACAGAAGCTTCCAGTCCATCAAGGTTCAAAACTTAAGCTTCAACTCG
TCAGATCACAGACCTATTTTGGCCACATTAGAGTCGCACGGTCCAAAGGCCTCTAGAAGGAAAGCCAAGAAGAAAAAATTCGAGGAAGCTTAG
mRNA sequence
ATGAAACTACTATGCTGGAACGTTCGGGGATTGGGGAATCCTCGAGCGATCCGCAGCCTAAAGAATGAGATAAGGAAAAATTCCCCAAGCATCTTATTCATTTCTGAAAC
CAAATGCAACGATAACAAGGCAGATTCGTTGAAGCTCGAGTTGGGCTTTGACTTTTGTTTCGCGGTTAGCAATAGGGGCAGTAGTGGAGGGTTAGTCCTATACTGGAACA
ACTCAACTGATTTATCAGTTAAGTCTTTTTCTAAGGGGCATGTCGACACTGTTATTAAAGAGCCTGATATGAGCTGGCGGTTTACGGGTTTCTATGGGGATCCAGACCCT
AACAAAAGAAACCAGTCCTGGCAGCTTTTGGAGAGACTGAACGATAACGGCAACCTCCCTTGGATCTTGGGAGGAGATTTCAATGAAATTATTTGCGAGAATGAGAAGGA
AGGTGGAGCCCAAAAGGCAAAAAAGGGCATAGATGATTTCAGAGAGACTTTGGACAATTTTTGCCTGTTGGATCCCGGGTTTTCAAGGGCCAAGCACACTTGGAGGAGAG
GCAAGACAGCAACCTCTAAAGTCAAAGAGAGACTTAATAGATTCCTGATAAATCATGATATGCACAGAAGCTTCCAGTCCATCAAGGTTCAAAACTTAAGCTTCAACTCG
TCAGATCACAGACCTATTTTGGCCACATTAGAGTCGCACGGTCCAAAGGCCTCTAGAAGGAAAGCCAAGAAGAAAAAATTCGAGGAAGCTTAG
Protein sequence
MKLLCWNVRGLGNPRAIRSLKNEIRKNSPSILFISETKCNDNKADSLKLELGFDFCFAVSNRGSSGGLVLYWNNSTDLSVKSFSKGHVDTVIKEPDMSWRFTGFYGDPDP
NKRNQSWQLLERLNDNGNLPWILGGDFNEIICENEKEGGAQKAKKGIDDFRETLDNFCLLDPGFSRAKHTWRRGKTATSKVKERLNRFLINHDMHRSFQSIKVQNLSFNS
SDHRPILATLESHGPKASRRKAKKKKFEEA
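
The protein sequence can be checked against the CDS by translating it in frame. The sketch below is one way to do that in plain Python, assuming the standard genetic code (the record does not state which translation table was used); `translate` is a hypothetical helper, and only the first 54 bases of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 54 bases of the CDS listed in this record.
cds_start = "ATGAAACTACTATGCTGGAACGTTCGGGGATTGGGGAATCCTCGAGCGATCCGC"
print(translate(cds_start))
```

The printed peptide should match the start of the protein sequence listed here (MKLLCWNVRGLGNPRAIR). Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole record.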