; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012449 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012449
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:41130300..41134471
RNA-Seq ExpressionLag0012449
SyntenyLag0012449
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]4.8e-2036.77Show/hide
Query:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLS---------QVSMAPTDIKIEAPFWQELYDLSCLCSDIWM
        TK A  DR  + S+W+ RN  W  L A G+SGGIL+MW+        +  G +S+S+  +              P    +   FW+EL D+ CL S  W 
Subjt:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLS---------QVSMAPTDIKIEAPFWQELYDLSCLCSDIWM

Query:  LAGDFNVTRWSHEKSKGG-VTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
        + GDFNV R   EK  GG +T SMK  + F  ++ LID PL++  +TWS+ +  P
Subjt:  LAGDFNVTRWSHEKSKGG-VTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

KAA0037455.1 hypothetical protein E6C27_scaffold277G00430 [Cucumis melo var. makuwa]8.2e-2037.01Show/hide
Query:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS--------MAPTDIKIEAPFWQELYDLSCLCSDIWML
        +K+++L      ++WS + I W SL + GSSG I+++WND  + + N   G++S+S+  S  +          P+     + FW EL  L   C   W+L
Subjt:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS--------MAPTDIKIEAPFWQELYDLSCLCSDIWML

Query:  AGDFNVTRWSHE-KSKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
        AGDFN+ RW  E  +K   TR+M FFN F   + LID PL N  +TWS+ R  P
Subjt:  AGDFNVTRWSHE-KSKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

XP_010647647.1 PREDICTED: uncharacterized protein LOC104878737, partial [Vitis vinifera]8.2e-2038.71Show/hide
Query:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL---------LLSQVSMAPTDIKIEAPFWQELYDLSCLCSDIWM
        TK    DR  + S+W+ RN  W +L A G+SGGILI+W+        +  G +S+S+         L       P    ++  FW EL D++ L S  W 
Subjt:  TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL---------LLSQVSMAPTDIKIEAPFWQELYDLSCLCSDIWM

Query:  LAGDFNVTRWSHEKSKGG-VTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
        + GDFNV R S EK  G  +T SMK F+ F +D  LIDLPL++ L+TWS+ +  P
Subjt:  LAGDFNVTRWSHEKSKGG-VTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]2.7e-2338.82Show/hide
Query:  DISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEA
        D  ++LC  +V L+    TK + ++   IKSLWS  +I+W SLDASG+SGGI+++W+  +     +  G +S+S+                +P   K   
Subjt:  DISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEA

Query:  PFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKGGVTRS-MKFFNQFFADSNLIDLPLQNGLYTWSDNR
         FWQEL+DL+ LC  IW+L  DFN+ RWSHE S     ++ M  FN F   + LID  + NG YTWS+ R
Subjt:  PFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKGGVTRS-MKFFNQFFADSNLIDLPLQNGLYTWSDNR

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.5e-3447.06Show/hide
Query:  FLRDNINGTDI--SSTLCQILVQLANPFT-----TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL-------LL
        FL  N+ G D      L +  +   NP       TKL+++D LI+KSLWS   I+W++LDASG + GILI+WNDP      + +GV+SL++        L
Subjt:  FLRDNINGTDI--SSTLCQILVQLANPFT-----TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL-------LL

Query:  SQVS--MAPTDIKIEAPFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKG-GVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRS
          VS    P+  +    FWQEL DLS LC + W+LAGDFNVTRWS EKS G  +T+SM  FN F  DS+LID+PL NG +TWS N S
Subjt:  SQVS--MAPTDIKIEAPFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKG-GVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRS

TrEMBL top hitse value%identityAlignment
A0A6J1CVN2 uncharacterized protein LOC1110146571.3e-2338.82Show/hide
Query:  DISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEA
        D  ++LC  +V L+    TK + ++   IKSLWS  +I+W SLDASG+SGGI+++W+  +     +  G +S+S+                +P   K   
Subjt:  DISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEA

Query:  PFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKGGVTRS-MKFFNQFFADSNLIDLPLQNGLYTWSDNR
         FWQEL+DL+ LC  IW+L  DFN+ RWSHE S     ++ M  FN F   + LID  + NG YTWS+ R
Subjt:  PFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKGGVTRS-MKFFNQFFADSNLIDLPLQNGLYTWSDNR

A0A6J1E2G6 uncharacterized protein LOC1110254057.4e-3547.06Show/hide
Query:  FLRDNINGTDI--SSTLCQILVQLANPFT-----TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL-------LL
        FL  N+ G D      L +  +   NP       TKL+++D LI+KSLWS   I+W++LDASG + GILI+WNDP      + +GV+SL++        L
Subjt:  FLRDNINGTDI--SSTLCQILVQLANPFT-----TKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSL-------LL

Query:  SQVS--MAPTDIKIEAPFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKG-GVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRS
          VS    P+  +    FWQEL DLS LC + W+LAGDFNVTRWS EKS G  +T+SM  FN F  DS+LID+PL NG +TWS N S
Subjt:  SQVS--MAPTDIKIEAPFWQELYDLSCLCSDIWMLAGDFNVTRWSHEKSKG-GVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRS

A0A803QEA6 Uncharacterized protein8.0e-2135.63Show/hide
Query:  TDISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIE
        T I +T+C+    L      K A +DR  I S+W  R  +W  + A G SGG L++W+     + +   G +S+S+L++              P   K+ 
Subjt:  TDISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIE

Query:  APFWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
          FW EL  LS +C   W +AGDFNVTR   EK +    TRSMK F+    +  LID  L+NG +TWS+ R+ P
Subjt:  APFWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

A0A803QI00 Uncharacterized protein8.0e-2134.88Show/hide
Query:  ISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEAP
        I +T+C+    L      K   +DR  I S+W  R  +W  + A G SGG L++W+     + +   G +S+S+L+               P   K+   
Subjt:  ISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEAP

Query:  FWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
        FW EL  LS +C D W + GDFNVTR   EK +    TRSMK F+    +  LID  L+NG +TWS+ R+ P
Subjt:  FWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

A0A803QQM3 Uncharacterized protein1.4e-2035.47Show/hide
Query:  ISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEAP
        I +T+C+    L      K A +DR  I S+W  R  +W  L A G SGG L++W+     + +   G +S+S+L++              P   K+   
Subjt:  ISSTLCQILVQLANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVS---------MAPTDIKIEAP

Query:  FWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP
        FW EL  LS +C + W + GDFNVTR   EK +    TRSMK F+    +  LID  L+NG +TWS+ R+ P
Subjt:  FWQELYDLSCLCSDIWMLAGDFNVTRWSHEK-SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein3.9e-0431.08Show/hide
Query:  WQELYDLSC---LCSDIWMLAGDFN----VTRWSHEKSKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNR
        W ++  LS    LC+  W++ GDFN    VT            + ++       DS+L+DLP +  LYTWS+++
Subjt:  WQELYDLSC---LCSDIWMLAGDFN----VTRWSHEKSKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAACAAGCTTGCTGAAAATTATGGTGTAGCTCCTATATTTTGCCTGTGTTGGAACTTCGACGTTCGACCATCCAAATGTTTCATCCGATACTGTTTAGCGTTTAG
TGTTTACTTATGCCCCGCCAGTGCCCACGTGTATTTGCATATTTGGGTACTTGGGAACAGTAAAACATCCTCTTTTCATGGCATCTCGGATATTAGCAGGATATCATGGG
CAGCCGTCGCCACTAGTCAAATTTGCTTCTTTTTAGCAGTTTTTTTTTTGAGAGATAATATTAATGGCACAGATATCTCAAGCACTTTATGTCAAATACTTGTACAGCTA
GCAAATCCCTTTACTACTAAATTGGCTCATCTTGACCGGCTCATCATCAAATCTTTATGGAGTGGGAGAAATATTAGTTGGACTTCTCTTGATGCTTCTGGATCCTCTGG
TGGTATTCTCATTATGTGGAATGATCCTGCCTTTGTTATCTTTAACATCACTAAAGGTGTGTATTCCCTCTCCCTCCTCCTTTCACAGGTGTCTATGGCCCCAACAGATA
TAAAGATAGAGGCCCCCTTCTGGCAAGAATTATATGATTTATCGTGCCTGTGTTCTGATATCTGGATGTTGGCTGGCGATTTTAATGTGACTCGTTGGTCCCATGAGAAA
TCTAAAGGCGGAGTTACTCGCAGTATGAAGTTCTTCAATCAGTTTTTCGCCGACTCAAATTTAATTGATCTCCCTCTTCAGAATGGCTTGTACACTTGGTCAGATAACCG
ATCTCCCCCACGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAACAAGCTTGCTGAAAATTATGGTGTAGCTCCTATATTTTGCCTGTGTTGGAACTTCGACGTTCGACCATCCAAATGTTTCATCCGATACTGTTTAGCGTTTAG
TGTTTACTTATGCCCCGCCAGTGCCCACGTGTATTTGCATATTTGGGTACTTGGGAACAGTAAAACATCCTCTTTTCATGGCATCTCGGATATTAGCAGGATATCATGGG
CAGCCGTCGCCACTAGTCAAATTTGCTTCTTTTTAGCAGTTTTTTTTTTGAGAGATAATATTAATGGCACAGATATCTCAAGCACTTTATGTCAAATACTTGTACAGCTA
GCAAATCCCTTTACTACTAAATTGGCTCATCTTGACCGGCTCATCATCAAATCTTTATGGAGTGGGAGAAATATTAGTTGGACTTCTCTTGATGCTTCTGGATCCTCTGG
TGGTATTCTCATTATGTGGAATGATCCTGCCTTTGTTATCTTTAACATCACTAAAGGTGTGTATTCCCTCTCCCTCCTCCTTTCACAGGTGTCTATGGCCCCAACAGATA
TAAAGATAGAGGCCCCCTTCTGGCAAGAATTATATGATTTATCGTGCCTGTGTTCTGATATCTGGATGTTGGCTGGCGATTTTAATGTGACTCGTTGGTCCCATGAGAAA
TCTAAAGGCGGAGTTACTCGCAGTATGAAGTTCTTCAATCAGTTTTTCGCCGACTCAAATTTAATTGATCTCCCTCTTCAGAATGGCTTGTACACTTGGTCAGATAACCG
ATCTCCCCCACGATGA
Protein sequenceShow/hide protein sequence
MLNKLAENYGVAPIFCLCWNFDVRPSKCFIRYCLAFSVYLCPASAHVYLHIWVLGNSKTSSFHGISDISRISWAAVATSQICFFLAVFFLRDNINGTDISSTLCQILVQL
ANPFTTKLAHLDRLIIKSLWSGRNISWTSLDASGSSGGILIMWNDPAFVIFNITKGVYSLSLLLSQVSMAPTDIKIEAPFWQELYDLSCLCSDIWMLAGDFNVTRWSHEK
SKGGVTRSMKFFNQFFADSNLIDLPLQNGLYTWSDNRSPPR