; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022111 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022111
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:18592700..18593656
RNA-Seq ExpressionLag0022111
SyntenyLag0022111
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]3.6e-4240.45Show/hide
Query:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKDSTDWWR
        MK LCWNV GLGNP  FR+LR++VR   P L+FL ETK +  +E + KREL FD C+ V S G  GGL L+W S   V + S S GHI+    D    WR
Subjt:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKDSTDWWR

Query:  FTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQM---------DDF---RDAVNYC-----RLLDPGFKDNKPI
        FT FYGNP   KR  SW LL+RL  +   +LPW I+GGDFNE +   EK GG  R + QM         D F      +N C       L+    D++PI
Subjt:  FTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQM---------DDF---RDAVNYC-----RLLDPGFKDNKPI

Query:  VASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAAKSALCIKECLAWNK
        +AS      R      +  +RFEE W + +  +++I   W S+       + AK   C+     WNK
Subjt:  VASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAAKSALCIKECLAWNK

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata]1.5e-4034.6Show/hide
Query:  RDIGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLG
        R+ GGGW PAPP  M  L WN RGLGNP+    L  +VR+++P LIFL+ETK +  +  ++ R++ + N   VP   + GGL+L W +   V V S+S  
Subjt:  RDIGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLG

Query:  HIN-VTGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGF--------
        HI+ +      D WRFT FYG+P    R++SWS+L+ L       LPW+ + GDFNE +++ EK G  DR + QM  FRDA+++CRL D GF        
Subjt:  HIN-VTGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGF--------

Query:  --------------------------------------KDNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWA--AK
                                               D+KPI+ S     NR   +K R    FE  W K    + +I+ +W  V    A  W   +K
Subjt:  --------------------------------------KDNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWA--AK

Query:  SALCIKECLAWNKQT
         + C      WNK+T
Subjt:  SALCIKECLAWNKQT

XP_030929385.1 uncharacterized protein LOC115955407 [Quercus lobata]5.1e-4136.18Show/hide
Query:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW
        M+ L WN RGLGNPR+ R LR++V+  +P  +FL ETK   +   + K  + F N LV+PS G  GGL+L+W+  I V V S+S  HI+ +  +D    W
Subjt:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW

Query:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL
        R T FYGNP V +RK+SW LLK L       LPW+   G+FNE + + EK GG  R Q QMDDFR+A+NYCR +D GF   +    ++  G +R   R  
Subjt:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL

Query:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN
        R+L+                                          +FE  WT+ +E +++I+  W+ SV+ YS     A    C  +   WN
Subjt:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]2.5e-4036.18Show/hide
Query:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW
        M+ L WN RGLGNP++ R LR++V+  +P  +FL ETK   R   + K  + F N LV+PS G  GGL+L+W+  I V V S+S  HI+ +  +D    W
Subjt:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW

Query:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL
        R T FYGNP V +RK+SW LLK L       LPW+   GDFNE + + EK GG  R Q QMDDFR+A++ CR +D GF   +    ++  G +R   R  
Subjt:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL

Query:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN
        R+L+                                           FE  WT+ +E +++I+ AW+ SV+ YS     A    C  +   WN
Subjt:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]3.3e-4036.18Show/hide
Query:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW
        M+ L WN RGLGNP++ R LR++V+  +P  +FL ETK   R   + K  + F N LV+PS G  GGL+L+W+  I V V S+S  HI+ +  +D    W
Subjt:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHIN-VTGKDSTDWW

Query:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL
        R T FYGNP V +RK+SW LLK L       LPW+   GDFNE + + EK GG  R Q QMDDFR+A++ CR +D GF   +    ++  G +R   R  
Subjt:  RFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKL

Query:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN
        R+L+                                           FE  WT+ +E +++I+ AW+ SV+ YS     A    C  +   WN
Subjt:  RSLL------------------------------------------RFEECWTKHEETKNLIQKAWS-SVSSYSAGDWAAKSALCIKECLAWN

TrEMBL top hitse value%identityAlignment
A0A2N9G1F9 Uncharacterized protein2.1e-4034.88Show/hide
Query:  APPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINV-TGKD
        APP+ M +L WN +GLGNP A R+L H+V+ + P ++FL+ETK D      ++ +L FDN   VPS G  GGL+L+WK+  +V + +YS  HI+      
Subjt:  APPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINV-TGKD

Query:  STDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK-------DNKPIVASL
            WR T FYG P   +R++SW+LLK L  +  D LPW  L GDFNE +  +EK GG +R   Q+ +F++AVN C  +D GF+       +N+   A++
Subjt:  STDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK-------DNKPIVASL

Query:  GCGGNRPI---------------ARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAG--DWAAKSALCIKECLAWNKQ
            +R +               +R  + + RFEE W  + + + LIQ++W    S  +       K + C    + W+++
Subjt:  GCGGNRPI---------------ARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAG--DWAAKSALCIKECLAWNKQ

A0A2N9HDH5 Uncharacterized protein1.6e-4033.12Show/hide
Query:  IGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHI
        +GGG   APP  M I+ WN RGLGN RA  +L ++V+S+ P ++FL+ETK D R    ++ +L+F  C  VPS G  GGL+L+W   +Q+ + ++S+ HI
Subjt:  IGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHI

Query:  NV-TGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK---------
        +          WRFT FYGNP   +R+ SW+LL +L  + S  LPW+++ GDFNE + S E+SG     Q  M +F + +N+C L+D G++         
Subjt:  NV-TGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK---------

Query:  -------------------------------------DNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAA--KSA
                                             D+ PI+     G  R  A + R   +FEE W+ H E + +IQK W  V+   +  +    K  
Subjt:  -------------------------------------DNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAA--KSA

Query:  LCIKECLAWNK
         C +E   W K
Subjt:  LCIKECLAWNK

A0A2N9HLP3 Uncharacterized protein6.8e-3933.99Show/hide
Query:  PAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKD
        PAPP AM  L WN RGLGNPR  + L  +V +++P ++FL+E   D     +L+ +L+FDN  V  S    GGL L+WKSSI + +NS+S  HI+    D
Subjt:  PAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKD

Query:  ST-DWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGF----------------
        +T + WRFT FYG P   KR++SW+LL+RL       LPW  + GDFNE +   EK G   R + QM  FRD ++ C  +D GF                
Subjt:  ST-DWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGF----------------

Query:  -----------------------------KDNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAW-SSVSSYSAGDWAAKSALCIKECLA
                                      D+KP+  S       P+  ++R   RFEE WT  +  +  +  AW  SV+         K   C ++  +
Subjt:  -----------------------------KDNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAW-SSVSSYSAGDWAAKSALCIKECLA

Query:  WNK
        W+K
Subjt:  WNK

A0A2N9HWG1 Reverse transcriptase domain-containing protein3.6e-4032.8Show/hide
Query:  IGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHI
        +GGG   APP  M I+ WN RGLGN RA  +L ++V+S+ P ++FL+ETK D R    ++ +L+F  C  VPS G  GGL+L+W   +Q+ + ++S+ HI
Subjt:  IGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHI

Query:  NV-TGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK---------
        +          WRFT FYGNP V +R++SW+LL +L  + S  LPW+++ GDFNE + S E++G     Q  M +F + +N C L+D G++         
Subjt:  NV-TGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFK---------

Query:  -------------------------------------DNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAA--KSA
                                             D+ PI+     G  R  A + R   +FEE W+ H E + +IQK W   +   +  +    K  
Subjt:  -------------------------------------DNKPIVASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAA--KSA

Query:  LCIKECLAWNK
         C +E   W K
Subjt:  LCIKECLAWNK

A0A6J1DUG8 uncharacterized protein LOC1110241351.7e-4240.45Show/hide
Query:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKDSTDWWR
        MK LCWNV GLGNP  FR+LR++VR   P L+FL ETK +  +E + KREL FD C+ V S G  GGL L+W S   V + S S GHI+    D    WR
Subjt:  MKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINVTGKDSTDWWR

Query:  FTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQM---------DDF---RDAVNYC-----RLLDPGFKDNKPI
        FT FYGNP   KR  SW LL+RL  +   +LPW I+GGDFNE +   EK GG  R + QM         D F      +N C       L+    D++PI
Subjt:  FTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQM---------DDF---RDAVNYC-----RLLDPGFKDNKPI

Query:  VASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAAKSALCIKECLAWNK
        +AS      R      +  +RFEE W + +  +++I   W S+       + AK   C+     WNK
Subjt:  VASLGCGGNRPIARKLRSLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAAKSALCIKECLAWNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCGAGGTCATCGAAGGGATATCGGCGGAGGCTGGTTGCCAGCCCCGCCGGACGCCATGAAAATCCTGTGCTGGAATGTCCGGGGATTGGGGAATCCTCGGGCATT
CCGATCGCTTCGACACGTTGTCAGAAGCGAAAATCCCCTACTGATTTTTCTTTTAGAGACTAAAAGTGATTGTAGAATTGAGTCCAAGTTGAAGAGAGAGCTAAAGTTTG
ATAATTGTCTGGTGGTTCCAAGCGATGGGAGTTATGGGGGGCTTAGTTTGATCTGGAAGTCGTCCATTCAGGTGTGTGTGAACTCTTATTCGCTTGGTCATATTAATGTC
ACGGGGAAAGATAGCACAGATTGGTGGAGGTTTACGAGTTTCTATGGGAACCCCTCGGTTGAAAAAAGAAAGGACTCTTGGTCCCTTTTGAAGAGGTTGGTTGATATTGG
TAGTGATAATTTGCCTTGGATGATCCTGGGAGGCGATTTTAATGAGTTCATTTTCAGTCATGAGAAATCAGGTGGAGGAGATAGAAGACAGGGGCAGATGGATGATTTTA
GGGATGCGGTGAATTATTGCAGATTGTTGGACCCTGGCTTTAAAGACAACAAACCAATTGTGGCTTCTCTTGGTTGCGGTGGCAACAGGCCGATCGCGAGGAAGTTGAGG
AGCCTTTTGAGGTTTGAGGAATGTTGGACTAAGCACGAAGAGACTAAAAATCTGATTCAGAAAGCCTGGAGTTCAGTGTCTTCCTATAGCGCTGGCGATTGGGCTGCTAA
GTCTGCGCTTTGCATCAAGGAATGTCTAGCGTGGAATAAACAGACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCGAGGTCATCGAAGGGATATCGGCGGAGGCTGGTTGCCAGCCCCGCCGGACGCCATGAAAATCCTGTGCTGGAATGTCCGGGGATTGGGGAATCCTCGGGCATT
CCGATCGCTTCGACACGTTGTCAGAAGCGAAAATCCCCTACTGATTTTTCTTTTAGAGACTAAAAGTGATTGTAGAATTGAGTCCAAGTTGAAGAGAGAGCTAAAGTTTG
ATAATTGTCTGGTGGTTCCAAGCGATGGGAGTTATGGGGGGCTTAGTTTGATCTGGAAGTCGTCCATTCAGGTGTGTGTGAACTCTTATTCGCTTGGTCATATTAATGTC
ACGGGGAAAGATAGCACAGATTGGTGGAGGTTTACGAGTTTCTATGGGAACCCCTCGGTTGAAAAAAGAAAGGACTCTTGGTCCCTTTTGAAGAGGTTGGTTGATATTGG
TAGTGATAATTTGCCTTGGATGATCCTGGGAGGCGATTTTAATGAGTTCATTTTCAGTCATGAGAAATCAGGTGGAGGAGATAGAAGACAGGGGCAGATGGATGATTTTA
GGGATGCGGTGAATTATTGCAGATTGTTGGACCCTGGCTTTAAAGACAACAAACCAATTGTGGCTTCTCTTGGTTGCGGTGGCAACAGGCCGATCGCGAGGAAGTTGAGG
AGCCTTTTGAGGTTTGAGGAATGTTGGACTAAGCACGAAGAGACTAAAAATCTGATTCAGAAAGCCTGGAGTTCAGTGTCTTCCTATAGCGCTGGCGATTGGGCTGCTAA
GTCTGCGCTTTGCATCAAGGAATGTCTAGCGTGGAATAAACAGACTTAA
Protein sequenceShow/hide protein sequence
MLRGHRRDIGGGWLPAPPDAMKILCWNVRGLGNPRAFRSLRHVVRSENPLLIFLLETKSDCRIESKLKRELKFDNCLVVPSDGSYGGLSLIWKSSIQVCVNSYSLGHINV
TGKDSTDWWRFTSFYGNPSVEKRKDSWSLLKRLVDIGSDNLPWMILGGDFNEFIFSHEKSGGGDRRQGQMDDFRDAVNYCRLLDPGFKDNKPIVASLGCGGNRPIARKLR
SLLRFEECWTKHEETKNLIQKAWSSVSSYSAGDWAAKSALCIKECLAWNKQT