CuGenDBv2

Lag0029014 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029014
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr8:34255925..34261919
RNA-Seq Expression: Lag0029014
Synteny: Lag0029014
Gene Ontology terms: GO:0006281 - DNA repair (biological process)
                     GO:0004518 - nuclease activity (molecular function)
InterPro domains: IPR004808 - AP endonuclease 1
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (hit, e value, % identity, alignment)
KAA3486777.1 reverse transcriptase [Gossypium australe] | e value: 1.6e-22 | identity: 34.27%
Query:  EDLENVASRGKDDDETGTIEGNVEFNRYNEGWNLSQRLFLPSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCF
        +++EN       +D T  ++         EG +  Q+      +K ISWN  GLG+ R  R ++  +      ++FL ETK D++  +K++ SC F    
Subjt:  EDLENVASRGKDDDETGTIEGNVEFNRYNEGWNLSQRLFLPSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCF

Query:  TVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL--RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
         ++++G++GGLC+ W +  +VT+RS+SNNHI+   N  N    WRFT  YG P    KN  W L+RKL      PWL+
Subjt:  TVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL--RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

KAF5472061.1 hypothetical protein F2P56_008808 [Juglans regia] | e value: 1.2e-22 | identity: 41.91%
Query:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNLR--
        MKI SWN  GLGNPR  RT+ +L+     +VLFL ET+   +  +  K    F+ C  + SQG KGG+ +LW  +  +++ SYS NH++  I   NLR  
Subjt:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNLR--

Query:  -WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
         W  T +YGYPKT  +  TW L+R L  + D PW++
Subjt:  -WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] | e value: 1.6e-22 | identity: 45.86%
Query:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDI-NWNNLRW
        MKIISWN  GLG  RTFR  + L+   + Q+LFLSETK   K  +  +   KFE CF V   G  GGL +LW+    + ++SYS +HI+  I N N   W
Subjt:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDI-NWNNLRW

Query:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWL
        R T VYG+P++ QK  TW L+R+L      PWL
Subjt:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWL

XP_020412490.1 uncharacterized protein LOC18793550 [Prunus persica] | e value: 9.3e-23 | identity: 39.55%
Query:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDIN--WNNLR
        M ++SWN  GLGNPRT + ++ LV+++   V+FL ET+C ++  + IK    F+ CF V + G  GGLC+ W  +  + IRS S +HI+ ++    ++L 
Subjt:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDIN--WNNLR

Query:  WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWL
        WR T  YGYP T   +L+W L+R L +    PW+
Subjt:  WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWL

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis] | e value: 7.1e-23 | identity: 42.96%
Query:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNLR-W
        MK I WN  GLGNP   R +++L+      +LFL ETK   K  D +K    F  CF+V S+G  GGL +LW+    V +RS+S  HI+  I  ++L  W
Subjt:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNLR-W

Query:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLLG
        RFT +YG+P T ++  TW LIR L++    PWL+G
Subjt:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLLG

TrEMBL top hits (hit, e value, % identity, alignment)
A0A2N9HWG1 Reverse transcriptase domain-containing protein | e value: 2.6e-23 | identity: 39.18%
Query:  RGKDDDETGTIEGNVEFNRYNEGWNLSQRLFL--PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQG
        R KD ++    +G   + R   GW L        P  M IISWN  GLGN R    + NLV S+  ++LFL ETK D +  + I+   +F+ CFTV S G
Subjt:  RGKDDDETGTIEGNVEFNRYNEGWNLSQRLFL--PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQG

Query:  AKGGLCILWSDKKMVTIRSYSNNHINCDINWNN-LRWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
          GGL +LW+D   +TI+++S NHI+  +     L WRFT  YG P   ++  +W L+ KL++    PWLL
Subjt:  AKGGLCILWSDKKMVTIRSYSNNHINCDINWNN-LRWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

A0A803PDL7 Uncharacterized protein | e value: 1.4e-24 | identity: 42.34%
Query:  PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL
        P IM I+SWN  GLG P T + +K+LV  ++  ++FL ET CD+K  + +     FEGC+ V++QG  GG+ +LW +   V I S+S NHI+C ++ N L
Subjt:  PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL

Query:  -RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
          +R T +YG P   Q+  TW LIR L      PW+L
Subjt:  -RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

A0A803PU35 Uncharacterized protein | e value: 1.8e-24 | identity: 40.44%
Query:  SIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL-
        S++ +  WN  GLGNPR F+ +  LV+ ++  ++FLSET C +   + +  S  FEGCF V++QG  GGL +LW DK  V+I  YS NHI+  I+W  + 
Subjt:  SIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL-

Query:  RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
        ++R T +YG P    +  TW L+R+L       W L
Subjt:  RWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

A0A803QDP5 Uncharacterized protein | e value: 7.4e-26 | identity: 42.34%
Query:  PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL
        P IM  +SWN  GLGNPR  + +K+LV+ ++  ++FL ET C + V ++++ +  FEGCF V++QG  GGL +LW D+K+  +R +S NHI+  I+ +  
Subjt:  PSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNNL

Query:  R-WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
        R WR T +YG P    +  TW L R LN     PW L
Subjt:  R-WRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

A0A803QJ68 Uncharacterized protein | e value: 3.7e-25 | identity: 42.54%
Query:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNN-LRW
        M  +SWN  GLGNPR  + +K+LV+ ++  V+FL ET C + V +++K    FEGCF V++QG  GGL + W ++KM  +R +SNNHI+  IN N+ ++W
Subjt:  MKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGAKGGLCILWSDKKMVTIRSYSNNHINCDINWNN-LRW

Query:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL
        R T +YG P    +  TW L R L+      W L
Subjt:  RFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLL

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCAGAGAGGGCACGTGGAGGAGCCAACCATTCGGCGCGGTCAACAAAAATCGCTATCGATAATGGAGATTCGAAGCCGATTCCATTGTCAGCGCTTTTGCCAGACAT
AGTTGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCCACCCTCCTTTCAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGAAACTCATAACCATTCCAAAAT
ATTTTGTTGAAGAAGTGCATCGAATCAAGGTGAAGAATTCGCAGGGATTTGAGTATCAGAAAATATCGGTGGTGCTTTGTCCACCACCGAAGCCTGGGTTTCCTGGTATC
GTGGTGAGACTAAGTGTGACTTTGTGTGATCGAAGTATGGACATTGATGGGTTGATTAATGAGTGGAAGGACTTCAATCTTATGGAGGAAGAAAGAGAAGCGTTTATTAC
CCTAAATGCTGAAGAAGTTGGAGTAATCAAAGGGCAATTAGATTACTGTTTGATAGGGAAACTTCTTGCGAGTAGAATCCTGCAGGGAGTCTGGAAAACAAGAAACAACT
TTAGTGTTGATGTATTAAGCAAAAACGTATTCTTGTTCAAATTCGAAAGAAAAATCGAGAAAGAAGGTAGGAACGGAGAAGGTCTGGATGTAGACTTGAATCTGGAGAGT
CCATTGGCTGAAGATTTAGAAAATGTAGCAAGCAGAGGCAAAGATGATGATGAAACTGGGACTATTGAAGGCAACGTGGAGTTTAATAGGTATAATGAAGGCTGGAATCT
AAGCCAGAGGCTGTTCCTGCCATCAATTATGAAAATAATAAGTTGGAATGATTGGGGCTTGGGGAACCCGAGAACATTCCGAACTGTGAAAAACCTTGTGTTATCAAGGC
AACTTCAAGTACTGTTCCTAAGTGAAACCAAATGTGATGAGAAAGTTGCTGATAAAATAAAAGGAAGCTGCAAGTTTGAAGGGTGCTTTACTGTAAAGAGCCAAGGGGCA
AAAGGTGGATTATGCATCCTCTGGAGTGATAAAAAAATGGTGACAATTCGTTCTTATTCCAACAACCATATCAACTGTGATATAAACTGGAACAACCTCAGGTGGAGATT
CACAAGAGTTTATGGGTATCCGAAAACAGGGCAAAAAAACCTGACGTGGGGTCTAATTCGGAAACTAAATACCTCAGGGGACAAACCATGGTTGCTAGGATGA
mRNA sequence
ATGGCAGAGAGGGCACGTGGAGGAGCCAACCATTCGGCGCGGTCAACAAAAATCGCTATCGATAATGGAGATTCGAAGCCGATTCCATTGTCAGCGCTTTTGCCAGACAT
AGTTGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCCACCCTCCTTTCAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGAAACTCATAACCATTCCAAAAT
ATTTTGTTGAAGAAGTGCATCGAATCAAGGTGAAGAATTCGCAGGGATTTGAGTATCAGAAAATATCGGTGGTGCTTTGTCCACCACCGAAGCCTGGGTTTCCTGGTATC
GTGGTGAGACTAAGTGTGACTTTGTGTGATCGAAGTATGGACATTGATGGGTTGATTAATGAGTGGAAGGACTTCAATCTTATGGAGGAAGAAAGAGAAGCGTTTATTAC
CCTAAATGCTGAAGAAGTTGGAGTAATCAAAGGGCAATTAGATTACTGTTTGATAGGGAAACTTCTTGCGAGTAGAATCCTGCAGGGAGTCTGGAAAACAAGAAACAACT
TTAGTGTTGATGTATTAAGCAAAAACGTATTCTTGTTCAAATTCGAAAGAAAAATCGAGAAAGAAGGTAGGAACGGAGAAGGTCTGGATGTAGACTTGAATCTGGAGAGT
CCATTGGCTGAAGATTTAGAAAATGTAGCAAGCAGAGGCAAAGATGATGATGAAACTGGGACTATTGAAGGCAACGTGGAGTTTAATAGGTATAATGAAGGCTGGAATCT
AAGCCAGAGGCTGTTCCTGCCATCAATTATGAAAATAATAAGTTGGAATGATTGGGGCTTGGGGAACCCGAGAACATTCCGAACTGTGAAAAACCTTGTGTTATCAAGGC
AACTTCAAGTACTGTTCCTAAGTGAAACCAAATGTGATGAGAAAGTTGCTGATAAAATAAAAGGAAGCTGCAAGTTTGAAGGGTGCTTTACTGTAAAGAGCCAAGGGGCA
AAAGGTGGATTATGCATCCTCTGGAGTGATAAAAAAATGGTGACAATTCGTTCTTATTCCAACAACCATATCAACTGTGATATAAACTGGAACAACCTCAGGTGGAGATT
CACAAGAGTTTATGGGTATCCGAAAACAGGGCAAAAAAACCTGACGTGGGGTCTAATTCGGAAACTAAATACCTCAGGGGACAAACCATGGTTGCTAGGATGA
Protein sequence
MAERARGGANHSARSTKIAIDNGDSKPIPLSALLPDIVASERRSRSSFPTLLSGSKESFCGKLGFVKLITIPKYFVEEVHRIKVKNSQGFEYQKISVVLCPPPKPGFPGI
VVRLSVTLCDRSMDIDGLINEWKDFNLMEEEREAFITLNAEEVGVIKGQLDYCLIGKLLASRILQGVWKTRNNFSVDVLSKNVFLFKFERKIEKEGRNGEGLDVDLNLES
PLAEDLENVASRGKDDDETGTIEGNVEFNRYNEGWNLSQRLFLPSIMKIISWNDWGLGNPRTFRTVKNLVLSRQLQVLFLSETKCDEKVADKIKGSCKFEGCFTVKSQGA
KGGLCILWSDKKMVTIRSYSNNHINCDINWNNLRWRFTRVYGYPKTGQKNLTWGLIRKLNTSGDKPWLLG