; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007620 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007620
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:2067652..2070409
RNA-Seq ExpressionLag0007620
SyntenyLag0007620
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7825347.1 uncharacterized protein G2W53_016511 [Senna tora]1.2e-1237.91Show/hide
Query:  PSESP--ALAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNS
        PS SP     PPP    KIN+DAS     G  G G  IR+  G ++AA   K+P +  I++LEA A+K G++  + L  I +A  +VE +   ++N L S
Subjt:  PSESP--ALAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNS

Query:  SDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVI
        + S+LS +  VL+ +KS+ +    + F W PR +N VA+ LA+  SR   L I
Subjt:  SDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVI

RYR79715.1 hypothetical protein Ahy_A01g004533 [Arachis hypogaea]5.9e-1238.35Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PPP   +K N DA++ E    G      RD  GSL+AA N ++  + P+   EA A++E L    N        +IVESD+  LI  L S   V +EIQ 
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALAR
        +LD++  + R I +  F W PR  NG+AH +AR
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALAR

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]2.0e-1233.33Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PPP     +N+DASW+++   GG+GW IR   G +V A N+ +     + +LEA AI EGL+   NL  +    L +E+D+ ++ ++LN     L++  +
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVIF
        V++E+ ++    + + F    R +NG AH+LA+ AS +R  +I+
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVIF

XP_027060953.1 uncharacterized protein LOC113687569 [Coffea arabica]3.4e-1233.12Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PP   H K+N DAS +    L G+G  IRD +G  +A  ++++P     D+ EALA +E +     L  IP  + I+E DA  ++  + SSD +LS++  
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHAL---ARNASRMRFLVIFDQALSSS
        V+++++     +  +  +W PR +N VAH+L   AR++S    L  F + L +S
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHAL---ARNASRMRFLVIFDQALSSS

XP_027172119.1 uncharacterized protein LOC113771755 [Coffea eugenioides]7.6e-1232.47Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PP   H K+N DAS +    + G+G  IRD +G  +A  ++++P     D+ EALA +E +     L  IP  + I+E DA  ++  + SSD +LS++  
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHAL---ARNASRMRFLVIFDQALSSS
        V+++++     +  +  +W PR +N VAH+L   AR++S    L  F + L +S
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHAL---ARNASRMRFLVIFDQALSSS

TrEMBL top hitse value%identityAlignment
A0A2N9ESR2 Uncharacterized protein8.3e-1234.9Show/hide
Query:  EEPS-ESPALA---PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLIN
        E+P+ + PA A   PP +   K N D ++++A   GG+G  IRD QG ++A  ++K+      +M+EALA K  +   +    +    +  E DAE+LI 
Subjt:  EEPS-ESPALA---PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLIN

Query:  MLNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
         LN  +++ +    +LD++K+M +GI+        RS N VAHALAR A
Subjt:  MLNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

A0A2N9I9F4 Reverse transcriptase domain-containing protein1.1e-1134Show/hide
Query:  MSEEPSESPALA---PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLI
        M  +  + PA A   PP +   K N D ++ +A   GG+G  IRD QG ++A  ++K+      +M+EALA K  +   +    +    +  E DAE+LI
Subjt:  MSEEPSESPALA---PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLI

Query:  NMLNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
          LN  +++ +    +LD++K+M +GI+        RS N VAHALAR A
Subjt:  NMLNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

A0A6J1DNV9 uncharacterized protein LOC1110224039.7e-1333.33Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PPP     +N+DASW+++   GG+GW IR   G +V A N+ +     + +LEA AI EGL+   NL  +    L +E+D+ ++ ++LN     L++  +
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVIF
        V++E+ ++    + + F    R +NG AH+LA+ AS +R  +I+
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVIF

A0A6J5UE59 Reverse transcriptase domain-containing protein1.4e-1133.56Show/hide
Query:  EPSESPA---LAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINML
        +P+ SPA      P    LK+N DA+W+     GG+GW IRDS G L+ A  +         M+E LAI+  L +  N +      +IVESD++  I ML
Subjt:  EPSESPA---LAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINML

Query:  NSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASR
        N   +V S+++ ++ +++ +   +  + F++ PRS N  AH++A  AS+
Subjt:  NSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASR

A0A6J5WPU6 Reverse transcriptase domain-containing protein3.7e-1234.67Show/hide
Query:  EPSESPA----LAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINM
        +P  SPA      PPP L LK+N DA+W+     GG+GW IRDS G L+ A  +         M+E LAI+  L +  N +      +IVESD++  I M
Subjt:  EPSESPA----LAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINM

Query:  LNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASR
        LN   +V S+++ ++ +++ +   +  + F++ PRS N  AH++A   S+
Subjt:  LNSSDSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein4.3e-0526.32Show/hide
Query:  PYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQFVL
        P   +K N DAS +E   + GLGW IR+SQG+++     K       +  E  A+   ++     +      +I E D  + +N L ++ S    ++  L
Subjt:  PYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQFVL

Query:  DEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
        D +KS        +F++  R  N  A  L + A
Subjt:  DEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.6e-1330.6Show/hide
Query:  PPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQFV
        PPY  +K N+DA+W       G+GW +R+  G ++    + LPR   +   E  A++  +   L ++      +I ESDA+ L+N+LN SD     +Q  
Subjt:  PPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQFV

Query:  LDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
        L++++ +    +++KF + PR  N VA  +AR +
Subjt:  LDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.9e-0630.37Show/hide
Query:  PPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPID-MLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        P    + I +DA+W       G GW IR+    L A   +   RN  +  M EA+A+   L QY     I    L + SD++ LI  + +S+S  +E   
Subjt:  PPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPID-MLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
        ++ ++ +++ G  D+ F + PRS N VA  LA+++
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1027.41Show/hide
Query:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF
        PPP+  +K N+DA+WN      G+GW +R+ +G +     + LP+   +   E  A++  +   L+L+      +I ESD++ LI +LN +D +   ++ 
Subjt:  PPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEIQF

Query:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
         + +++ +     ++KF++ PR  N +A  +AR +
Subjt:  VLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0827.78Show/hide
Query:  EPSESPALAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSS
        +PS +   +PP    LK N DAS +E   + GLGW +R+SQG+++     K       +  E   +   ++            +I E D + +  M+N+ 
Subjt:  EPSESPALAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSS

Query:  DSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA
         S    +Q  LD ++S     + I+F +  R  NG A  LA+ A
Subjt:  DSVLSEIQFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGAAGAGCCTAGCGAGTCACCAGCATTGGCGCCCCCTCCTTATTTGCATCTGAAAATAAACTCTGATGCCTCTTGGAACGAGGCCCTGGGTCTCGGAGGTTTGGG
TTGGGCCATCCGTGACTCCCAAGGATCTTTAGTGGCTGCATGCAACAAAAAGTTACCCAGGAATTGGCCGATTGACATGCTAGAAGCTCTAGCAATCAAAGAGGGGTTGA
AGCAGTACCTTAATTTGAATCCCATCCCCCAAGCTGCTTTGATTGTGGAATCTGATGCTGAGGACCTGATCAACATGCTAAACTCATCTGACTCTGTTCTCTCCGAAATC
CAATTTGTCCTGGACGAAGTGAAATCGATGGCTAGAGGCATCAAGGATATAAAATTCCTTTGGTGCCCTAGATCTTCTAATGGCGTAGCGCATGCTCTCGCGCGAAATGC
TTCTCGAATGAGATTTCTTGTCATATTTGATCAGGCATTGAGTTCAAGCGATAATCTCTCTTTGTGCTTCTTTCTCATGTCGTCATGGTCGTTATCAAATTCACTTGGAG
TGACTTCACCTCCTGTCTACACCAGTCACTTTGATGACTGTAGTATCCTGCAGGGGTGGTCACCAAATTCGCCTATAGTGGTCGTCATCGAATTCAGTGATTTCACCTCT
CATGAATCATGGACTGCTATTAAGCTGGTATTGATGCAACTAAGGACATGGGTTGTCGAAGCTCTGCTCGAACTGAGATCTGAAGTTGTAGGTGTTTGGTGTACGAGCTT
CCTTGGGAGCACTTTGGAGCAAAAAGCCAAATCACCGTCAGCATCGAGAAGCCGGCCATGGCGTCTCGACGCGCTGCCTCGCTATTTAAGTTGTTGTTTTCCTGATTTTA
GGTCATATTCTCCGCTGTTTCACTTCGTTTCAGCCGTCTTCCTCTCCGATTCTTCTCTGTTTTCTACATTTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCGAAGAGCCTAGCGAGTCACCAGCATTGGCGCCCCCTCCTTATTTGCATCTGAAAATAAACTCTGATGCCTCTTGGAACGAGGCCCTGGGTCTCGGAGGTTTGGG
TTGGGCCATCCGTGACTCCCAAGGATCTTTAGTGGCTGCATGCAACAAAAAGTTACCCAGGAATTGGCCGATTGACATGCTAGAAGCTCTAGCAATCAAAGAGGGGTTGA
AGCAGTACCTTAATTTGAATCCCATCCCCCAAGCTGCTTTGATTGTGGAATCTGATGCTGAGGACCTGATCAACATGCTAAACTCATCTGACTCTGTTCTCTCCGAAATC
CAATTTGTCCTGGACGAAGTGAAATCGATGGCTAGAGGCATCAAGGATATAAAATTCCTTTGGTGCCCTAGATCTTCTAATGGCGTAGCGCATGCTCTCGCGCGAAATGC
TTCTCGAATGAGATTTCTTGTCATATTTGATCAGGCATTGAGTTCAAGCGATAATCTCTCTTTGTGCTTCTTTCTCATGTCGTCATGGTCGTTATCAAATTCACTTGGAG
TGACTTCACCTCCTGTCTACACCAGTCACTTTGATGACTGTAGTATCCTGCAGGGGTGGTCACCAAATTCGCCTATAGTGGTCGTCATCGAATTCAGTGATTTCACCTCT
CATGAATCATGGACTGCTATTAAGCTGGTATTGATGCAACTAAGGACATGGGTTGTCGAAGCTCTGCTCGAACTGAGATCTGAAGTTGTAGGTGTTTGGTGTACGAGCTT
CCTTGGGAGCACTTTGGAGCAAAAAGCCAAATCACCGTCAGCATCGAGAAGCCGGCCATGGCGTCTCGACGCGCTGCCTCGCTATTTAAGTTGTTGTTTTCCTGATTTTA
GGTCATATTCTCCGCTGTTTCACTTCGTTTCAGCCGTCTTCCTCTCCGATTCTTCTCTGTTTTCTACATTTCTTTAG
Protein sequenceShow/hide protein sequence
MSEEPSESPALAPPPYLHLKINSDASWNEALGLGGLGWAIRDSQGSLVAACNKKLPRNWPIDMLEALAIKEGLKQYLNLNPIPQAALIVESDAEDLINMLNSSDSVLSEI
QFVLDEVKSMARGIKDIKFLWCPRSSNGVAHALARNASRMRFLVIFDQALSSSDNLSLCFFLMSSWSLSNSLGVTSPPVYTSHFDDCSILQGWSPNSPIVVVIEFSDFTS
HESWTAIKLVLMQLRTWVVEALLELRSEVVGVWCTSFLGSTLEQKAKSPSASRSRPWRLDALPRYLSCCFPDFRSYSPLFHFVSAVFLSDSSLFSTFL