; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016057 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016057
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:32637768..32639441
RNA-Seq ExpressionLag0016057
SyntenyLag0016057
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067681.1 UMP-CMP kinase isoform X1 [Cucumis melo var. makuwa]7.3e-3334.53Show/hide
Query:  GSGFAESTLVSEFITASRAWDVAKLSRVL---------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKI
        G   A   LVS FIT  R WD+  L  VL                            G ++V S Y+L+M   ++   S S   + WW  LWKL IP+KI
Subjt:  GSGFAESTLVSEFITASRAWDVAKLSRVL---------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKI

Query:  KVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDR
        K+ MW+AFH  LPV+T L+ R +D++P+C  C +  ET+   + GC  A   W   LP V    ++   F+D C+ +   LS  +F LWC+GCW LW+D 
Subjt:  KVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDR

Query:  NAVRMERQIPSLVVKCDWVEEFI
        N  R+   +  +  KC W+  ++
Subjt:  NAVRMERQIPSLVVKCDWVEEFI

XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]3.0e-2641.78Show/hide
Query:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN
        +F+W+ F+ C+P M +L+KRG+D  P C  C +  ET DHAL  C RA   W+++LP    + DF +S  D  L +   LS  +F L  VG WA+WNDRN
Subjt:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN

Query:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQPA--SLFTSEDARSS
        A+RM+RQIP   ++ DW+  ++  F Q R+ P+       EDA S+
Subjt:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQPA--SLFTSEDARSS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]7.1e-2830.29Show/hide
Query:  AESTLVSEFITASRAWDVAKLSRVL----------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVF
        A  T V+ FITA   WDV  +S                                G Y+VRS YKLYM++K  A+S+ +      W  +WKL +P+KIK+F
Subjt:  AESTLVSEFITASRAWDVAKLSRVL----------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVF

Query:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV-CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNA
        +W++ H  +P   +L+ RG+   P C  C +  E++ HA   C RA   W  + P + C   +   SF +   ++  QL  ++  L  +  W +WNDRN+
Subjt:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV-CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNA

Query:  VRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDAR
        +   +Q+  +  KC+W+  F+ S  QA+    S  T  + R
Subjt:  VRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDAR

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]3.4e-3031.43Show/hide
Query:  YVGSGFAESTLVSEFITASRAWDV---------AKLSRVLH-------------------GCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIP
        Y   G   + LV++ IT  R WD+         A ++RVL                    G YNV+S Y   +++     S+CS  +E WW + WKL +P
Subjt:  YVGSGFAESTLVSEFITASRAWDV---------AKLSRVLH-------------------GCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIP

Query:  SKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALW
         K+++F+WK FH+ LPV   L +R +  +P C  C+   ETV HAL  C RA   W +   S+ ++T    S AD  L + + LS  E  L+ V CW++W
Subjt:  SKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALW

Query:  NDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDA
        ++RNA+     + +      +   ++  F QAR + A   T+  A
Subjt:  NDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDA

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]3.3e-2528.51Show/hide
Query:  GFAESTLVSEFITASRAWDVAKL----------------------------SRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIK
        G   + LV++ IT  R WD+  L                            S    G YNV+S Y+L ++      ++ S  +E WW   WK+ +P K++
Subjt:  GFAESTLVSEFITASRAWDVAKL----------------------------SRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIK

Query:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN
        +F+WK FHS LPV   L +R +  +P C  C+   E+++HAL  C RA   W +    + + T    + AD  L + + LS  EF L+ V CW  W++RN
Subjt:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN

Query:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQ-----PASLFTSEDARSS
        A+     + S      +   ++  F  AR +     P S+  ++   SS
Subjt:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQ-----PASLFTSEDARSS

TrEMBL top hitse value%identityAlignment
A0A5A7VQ24 Adenylate kinase3.5e-3334.53Show/hide
Query:  GSGFAESTLVSEFITASRAWDVAKLSRVL---------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKI
        G   A   LVS FIT  R WD+  L  VL                            G ++V S Y+L+M   ++   S S   + WW  LWKL IP+KI
Subjt:  GSGFAESTLVSEFITASRAWDVAKLSRVL---------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKI

Query:  KVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDR
        K+ MW+AFH  LPV+T L+ R +D++P+C  C +  ET+   + GC  A   W   LP V    ++   F+D C+ +   LS  +F LWC+GCW LW+D 
Subjt:  KVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDR

Query:  NAVRMERQIPSLVVKCDWVEEFI
        N  R+   +  +  KC W+  ++
Subjt:  NAVRMERQIPSLVVKCDWVEEFI

A0A6J1CTE3 uncharacterized protein LOC1110145781.4e-2641.78Show/hide
Query:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN
        +F+W+ F+ C+P M +L+KRG+D  P C  C +  ET DHAL  C RA   W+++LP    + DF +S  D  L +   LS  +F L  VG WA+WNDRN
Subjt:  VFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRN

Query:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQPA--SLFTSEDARSS
        A+RM+RQIP   ++ DW+  ++  F Q R+ P+       EDA S+
Subjt:  AVRMERQIPSLVVKCDWVEEFIVSFLQAREQPA--SLFTSEDARSS

A0A6J1DX30 uncharacterized protein LOC1110248743.4e-2830.29Show/hide
Query:  AESTLVSEFITASRAWDVAKLSRVL----------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVF
        A  T V+ FITA   WDV  +S                                G Y+VRS YKLYM++K  A+S+ +      W  +WKL +P+KIK+F
Subjt:  AESTLVSEFITASRAWDVAKLSRVL----------------------------HGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVF

Query:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV-CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNA
        +W++ H  +P   +L+ RG+   P C  C +  E++ HA   C RA   W  + P + C   +   SF +   ++  QL  ++  L  +  W +WNDRN+
Subjt:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV-CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNA

Query:  VRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDAR
        +   +Q+  +  KC+W+  F+ S  QA+    S  T  + R
Subjt:  VRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDAR

A0A7J6HXN5 zf-RVT domain-containing protein7.9e-2531.69Show/hide
Query:  DVAKLSRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCT
        DV   +    G YNV+  Y+L +++     S+CS  +E WW   WK+ +P K+++F+WK FH+ L V   L +R +  +P C  C+   ET++HAL  C 
Subjt:  DVAKLSRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCT

Query:  RAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAR
        RA   W +    + + T    S AD  L + S LS  +F L+ V CW++W++RN +     + +      +   ++  F  AR
Subjt:  RAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALWNDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAR

A0A803Q2K8 Uncharacterized protein9.3e-2628.46Show/hide
Query:  YVGSGFAESTLVSEFITASRAWDVAKL----------------------------SRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIP
        Y   G   + LV++ IT  R WD+  L                            S    G YNV+S Y+L ++      ++ S  +E WW   WK+ +P
Subjt:  YVGSGFAESTLVSEFITASRAWDVAKL----------------------------SRVLHGCYNVRSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIP

Query:  SKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALW
         K+++F+WK FHS LPV   L +R +  +P C  C+   E+++HAL  C RA   W +    + + T    + AD  L + + LS  EF L+ V CW  W
Subjt:  SKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVGCWALW

Query:  NDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAREQ-----PASLFTSEDARSS
        ++RNA+     + S      +   ++  F  AR +     P S+  ++   SS
Subjt:  NDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAREQ-----PASLFTSEDARSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein4.0e-0528.32Show/hide
Query:  KIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVG------
        KIK+F+WKA    LPV   L++R +     C +C    ET  H L  C  A   W++           GH      +     L ++  +L  VG      
Subjt:  KIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLSEREFILWCVG------

Query:  ----CWALWNDRN
            CW +W  RN
Subjt:  ----CWALWNDRN

AT2G02650.1 Ribonuclease H-like superfamily protein2.3e-0827.91Show/hide
Query:  LWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLS--------
        +WKL +  KIK F+W+     L   T L  R +D  P+C +C    ET+ H +  C      W      +  +     SF D  L    QLS        
Subjt:  LWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFADRCLAVQSQLS--------

Query:  EREFILWCVGCWALWNDRNAVRMERQIPS
        +R    W +  W LW  RN    +++  S
Subjt:  EREFILWCVGCWALWNDRNAVRMERQIPS

AT3G25270.1 Ribonuclease H-like superfamily protein2.8e-0626.19Show/hide
Query:  LWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFW--------SVMLPSVCWETDFGHSFADRCLAVQSQLS
        +WKL    KIK F+WK     L    +L +R +   P C +C +  ET  H    C  A   W         +    +  ET         CLA   Q  
Subjt:  LWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFW--------SVMLPSVCWETDFGHSFADRCLAVQSQLS

Query:  EREFILWCVGCWALWNDRNAVRMERQ
             +W +  W LW  RN +  +++
Subjt:  EREFILWCVGCWALWNDRNAVRMERQ

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.0e-0835.37Show/hide
Query:  WWRDLWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVC----WET
        W  D+W L I  KIK+ +WKA ++ LPV   L+ R + + P C +C +  ET+ H L  C  A     V++ S+     W+T
Subjt:  WWRDLWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVC----WET

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.2e-0523.36Show/hide
Query:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV--------CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWA
        MW +    LP    L+  GL ++P+C  CS+  ET DH ++ C  +   W+++   +         WE+        +     S    R+ +     C A
Subjt:  MWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSV--------CWETDFGHSFADRCLAVQSQLSEREFILWCVGCWA

Query:  LWNDRNAV--RMERQIPSLVVKCDWVEEFIVSFLQAR
        +W  RN +   ++  +PS++ K   ++  I++ + AR
Subjt:  LWNDRNAV--RMERQIPSLVVKCDWVEEFIVSFLQAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATTAGTCGACAAACTCGACATGGAGGATTACCAATTTATCTCAAAGGTTCGCGAATTGATCTCTTGATCAATCAAACCATCGCACCGATGAACAAGATTTCGAT
CACCGTGAAACTCCCCAGTCGAACTAGACAAAGTATTGGGGATACAGAAATGAAAGCAATTGCTGAGTCAAATCATTATGTTGGAGATTTCCAGAGGCTGGTTAATGGGT
ATGTTGGCTCTGGGTTTGCTGAATCTACCTTAGTCAGTGAGTTTATTACTGCTAGCAGAGCATGGGATGTTGCAAAGCTTAGCCGAGTTTTGCATGGTTGTTATAATGTG
CGGAGCGTGTATAAGTTGTACATGAATATAAAATCAGTTGCTTCATCTTCTTGTTCACGTCGGGTTGAAAAGTGGTGGCGGGATCTATGGAAATTGTGTATACCTTCAAA
AATTAAGGTTTTCATGTGGAAGGCTTTTCACTCATGTCTTCCTGTCATGACTTCGTTAATGAAACGTGGTTTGGATGTCGCCCCGGTTTGTTTTCAGTGTTCTGAAGTTG
CTGAAACGGTTGATCATGCGCTGGTAGGTTGTACTCGGGCTATGATGTTTTGGTCTGTAATGTTGCCTTCGGTATGTTGGGAAACTGATTTTGGCCACAGTTTTGCTGAT
CGGTGTTTGGCAGTCCAATCCCAGCTCTCGGAAAGGGAGTTTATTCTATGGTGTGTGGGTTGTTGGGCTTTGTGGAATGACCGAAATGCAGTTCGGATGGAGAGGCAAAT
TCCGAGTTTGGTCGTGAAGTGTGACTGGGTGGAAGAGTTTATTGTCTCATTCCTACAAGCTCGTGAACAGCCTGCTTCTTTGTTTACATCTGAAGATGCACGCTCTTCTA
GGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATTAGTCGACAAACTCGACATGGAGGATTACCAATTTATCTCAAAGGTTCGCGAATTGATCTCTTGATCAATCAAACCATCGCACCGATGAACAAGATTTCGAT
CACCGTGAAACTCCCCAGTCGAACTAGACAAAGTATTGGGGATACAGAAATGAAAGCAATTGCTGAGTCAAATCATTATGTTGGAGATTTCCAGAGGCTGGTTAATGGGT
ATGTTGGCTCTGGGTTTGCTGAATCTACCTTAGTCAGTGAGTTTATTACTGCTAGCAGAGCATGGGATGTTGCAAAGCTTAGCCGAGTTTTGCATGGTTGTTATAATGTG
CGGAGCGTGTATAAGTTGTACATGAATATAAAATCAGTTGCTTCATCTTCTTGTTCACGTCGGGTTGAAAAGTGGTGGCGGGATCTATGGAAATTGTGTATACCTTCAAA
AATTAAGGTTTTCATGTGGAAGGCTTTTCACTCATGTCTTCCTGTCATGACTTCGTTAATGAAACGTGGTTTGGATGTCGCCCCGGTTTGTTTTCAGTGTTCTGAAGTTG
CTGAAACGGTTGATCATGCGCTGGTAGGTTGTACTCGGGCTATGATGTTTTGGTCTGTAATGTTGCCTTCGGTATGTTGGGAAACTGATTTTGGCCACAGTTTTGCTGAT
CGGTGTTTGGCAGTCCAATCCCAGCTCTCGGAAAGGGAGTTTATTCTATGGTGTGTGGGTTGTTGGGCTTTGTGGAATGACCGAAATGCAGTTCGGATGGAGAGGCAAAT
TCCGAGTTTGGTCGTGAAGTGTGACTGGGTGGAAGAGTTTATTGTCTCATTCCTACAAGCTCGTGAACAGCCTGCTTCTTTGTTTACATCTGAAGATGCACGCTCTTCTA
GGATTTGA
Protein sequenceShow/hide protein sequence
MQISRQTRHGGLPIYLKGSRIDLLINQTIAPMNKISITVKLPSRTRQSIGDTEMKAIAESNHYVGDFQRLVNGYVGSGFAESTLVSEFITASRAWDVAKLSRVLHGCYNV
RSVYKLYMNIKSVASSSCSRRVEKWWRDLWKLCIPSKIKVFMWKAFHSCLPVMTSLMKRGLDVAPVCFQCSEVAETVDHALVGCTRAMMFWSVMLPSVCWETDFGHSFAD
RCLAVQSQLSEREFILWCVGCWALWNDRNAVRMERQIPSLVVKCDWVEEFIVSFLQAREQPASLFTSEDARSSRI