; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035637 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035637
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:26021204..26022613
RNA-Seq ExpressionLag0035637
SyntenyLag0035637
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON66242.1 hypothetical protein PanWU01x14_111080 [Parasponia andersonii]9.6e-1752.44Show/hide
Query:  LQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS
        L+  E ++  I  ++E+YWKQR R SWLKWGD NTR+FH +AS R+ RN IRGL+D+ G+ R E  EI  +V +YF+ IFT+
Subjt:  LQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua]2.1e-1634Show/hide
Query:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQ-------------VQQLDRRGSG------RGDWKCLEVLRRWGRGS
        SDH PI+  L+P+V+      +R++RFE  W         +RD W +    G Q             +   ++R  G      +   + L+ L+    GS
Subjt:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQ-------------VQQLDRRGSG------RGDWKCLEVLRRWGRGS

Query:  --ADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPS-----ERDID
          A+ QA   +++ +   EE+ WKQRSR  WL+ GD+NTR+FHT+AS R++RN I  L    G   +E  E+  LVS YF ++F+SS P       RDID
Subjt:  --ADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPS-----ERDID

XP_024163967.1 uncharacterized protein LOC112170934 [Rosa chinensis]3.9e-1833.98Show/hide
Query:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWFW-------IRDLWRWLEGVGGQVQQLDRRGSGRGDWKCLEVLRRWGRGSADLQAAEARL---------
        SDH P+L+  A   R V    ++ +RFEE W+        I+  W      G  +QQ++ +    GD      LR W R   + Q  E R+         
Subjt:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWFW-------IRDLWRWLEGVGGQVQQLDRRGSGRGDWKCLEVLRRWGRGSADLQAAEARL---------

Query:  -------------------EAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERD
                             +   +E YW+QRSR  WLK GDRNT +FH +AS RR RNLI+GL+D  G+ + EP EI  ++ +YF+ IF++    E  
Subjt:  -------------------EAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERD

Query:  IDVVTA
        I VVTA
Subjt:  IDVVTA

XP_030933437.1 uncharacterized protein LOC115959237 [Quercus lobata]2.8e-1633.51Show/hide
Query:  DHRPILLSLAPLVRMVDAHGSRIYRFEEAWFW-------IRDLWRWL-----EGVGGQVQQLDRRGSGRGDW----------------KCLEVL---RRW
        DHRPI+L     +R      +  +RFEEAW         I + W  +      G+    ++++  GS    W                K +EVL      
Subjt:  DHRPILLSLAPLVRMVDAHGSRIYRFEEAWFW-------IRDLWRWL-----EGVGGQVQQLDRRGSGRGDW----------------KCLEVL---RRW

Query:  GRGSADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS
            A++ AA   L+ + L++E+YW QRSR SWL+ GD+NT++FH++AS R++RN I G+ D       EP EI  +  EYFE+IF+S
Subjt:  GRGSADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS

XP_030940187.1 uncharacterized protein LOC115965136 [Quercus lobata]2.1e-1644.86Show/hide
Query:  LEVLRRWGRGSADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSE
        LE L      + ++Q  +  +      EEV W QRSR  W+KWGDRNT++FH  A+ RR RN I GLVDS G+ +++PG + G+  +YFE IF S+ PS 
Subjt:  LEVLRRWGRGSADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSE

Query:  RDIDVVT
         D  V T
Subjt:  RDIDVVT

TrEMBL top hitse value%identityAlignment
A0A2N9HFT1 Uncharacterized protein1.6e-1736.87Show/hide
Query:  DLCRSDHRPILLSLAPLV---RMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQVQQLDRRGSGRGDWKCLEVLRRWGRGSADLQAAEARLEA
        D   SDH+P+ LS  P V   R+V    ++ +RFEE W         I + W        Q  Q+++    R + K  E     G+ SA + +  A +  
Subjt:  DLCRSDHRPILLSLAPLV---RMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQVQQLDRRGSGRGDWKCLEVLRRWGRGSADLQAAEARLEA

Query:  IFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERD
        +  +EE  W+QRSR  WLK GDRNT +FH++A+ R++RN I GL DS G  + +P ++  L+  YF+NIF SS PS  D
Subjt:  IFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERD

A0A2P5CYW5 Uncharacterized protein4.7e-1752.44Show/hide
Query:  LQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS
        L+  E ++  I  ++E+YWKQR R SWLKWGD NTR+FH +AS R+ RN IRGL+D+ G+ R E  EI  +V +YF+ IFT+
Subjt:  LQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS

A0A2U1KHJ0 CCHC-type domain-containing protein1.0e-1634Show/hide
Query:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQ-------------VQQLDRRGSG------RGDWKCLEVLRRWGRGS
        SDH PI+  L+P+V+      +R++RFE  W         +RD W +    G Q             +   ++R  G      +   + L+ L+    GS
Subjt:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAWF-------WIRDLWRWLEGVGGQ-------------VQQLDRRGSG------RGDWKCLEVLRRWGRGS

Query:  --ADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPS-----ERDID
          A+ QA   +++ +   EE+ WKQRSR  WL+ GD+NTR+FHT+AS R++RN I  L    G   +E  E+  LVS YF ++F+SS P       RDID
Subjt:  --ADLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPS-----ERDID

A0A7N2N012 Uncharacterized protein2.7e-1731.38Show/hide
Query:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAW-FW------IRDLWRWLEGVGGQVQQLDRRGSGRG-------------DWKCLEVLRRWGR--GSA---
        SDH PILL    +++  +   +R ++FEEAW  W      +++ W   EGV   + ++  + +G G             D   ++VL+R     G+A   
Subjt:  SDHRPILLSLAPLVRMVDAHGSRIYRFEEAW-FW------IRDLWRWLEGVGGQVQQLDRRGSGRG-------------DWKCLEVLRRWGR--GSA---

Query:  -----DLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS
             +  A    L+ +  ++E++W QRSR SWLK GD+NT++FH++AS R++RN I+G+++      +E GE+  +   YFE++FT+
Subjt:  -----DLQAAEARLEAIFLEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTS

A0A803Q1K6 Uncharacterized protein3.2e-1839.2Show/hide
Query:  SDHRPILLSLAPLVRMVDAHGSRI----YRFEEAWFWIRDLWRWLEGVGGQVQQLDRRGSGRGDWK------CLEVLRRWGRGSA--DLQAAEARLEAIF
        SDH+ +++ + PL         +     + FEEAW    +  +  E V  Q  + D   SG GD+K       L VL       A  +++  E +L A+ 
Subjt:  SDHRPILLSLAPLVRMVDAHGSRI----YRFEEAWFWIRDLWRWLEGVGGQVQQLDRRGSGRGDWK------CLEVLRRWGRGSA--DLQAAEARLEAIF

Query:  LEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIF-TSSCPSE
         ++E+YW+QRSR  WLKWGD NT++FH +AS RRK+N I+GL+DS GV  QE G +  LV +YF +IF TS+ PS+
Subjt:  LEEEVYWKQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIF-TSSCPSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTGGCGGTGGGATGGTAAGAGGAAGGCGGATGTTATTCAAGGGGAGGAACAAGGTGGGAAGAGGGCTAAGTCTGGAGCTGAAGTAGCAGACGAAATGAGCAACCC
CGCCGATCGCCATGATGATACTATTCTGGAATGTGTAGGGTATGGGGTCGGACCGCGCATTCCGTCGGCTGTAAAGATTGTGCAGCAACATCGACCCCTTGGTTTTCCTT
ACAGAGACCAGTGTAGACAGTATTGGAAGAAGTGGGGCCTAGCTCTGCTATGGAGTTCGGAGGTGAGCTTTAGCTTGTCACTTACTCGAGGCACCATATTGATGGTGGGT
GGACTGGAATTCGTGTCAGTGGCGCTTTACAGGGATCTATGCCGATCGGATCACCGTCCTATCTTGCTCTCATTAGCGCCGTTGGTTCGAATGGTTGATGCGCATGGGAG
CAGAATTTATAGATTTGAGGAGGCCTGGTTCTGGATCCGGGATTTATGGAGGTGGTTAGAAGGAGTTGGGGGGCAAGTCCAACAGTTGGATCGCCGAGGGAGTGGCAGGG
GAGACTGGAAATGCCTGGAGGTGCTGAGGCGTTGGGGAAGGGGGAGTGCAGACCTGCAGGCAGCAGAAGCCAGATTGGAGGCGATTTTTCTGGAGGAAGAGGTGTACTGG
AAACAACGCTCTAGGGAGAGTTGGCTGAAGTGGGGTGACAGGAACACCCGATGGTTTCATACACAGGCGTCTTTTAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGA
CAGTGGTGGTGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCGGAGTACTTTGAGAACATCTTCACGTCTAGTTGTCCGTCAGAAAGGGATATTGATGTCG
TTACAGCAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTGGCGGTGGGATGGTAAGAGGAAGGCGGATGTTATTCAAGGGGAGGAACAAGGTGGGAAGAGGGCTAAGTCTGGAGCTGAAGTAGCAGACGAAATGAGCAACCC
CGCCGATCGCCATGATGATACTATTCTGGAATGTGTAGGGTATGGGGTCGGACCGCGCATTCCGTCGGCTGTAAAGATTGTGCAGCAACATCGACCCCTTGGTTTTCCTT
ACAGAGACCAGTGTAGACAGTATTGGAAGAAGTGGGGCCTAGCTCTGCTATGGAGTTCGGAGGTGAGCTTTAGCTTGTCACTTACTCGAGGCACCATATTGATGGTGGGT
GGACTGGAATTCGTGTCAGTGGCGCTTTACAGGGATCTATGCCGATCGGATCACCGTCCTATCTTGCTCTCATTAGCGCCGTTGGTTCGAATGGTTGATGCGCATGGGAG
CAGAATTTATAGATTTGAGGAGGCCTGGTTCTGGATCCGGGATTTATGGAGGTGGTTAGAAGGAGTTGGGGGGCAAGTCCAACAGTTGGATCGCCGAGGGAGTGGCAGGG
GAGACTGGAAATGCCTGGAGGTGCTGAGGCGTTGGGGAAGGGGGAGTGCAGACCTGCAGGCAGCAGAAGCCAGATTGGAGGCGATTTTTCTGGAGGAAGAGGTGTACTGG
AAACAACGCTCTAGGGAGAGTTGGCTGAAGTGGGGTGACAGGAACACCCGATGGTTTCATACACAGGCGTCTTTTAGGAGGAAGAGGAATCTAATTCGGGGTTTGGTAGA
CAGTGGTGGTGTGATGAGGCAGGAGCCTGGAGAGATTGTGGGTCTGGTCTCGGAGTACTTTGAGAACATCTTCACGTCTAGTTGTCCGTCAGAAAGGGATATTGATGTCG
TTACAGCAGGGTGA
Protein sequenceShow/hide protein sequence
MEWRWDGKRKADVIQGEEQGGKRAKSGAEVADEMSNPADRHDDTILECVGYGVGPRIPSAVKIVQQHRPLGFPYRDQCRQYWKKWGLALLWSSEVSFSLSLTRGTILMVG
GLEFVSVALYRDLCRSDHRPILLSLAPLVRMVDAHGSRIYRFEEAWFWIRDLWRWLEGVGGQVQQLDRRGSGRGDWKCLEVLRRWGRGSADLQAAEARLEAIFLEEEVYW
KQRSRESWLKWGDRNTRWFHTQASFRRKRNLIRGLVDSGGVMRQEPGEIVGLVSEYFENIFTSSCPSERDIDVVTAG