; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022011 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022011
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:15911893..15913361
RNA-Seq ExpressionLag0022011
SyntenyLag0022011
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2907816.1 hypothetical protein DAI22_12g128600 [Oryza sativa Japonica Group]2.3e-1131.5Show/hide
Query:  HIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP------------------------------------C--------GPRFTWCNRW
        HID  V D G +  RFT +YG   +E K RTW  M+ L  N TTP                                    C        G  FTW N  
Subjt:  HIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP------------------------------------C--------GPRFTWCNRW

Query:  LGDD-IFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQ-NVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM
           +    ER+DR + N  WR MFP   V + D   S HRP+++ L  +   ++   GH   RFE  WL   +FKE V   WD  A +  L   AS  G+
Subjt:  LGDD-IFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQ-NVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM

PON56647.1 Endonuclease/exonuclease/phosphatase [Trema orientale]5.1e-1132.48Show/hide
Query:  HIDGWVDGGESPR-RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--CGPRF-------------------TWCNRWLGDDIFWERIDRCIRNVAWRDMF
        HID  V   E    RFT  YGH    ++  +W L++ L G  + P  CG  F                   TW N+  G     E++DR + N  WR +F
Subjt:  HIDGWVDGGESPR-RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--CGPRF-------------------TWCNRWLGDDIFWERIDRCIRNVAWRDMF

Query:  PRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNW
        P   V+HLDF  S HRP++L++S +  L    G R  R E  W+   +F + V  +W
Subjt:  PRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNW

PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis]9.3e-1329.13Show/hide
Query:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCN
        NHID  +     PR  RFT +YG  +A +++RTW L+K L  +   P                                             GPRFTW  
Subjt:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCN

Query:  RWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM
           G+++   R+DR   ++ WRD+FP   VRHLD   S H PILL + ++    +    +  +FEE WLL    KE V ++W+  + VG    L + M  
Subjt:  RWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM

Query:  CMSGLD
          + L+
Subjt:  CMSGLD

XP_024164679.1 uncharacterized protein LOC112171781 [Rosa chinensis]6.6e-1135.21Show/hide
Query:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKH--LRGNSTTPCGPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRP
        +HID  +    SP   RFT +YGH + E++ +TW L++   L+GN      P   +     G +    R+DR   ++  +D+FP   V+HLD C+S H P
Subjt:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKH--LRGNSTTPCGPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRP

Query:  ILLNLSVRIHLIQNVGH-RIKRFEETWLLSHEFKEAVSTNWD
        IL  L VR+H  +     R  +FEE WLL  + +E V   W+
Subjt:  ILLNLSVRIHLIQNVGH-RIKRFEETWLLSHEFKEAVSTNWD

XP_042962598.1 uncharacterized protein LOC122296867 [Carya illinoinensis]5.1e-1127.71Show/hide
Query:  LKCAWRFDL------LKANHIDG-----WVDGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPCGPR----------------------------
        L   W+++L         NH+        V+GG+     T +YGH Q + +   WAL+KHL   S T C PR                            
Subjt:  LKCAWRFDL------LKANHIDG-----WVDGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPCGPR----------------------------

Query:  -------------------FTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKE
                           FTWCN   GD +  ER+DR + N+AW D FP   V++     S H P+ L+       IQN G +  RFE  W+   E   
Subjt:  -------------------FTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKE

Query:  AVSTNWDQRASVGSLMALASAMGMCMSGLDR
         V   W     + SL  + S +  C S L R
Subjt:  AVSTNWDQRASVGSLMALASAMGMCMSGLDR

TrEMBL top hitse value%identityAlignment
A0A2N9F6W2 Reverse transcriptase domain-containing protein3.2e-1131.94Show/hide
Query:  HIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPC-------GPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSY
        HID  + +G     R T +YG  + +++  TWAL++HL   S   C       G  FTW N        W  +DR +  + W  +FP   V HLD  +S 
Subjt:  HIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPC-------GPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSY

Query:  HRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWD
        H+ + LN S          HR  RFEE W+     +E +   WD
Subjt:  HRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWD

A0A2N9FJU7 CCHC-type domain-containing protein1.1e-1131.76Show/hide
Query:  NHIDGWV--DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPCGPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPIL
        NHID  V   G       T  YG+ +  ++  +WAL+KHL   S  P G +FTW N+ +  D    R+DR + + +W   F   EV HL F    H P+L
Subjt:  NHIDGWV--DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTPCGPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPIL

Query:  LNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLM
        L +   +    +   R+ RFE  W    + K  +   W      GS M
Subjt:  LNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLM

A0A2P6SDG4 Putative RNA-directed DNA polymerase4.5e-1329.13Show/hide
Query:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCN
        NHID  +     PR  RFT +YG  +A +++RTW L+K L  +   P                                             GPRFTW  
Subjt:  NHIDGWVDGGESPR--RFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCN

Query:  RWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM
           G+++   R+DR   ++ WRD+FP   VRHLD   S H PILL + ++    +    +  +FEE WLL    KE V ++W+  + VG    L + M  
Subjt:  RWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASAMGM

Query:  CMSGLD
          + L+
Subjt:  CMSGLD

A0A803LHG7 Uncharacterized protein2.0e-1331.96Show/hide
Query:  NHIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCNR
        NHID WV + G+   RFT  YGHL+ E K  T ALMK L G    P                                             G  FTW N 
Subjt:  NHIDGWV-DGGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCNR

Query:  WLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIK--RFEETWLLSHEFKEAVSTNWDQRASVGSLMA
          G +   ER+D+   N+AW+D FP   V HL   RS H PILL +   ++  +    ++K  RFEE WL      + VS  WD    + S +A
Subjt:  WLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIK--RFEETWLLSHEFKEAVSTNWDQRASVGSLMA

A0A803MS51 Uncharacterized protein3.2e-1129.44Show/hide
Query:  NHIDGWVD-GGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCNR
        NHID  ++  G+   RFT IYGH + E K +T  L+K L  +   P                                             G  FTW N 
Subjt:  NHIDGWVD-GGESPRRFTRIYGHLQAEQKARTWALMKHLRGNSTTP--------------------------------------------CGPRFTWCNR

Query:  WLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLI-QNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASA
          GD+   ER+DR + N  WRD+FP   V HL   +S H PILL +   +H   +    ++ RFE  WL      E VS  W++   + S +A  S+
Subjt:  WLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLI-QNVGHRIKRFEETWLLSHEFKEAVSTNWDQRASVGSLMALASA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTACTGCAAAATCATCACTGTGCACCACGAGGAGGAAATCCAAGTAGTTGAGGAGCAGGAGACTACTGAACCAGAAGGCCCCAGGGGCAAAAGAAAAGGAGAGGA
GGTTGAGATGGTGTCACAAGGAAGGAAGAAGGCGAGATGGGGGATGGATGTAGGATTGGTTGTTTCGACGGTGGCTGACTCTCAGCCCTGCAGCCCCACCGGTTGCCATG
AGTATCGTGTTCTGAAATGTGCGTGGAGGTTTGATCTGCTGAAAGCTAACCACATAGATGGATGGGTCGATGGAGGAGAGAGCCCGAGGAGGTTCACTAGGATCTATGGC
CATCTCCAAGCAGAGCAGAAGGCAAGAACATGGGCACTGATGAAGCACCTTCGAGGGAATAGTACGACCCCATGTGGGCCGAGATTCACATGGTGCAATAGGTGGCTAGG
GGACGATATATTCTGGGAAAGAATTGATAGGTGTATCAGGAATGTGGCATGGCGAGATATGTTTCCTAGATTTGAGGTGAGACATCTGGACTTTTGTCGCTCGTATCATA
GGCCCATTCTGTTGAATCTATCTGTGAGGATACATCTGATTCAGAATGTTGGTCACAGGATCAAAAGGTTTGAGGAGACCTGGCTGTTATCCCATGAGTTTAAGGAAGCG
GTGTCAACCAACTGGGATCAAAGGGCCTCTGTGGGATCCCTGATGGCATTGGCATCAGCAATGGGAATGTGTATGTCGGGACTGGATAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTACTGCAAAATCATCACTGTGCACCACGAGGAGGAAATCCAAGTAGTTGAGGAGCAGGAGACTACTGAACCAGAAGGCCCCAGGGGCAAAAGAAAAGGAGAGGA
GGTTGAGATGGTGTCACAAGGAAGGAAGAAGGCGAGATGGGGGATGGATGTAGGATTGGTTGTTTCGACGGTGGCTGACTCTCAGCCCTGCAGCCCCACCGGTTGCCATG
AGTATCGTGTTCTGAAATGTGCGTGGAGGTTTGATCTGCTGAAAGCTAACCACATAGATGGATGGGTCGATGGAGGAGAGAGCCCGAGGAGGTTCACTAGGATCTATGGC
CATCTCCAAGCAGAGCAGAAGGCAAGAACATGGGCACTGATGAAGCACCTTCGAGGGAATAGTACGACCCCATGTGGGCCGAGATTCACATGGTGCAATAGGTGGCTAGG
GGACGATATATTCTGGGAAAGAATTGATAGGTGTATCAGGAATGTGGCATGGCGAGATATGTTTCCTAGATTTGAGGTGAGACATCTGGACTTTTGTCGCTCGTATCATA
GGCCCATTCTGTTGAATCTATCTGTGAGGATACATCTGATTCAGAATGTTGGTCACAGGATCAAAAGGTTTGAGGAGACCTGGCTGTTATCCCATGAGTTTAAGGAAGCG
GTGTCAACCAACTGGGATCAAAGGGCCTCTGTGGGATCCCTGATGGCATTGGCATCAGCAATGGGAATGTGTATGTCGGGACTGGATAGATAG
Protein sequenceShow/hide protein sequence
MEYCKIITVHHEEEIQVVEEQETTEPEGPRGKRKGEEVEMVSQGRKKARWGMDVGLVVSTVADSQPCSPTGCHEYRVLKCAWRFDLLKANHIDGWVDGGESPRRFTRIYG
HLQAEQKARTWALMKHLRGNSTTPCGPRFTWCNRWLGDDIFWERIDRCIRNVAWRDMFPRFEVRHLDFCRSYHRPILLNLSVRIHLIQNVGHRIKRFEETWLLSHEFKEA
VSTNWDQRASVGSLMALASAMGMCMSGLDR