; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035784 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035784
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:30177819..30178343
RNA-Seq ExpressionLag0035784
SyntenyLag0035784
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019170410.1 PREDICTED: uncharacterized protein LOC109165884 [Ipomoea nil]1.3e-4250Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        +GF D+W K +M C+ TVSY VL+NG   E I P RGLRQGDPLSPYLFI+CAEGLS LLK+ E   +I G  +    P+ITHLFFADD+L+FF A  +E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           IKR L +YE  SGQ +N  K     S+N  E   ++I   LGV ++ + G YLGLP+  G  +   FS ++
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

XP_019196049.1 PREDICTED: uncharacterized protein LOC109189882 [Ipomoea nil]6.6e-4250Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF  +W  LIM+C+ TV Y ++VNG+    I P RGLRQGDPLSPYLFI+CAEGLS LL++ + +  I G  +    P ++HLFFADD+L+FF A  EE
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           IK+ L  YER SGQK+N DK     SRN  E    ++   LGV+Q+ + G YLGLP+  G  K  +FS V+
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.6e-4654.29Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF+++W  LIMNC+E+V + VL+NG+P +   P RGLRQGDPLSPYLFIMCAEGLS L+  EE   NI+ + IN  CP I+HLF+ADD L+FF A    
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERA-SGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           IK IL SYE+A SGQ INLDK + ++S+N  E     I +EL V  + S+G YLGLP+QTG  K  +F+ +K
Subjt:  GACIKRILLSYERA-SGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

XP_027165828.1 uncharacterized protein LOC113765775 [Coffea eugenioides]6.6e-4249.43Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF +KW   IM CI TVS+   +NG  +E I P RGLRQGDPLSPYLF++C+EG S+LLK+     +++G+ I+ H P+ITHLFFADD+L+F  A  +E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           +KRIL  YE+ SGQ +NLDK     S+N+     +++C ELG  Q  S G YLGLP      KG +F  +K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

XP_027174019.1 uncharacterized protein LOC113773583 [Coffea eugenioides]3.9e-4249.43Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF +KW   IM CI TVS+   +NG  +E I P RGLRQGDPLSPYLF++C+EG S+LLK+     +++G+ I+ H P+ITHLFFADD+L+F  A  +E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           +KRIL  YE+ SGQ +NLDK     S+N+     +++C ELG  Q  S G YLGLP      KG +F  +K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

TrEMBL top hitse value%identityAlignment
A0A1B5Z7R2 Reverse transcriptase domain-containing protein (Fragment)7.1e-4251.15Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGFSD+W K IM C+E+V Y VLVNGI  E IKP RGLRQGDPLSPYLFI+CAEGL+ L++  E++ +I G+ I  + P I+HL FADD  +FF A   E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           +K IL +YE ASGQ IN  K     SRN+ + +  +I   LGV      G YLGLP+  G  K   F+ +K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

A0A2N9F8H0 Reverse transcriptase domain-containing protein5.5e-4247.7Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF+D+W  L+M C++TVSY V++NG P   I+P RG+RQGDPLSPYLF++CAEGL+TLL+   A+  ++G+S+N   P I+HLFFADD+L+F  A  EE
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           +   L  YERASGQK+N +K     S N  +   + IC  L  + +  +G YLGLP   G  K   F+++K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

A0A2Z6NHQ6 Reverse transcriptase domain-containing protein1.2e-4151.15Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGFSD+W K IM C+E+V Y VLVNGI  E IKP RGLRQGDPLSPYLFI+C EGL+ L++  E++ NI G+ I  + P I+HL FADD  +FF A   E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           +K IL +YE ASGQ IN  K     SRN+ + +  +I   LGV      G YLGLP+  G  K   F+ +K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

A0A396J7A3 Putative RNA-directed DNA polymerase4.2e-4252.3Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGFS +W   IM C+ETV Y VLVNG     I P RGLRQGDPLSPYLFI+CAEGLS+L++  E  +NI G SI  + P ++HL FADD  +FF A + E
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
          C+K IL +YE ASGQ INL K     SRN P +    I   LGV Q    G YLGLP+  G  K   F  +K
Subjt:  GACIKRILLSYERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

A0A6J1DUG8 uncharacterized protein LOC1110241351.3e-4654.29Show/hide
Query:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE
        MGF+++W  LIMNC+E+V + VL+NG+P +   P RGLRQGDPLSPYLFIMCAEGLS L+  EE   NI+ + IN  CP I+HLF+ADD L+FF A    
Subjt:  MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEE

Query:  GACIKRILLSYERA-SGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK
           IK IL SYE+A SGQ INLDK + ++S+N  E     I +EL V  + S+G YLGLP+QTG  K  +F+ +K
Subjt:  GACIKRILLSYERA-SGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein9.4e-0732.71Show/hide
Query:  VLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEEGACIKRILLSYERASGQKINL
        + VNG   E I  + G RQG PLSPYLF +  E L+  +++++    I GI I      I+ L  ADD +V+ +  K     +  ++ S+    G KIN 
Subjt:  VLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEEGACIKRILLSYERASGQKINL

Query:  DKFICMI
        +K +  +
Subjt:  DKFICMI

P92555 Uncharacterized mitochondrial protein AtMg012501.3e-1655.22Show/hide
Query:  LVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADD
        ++NG PQ ++ P RGLRQGDPLSPYLFI+C E LS L +R +    + GI ++N+ P I HL FADD
Subjt:  LVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADD

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.1e-0536.26Show/hide
Query:  GFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLV
        G  D     IM+ I      ++V G     I  R G++QGDPLSP LF +  + L T L  E+      G S+   C  I  L FADD L+
Subjt:  GFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLV

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.3e-1855.22Show/hide
Query:  LVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADD
        ++NG PQ ++ P RGLRQGDPLSPYLFI+C E LS L +R +    + GI ++N+ P I HL FADD
Subjt:  LVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTCTCTGATAAATGGACAAAGCTTATTATGAACTGCATTGAGACAGTAAGCTACCAAGTGTTGGTCAATGGCATTCCCCAGGAAGTGATAAAGCCGAGAAGGGG
GCTTCGTCAAGGAGACCCCCTATCTCCGTACCTTTTCATTATGTGCGCTGAAGGCCTATCGACTCTTCTAAAAAGGGAGGAAGCTAACTCTAATATCTCTGGTATTTCTA
TTAATAATCATTGTCCAACTATAACACATCTCTTTTTTGCAGATGACAACTTGGTGTTTTTCAATGCAAAAAAGGAGGAGGGAGCGTGTATTAAGAGGATCCTGCTATCC
TATGAGAGAGCCTCGGGCCAGAAGATCAATCTGGATAAATTTATTTGCATGATAAGCAGGAATATTCCTGAGAATAATGCGAAGGACATTTGTAGAGAGCTGGGAGTAAC
CCAATCCAACTCAATTGGGCACTACCTAGGGCTTCCTACTCAAACAGGCCTACAGAAGGGAATCATGTTTAGCAAAGTTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATTCTCTGATAAATGGACAAAGCTTATTATGAACTGCATTGAGACAGTAAGCTACCAAGTGTTGGTCAATGGCATTCCCCAGGAAGTGATAAAGCCGAGAAGGGG
GCTTCGTCAAGGAGACCCCCTATCTCCGTACCTTTTCATTATGTGCGCTGAAGGCCTATCGACTCTTCTAAAAAGGGAGGAAGCTAACTCTAATATCTCTGGTATTTCTA
TTAATAATCATTGTCCAACTATAACACATCTCTTTTTTGCAGATGACAACTTGGTGTTTTTCAATGCAAAAAAGGAGGAGGGAGCGTGTATTAAGAGGATCCTGCTATCC
TATGAGAGAGCCTCGGGCCAGAAGATCAATCTGGATAAATTTATTTGCATGATAAGCAGGAATATTCCTGAGAATAATGCGAAGGACATTTGTAGAGAGCTGGGAGTAAC
CCAATCCAACTCAATTGGGCACTACCTAGGGCTTCCTACTCAAACAGGCCTACAGAAGGGAATCATGTTTAGCAAAGTTAAATAG
Protein sequenceShow/hide protein sequence
MGFSDKWTKLIMNCIETVSYQVLVNGIPQEVIKPRRGLRQGDPLSPYLFIMCAEGLSTLLKREEANSNISGISINNHCPTITHLFFADDNLVFFNAKKEEGACIKRILLS
YERASGQKINLDKFICMISRNIPENNAKDICRELGVTQSNSIGHYLGLPTQTGLQKGIMFSKVK