; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035462 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035462
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:21927020..21931093
RNA-Seq ExpressionLag0035462
SyntenyLag0035462
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4262994.1 unnamed protein product [Prunus armeniaca]4.0e-1631.6Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP----------------------------RIFGGVWFGGGNFWGRVRSP----V
        L+ FNQALL KQC  I+Q P S + R+ + RY P+  FL+AGVG +P                            +++   W    +F+  + +P     
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP----------------------------RIFGGVWFGGGNFWGRVRSP----V

Query:  TLGLDL-----EWLISLM---------------PL--LSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSS-WWKGCWKMGIPSKIKCK
        TL  DL     +W + L+               PL  L+S D +IWH+E+ G+Y+VKSGY+L+++        SS+  +LSS +WK  W + IP+KIK  
Subjt:  TLGLDL-----EWLISLM---------------PL--LSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSS-WWKGCWKMGIPSKIKCK

Query:  FFR----DSLMGSEWEVLLQSVQANSMLNL---LRISLGGRS------LQPGLWIGQRIILCFSRGSQT
         +R      L G+  EV     + NS   L   ++IS GG        L+ GLW   R  L F   S+T
Subjt:  FFR----DSLMGSEWEVLLQSVQANSMLNL---LRISLGGRS------LQPGLWIGQRIILCFSRGSQT

XP_015873545.1 uncharacterized protein LOC107410610 [Ziziphus jujuba]1.9e-1833.66Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP-----------RIF--GGVW-FGGG-------NFW------GRVRSPVTLGLD
        L  FN+ALLAKQCWRIV+ P+S L RV+K +YFP   F +AG+GR P           RI   GG+W  G G       + W       R+ SP  L  D
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP-----------RIF--GGVW-FGGG-------NFW------GRVRSPVTLGLD

Query:  L----------EWLISLMPLLSSEDKI-----------------IWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKF
                    W ++L+    S D++                  W F + G+YTV++GY  ++   +    S  S  ++  WWKG WK+ IP+K+KC  
Subjt:  L----------EWLISLMPLLSSEDKI-----------------IWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKF

Query:  FR
        F+
Subjt:  FR

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.2e-1836.36Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGV-------------GRDPRIFGGVW--------FGGGNFWGRVRSPVTLGLDLEWLIS
        L+ FN+ALLAKQCWRI+  P S L RVLKGRYF +  F++A +             GRD    G  W        F  G+ W  V +  TL +     + 
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGV-------------GRDPRIFGGVW--------FGGGNFWGRVRSPVTLGLDLEWLIS

Query:  LMPLLSS------------------------------------EDKIIWHFEKCGIYTVKSGYRLSQVALL----AQIPSSSSSESLSSWWKGCWKMGIP
        L+  +SS                                    ED++IW++EK G+Y+V+SGY+   VALL     Q PSSSSSE +  WW G WKM IP
Subjt:  LMPLLSS------------------------------------EDKIIWHFEKCGIYTVKSGYRLSQVALL----AQIPSSSSSESLSSWWKGCWKMGIP

Query:  SKIKCKFFR
        +KIK   +R
Subjt:  SKIKCKFFR

XP_030497969.1 uncharacterized protein LOC115713621 [Cannabis sativa]1.0e-1640.71Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGRVRSPVTLGLDLEWLISLMPLLSSEDKIIWHFEKCGIYTVK
        FNQALLAKQ WRI +   S L R+LK RYF N  FL++ +G  P +    W   G  WGR      L   +E         +  D +IWH    G+YTVK
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGRVRSPVTLGLDLEWLISLMPLLSSEDKIIWHFEKCGIYTVK

Query:  SGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK
        SG+       L  +   SSS S   WWK  W + +PSKIK
Subjt:  SGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]6.8e-1637.04Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGRVRSPVTLGLDL-----EW-LISL----------------M
        FNQALLAKQ WR+++ P S L ++L+ RYF NG FL +G+G +P +    W   G        P  L  DL     +W LISL                +
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGRVRSPVTLGLDL-----EW-LISL----------------M

Query:  PLLSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK
         L   +D +IW     GIY VKSGY+L+     A+   ++SS S+ +WW   WKM +P K++
Subjt:  PLLSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK

TrEMBL top hitse value%identityAlignment
A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-1836.36Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGV-------------GRDPRIFGGVW--------FGGGNFWGRVRSPVTLGLDLEWLIS
        L+ FN+ALLAKQCWRI+  P S L RVLKGRYF +  F++A +             GRD    G  W        F  G+ W  V +  TL +     + 
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGV-------------GRDPRIFGGVW--------FGGGNFWGRVRSPVTLGLDLEWLIS

Query:  LMPLLSS------------------------------------EDKIIWHFEKCGIYTVKSGYRLSQVALL----AQIPSSSSSESLSSWWKGCWKMGIP
        L+  +SS                                    ED++IW++EK G+Y+V+SGY+   VALL     Q PSSSSSE +  WW G WKM IP
Subjt:  LMPLLSS------------------------------------EDKIIWHFEKCGIYTVKSGYRLSQVALL----AQIPSSSSSESLSSWWKGCWKMGIP

Query:  SKIKCKFFR
        +KIK   +R
Subjt:  SKIKCKFFR

A0A6P3Z8V7 uncharacterized protein LOC1074106109.2e-1933.66Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP-----------RIF--GGVW-FGGG-------NFW------GRVRSPVTLGLD
        L  FN+ALLAKQCWRIV+ P+S L RV+K +YFP   F +AG+GR P           RI   GG+W  G G       + W       R+ SP  L  D
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP-----------RIF--GGVW-FGGG-------NFW------GRVRSPVTLGLD

Query:  L----------EWLISLMPLLSSEDKI-----------------IWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKF
                    W ++L+    S D++                  W F + G+YTV++GY  ++   +    S  S  ++  WWKG WK+ IP+K+KC  
Subjt:  L----------EWLISLMPLLSSEDKI-----------------IWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKF

Query:  FR
        F+
Subjt:  FR

A0A803P996 Uncharacterized protein2.3e-1735.23Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRI-FGGV---------------------------WFGGGNFW----------GRVR
        FNQALLAKQ WRI ++P S L R+LK RYFPN +FL+A +G  P + + G+                           W  G N +          G V 
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRI-FGGV---------------------------WFGGGNFW----------GRVR

Query:  SPVT--------------LGLDLEWLISL-MPLLSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK
        + +T                LD+E ++S+ +   SS D +IWH    G+YTVKSGY L+  A +  I  SSSS   S WWK  W + +P K+K
Subjt:  SPVT--------------LGLDLEWLISL-MPLLSSEDKIIWHFEKCGIYTVKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIK

A0A803PI06 Uncharacterized protein7.8e-1838.1Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGG-------GNFWGRVRSPVTLGLD--------------LEWLISL-M
        FNQA+LAKQ WRI++QP S + R+L  RY+P   FL +  G  P      W  G            RV S +T  L                + ++S+ +
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGG-------GNFWGRVRSPVTLGLD--------------LEWLISL-M

Query:  PLLSSEDKIIWHFEKCGIYTVKSGYRLS-QVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKFFR
        PL  S+D +IW     GIYTV+SGY LS        IPSSSS    S WWK  W + IP+K+K   FR
Subjt:  PLLSSEDKIIWHFEKCGIYTVKSGYRLS-QVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKFFR

A0A803PIN0 Uncharacterized protein1.0e-1737.91Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGR-VRSPVTLGLDLEWLISL-MPLLSSEDKIIWHFEKCGIYT
        +NQALLAKQ WR +  P+S L R+LK RYFP+  FL+A  G  P +    W      W + +       +D++ ++++ +    + D +IWH    GIY 
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGR-VRSPVTLGLDLEWLISL-MPLLSSEDKIIWHFEKCGIYT

Query:  VKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKC---KFFRDSL
        V SGY    VA L     SS+S S ++WWK  WK+ +P KIK    + F D+L
Subjt:  VKSGYRLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKC---KFFRDSL

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003106.6e-0652.27Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP
        FNQALLAKQ +RI+ QP + L R+L+ RYFP+   ++  VG  P
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein4.0e-0624.76Show/hide
Query:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVW-----------------FGGGN---FWGRV---RSPVTLGLDL---
        ++ FN ALL KQ WR++ +P S + +V K RYF   D L+A +G  P     VW                  G G     W        P +  L +   
Subjt:  LKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVW-----------------FGGGN---FWGRV---RSPVTLGLDL---

Query:  -----------------------EWLISLMPLLSSE-----------------DKIIWHFEKCGIYTVKSGY-RLSQVALLAQIPSSSSSESLSSWWKGC
                               EW   ++ +L  E                 D   W +   G YTVKSGY  L+Q+      P   S  SL+  ++  
Subjt:  -----------------------EWLISLMPLLSSE-----------------DKIIWHFEKCGIYTVKSGY-RLSQVALLAQIPSSSSSESLSSWWKGC

Query:  WKMGIPSKIK
        WK     KI+
Subjt:  WKMGIPSKIK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.7e-0752.27Show/hide
Query:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP
        FNQALLAKQ +RI+ QP + L R+L+ RYFP+   ++  VG  P
Subjt:  FNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCAGGATTGAAGACTTTCAACCAAGCTCTTCTGGCCAAACAGTGTTGGAGGATTGTTCAGCAGCCTACCTCTTTTCTCTTCCGTGTGTTGAAGGGGCGGTATTT
TCCTAATGGAGACTTCCTGGATGCAGGGGTGGGTCGCGACCCTCGTATATTTGGAGGAGTCTGGTTTGGGGGAGGGAACTTTTGGGGAAGGGTACGGTCCCCGGTCACTT
TGGGTCTTGATCTCGAGTGGCTGATCTCCTTAATGCCTCTGTTATCGTCTGAGGATAAAATTATATGGCATTTTGAGAAGTGCGGGATCTACACGGTTAAGAGTGGGTAT
CGGCTTAGTCAGGTGGCCTTGCTTGCCCAAATCCCATCGTCGTCCTCAAGTGAGTCGTTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGATTCCGAGCAAAATTAA
GTGCAAGTTTTTTCGTGATTCATTGATGGGATCTGAGTGGGAGGTTTTGCTACAGAGTGTCCAGGCGAATTCTATGCTTAATCTGCTAAGGATAAGTTTAGGGGGCAGGT
CCCTTCAGCCGGGCTTGTGGATTGGGCAGCGAATTATCTTGTGTTTTTCGAGGGGCTCCCAGACCTATTGTGAGGGGATTAGGGGGTGCCTCAGAGTGCCGTTAGATGGT
TGCGCCAGAGGAGGGATGATTTGGCAGAGGGTCTGGCGGTGGGGACAGATTGAGACTTGTGGTGGAAATGGGTTTAGCGCCGGTACTCTGGAGACTGACTCTATGCGGGC
TTTTTCGCTACTGCAAGATGCTGCGATGGTGGATCTATCTGAGTTCGGTGTACTGGTTTCTGAGGCTCGGAGGGGGTGCCTGCGCATTTTCAGCTCAGATTCAGCTTTAC
AAGGAGGGAAGGAAACTGTGTCGCCCATGAGTGGCAGTCGAGGTTGGATAGTTCGAGAACGGAGGCGTTCCAAAGACCAACTACACTGGAAAGGTGATCAGACGGAGCAG
CGTCAGCCACTACCCAGTACCGGCGTGCAGATAGCAGGGGAGAGTGTTTGTGGGCGGGGTGTTTTAGTTGAAGGATGGACATTTTGTATGGTTGTGGATCCTAGGCTGAG
GCTGGATGTTTTCGAGCATCAGTGGGCCCATGTATTCTGGTGGGGCAATGCTCAAATCTATGTTTATGGCTGTGATGATGTGTTGAGATGTTTAAGCATGACAGGGCCTT
ATGTGCTAGATCAAGGTTTTGTGTGGAATATATTCTTGCATTCTATATGTGAGGCCTGCCTAGGGATGGTGGTAGAGGGTCTTCAAAAGGTTGGGTGTGGTGTTTACGTG
GTTAGTCGAACTCCCTGTATGGCTGACTTGGGGTGGAAATACGTGAGGAGTTTGGAAGCTAGCTGGAGTGCAGAGAAAGTCAAATGGCTGTCTTCATGGATCGGGGCTTG
CGCTTCGTGTGGGGAAGAGTGTTGCACAGTGGGTGATGATTTTGGCTTATCTTTGAGTATGCCCTCCGTGTGTGGAGGAATAATGGAGAATGCCCTTTTGCTTATGACAC
CGCGCACATTTTACTACCGTGTTGCCCCTCCACAGGCCAGCGGAATCGATTCCCGCTCTCGATCCACGATCTCGCTTCACACTCTCACTTCTTGGACAGTTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCAGGATTGAAGACTTTCAACCAAGCTCTTCTGGCCAAACAGTGTTGGAGGATTGTTCAGCAGCCTACCTCTTTTCTCTTCCGTGTGTTGAAGGGGCGGTATTT
TCCTAATGGAGACTTCCTGGATGCAGGGGTGGGTCGCGACCCTCGTATATTTGGAGGAGTCTGGTTTGGGGGAGGGAACTTTTGGGGAAGGGTACGGTCCCCGGTCACTT
TGGGTCTTGATCTCGAGTGGCTGATCTCCTTAATGCCTCTGTTATCGTCTGAGGATAAAATTATATGGCATTTTGAGAAGTGCGGGATCTACACGGTTAAGAGTGGGTAT
CGGCTTAGTCAGGTGGCCTTGCTTGCCCAAATCCCATCGTCGTCCTCAAGTGAGTCGTTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGATTCCGAGCAAAATTAA
GTGCAAGTTTTTTCGTGATTCATTGATGGGATCTGAGTGGGAGGTTTTGCTACAGAGTGTCCAGGCGAATTCTATGCTTAATCTGCTAAGGATAAGTTTAGGGGGCAGGT
CCCTTCAGCCGGGCTTGTGGATTGGGCAGCGAATTATCTTGTGTTTTTCGAGGGGCTCCCAGACCTATTGTGAGGGGATTAGGGGGTGCCTCAGAGTGCCGTTAGATGGT
TGCGCCAGAGGAGGGATGATTTGGCAGAGGGTCTGGCGGTGGGGACAGATTGAGACTTGTGGTGGAAATGGGTTTAGCGCCGGTACTCTGGAGACTGACTCTATGCGGGC
TTTTTCGCTACTGCAAGATGCTGCGATGGTGGATCTATCTGAGTTCGGTGTACTGGTTTCTGAGGCTCGGAGGGGGTGCCTGCGCATTTTCAGCTCAGATTCAGCTTTAC
AAGGAGGGAAGGAAACTGTGTCGCCCATGAGTGGCAGTCGAGGTTGGATAGTTCGAGAACGGAGGCGTTCCAAAGACCAACTACACTGGAAAGGTGATCAGACGGAGCAG
CGTCAGCCACTACCCAGTACCGGCGTGCAGATAGCAGGGGAGAGTGTTTGTGGGCGGGGTGTTTTAGTTGAAGGATGGACATTTTGTATGGTTGTGGATCCTAGGCTGAG
GCTGGATGTTTTCGAGCATCAGTGGGCCCATGTATTCTGGTGGGGCAATGCTCAAATCTATGTTTATGGCTGTGATGATGTGTTGAGATGTTTAAGCATGACAGGGCCTT
ATGTGCTAGATCAAGGTTTTGTGTGGAATATATTCTTGCATTCTATATGTGAGGCCTGCCTAGGGATGGTGGTAGAGGGTCTTCAAAAGGTTGGGTGTGGTGTTTACGTG
GTTAGTCGAACTCCCTGTATGGCTGACTTGGGGTGGAAATACGTGAGGAGTTTGGAAGCTAGCTGGAGTGCAGAGAAAGTCAAATGGCTGTCTTCATGGATCGGGGCTTG
CGCTTCGTGTGGGGAAGAGTGTTGCACAGTGGGTGATGATTTTGGCTTATCTTTGAGTATGCCCTCCGTGTGTGGAGGAATAATGGAGAATGCCCTTTTGCTTATGACAC
CGCGCACATTTTACTACCGTGTTGCCCCTCCACAGGCCAGCGGAATCGATTCCCGCTCTCGATCCACGATCTCGCTTCACACTCTCACTTCTTGGACAGTTTCCTAA
Protein sequenceShow/hide protein sequence
MDSGLKTFNQALLAKQCWRIVQQPTSFLFRVLKGRYFPNGDFLDAGVGRDPRIFGGVWFGGGNFWGRVRSPVTLGLDLEWLISLMPLLSSEDKIIWHFEKCGIYTVKSGY
RLSQVALLAQIPSSSSSESLSSWWKGCWKMGIPSKIKCKFFRDSLMGSEWEVLLQSVQANSMLNLLRISLGGRSLQPGLWIGQRIILCFSRGSQTYCEGIRGCLRVPLDG
CARGGMIWQRVWRWGQIETCGGNGFSAGTLETDSMRAFSLLQDAAMVDLSEFGVLVSEARRGCLRIFSSDSALQGGKETVSPMSGSRGWIVRERRRSKDQLHWKGDQTEQ
RQPLPSTGVQIAGESVCGRGVLVEGWTFCMVVDPRLRLDVFEHQWAHVFWWGNAQIYVYGCDDVLRCLSMTGPYVLDQGFVWNIFLHSICEACLGMVVEGLQKVGCGVYV
VSRTPCMADLGWKYVRSLEASWSAEKVKWLSSWIGACASCGEECCTVGDDFGLSLSMPSVCGGIMENALLLMTPRTFYYRVAPPQASGIDSRSRSTISLHTLTSWTVS