; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024146 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024146
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon 297
Genome locationchr10:784091..785931
RNA-Seq ExpressionLag0024146
SyntenyLag0024146
Gene Ontology termsNA
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7843765.1 putative mitochondrial protein [Senna tora]3.9e-2843.02Show/hide
Query:  LVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------PYLGHIVSADGVSVDPSK
        L +L+ L    DD      ++P  IR VL     +F     L P RE +HRI L+ G+T V+V PY                 YLGHIVS+ G++ DP K
Subjt:  LVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------PYLGHIVSADGVSVDPSK

Query:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL
        + AM +WP+P N+K+LRG L L GYYR+F+++Y  +A PLT  LKKDSF W+  +  AF  LK  MV+ PVL
Subjt:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL

KAF8393178.1 hypothetical protein HHK36_021419 [Tetracentron sinense]8.6e-2836.96Show/hide
Query:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------------
        ++R    E  G LVELN L+G+ ++ S G   +P+ +   L +H ++F    GL P     H I L  G+ P+SV PY                      
Subjt:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------------

Query:  --------------------------------PYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFV
                                         YLGHI+S  GV+ DPSK+ AM  WP P N++ LRG L L GYYRKFV  Y  IA PLT+QLKKD F 
Subjt:  --------------------------------PYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFV

Query:  WNDEATTAFFKLKTAMVTVPVLLYLIFQKS
        WN EA T+F +LK AM +V VL    F +S
Subjt:  WNDEATTAFFKLKTAMVTVPVLLYLIFQKS

KAF8400118.1 hypothetical protein HHK36_013414 [Tetracentron sinense]5.1e-2837.85Show/hide
Query:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVH------------------------
        ++R    E  G LVELN L+G+ ++ S G   +P+ +   L +H ++F    GL P R   H I L  G+ P+S+                         
Subjt:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVH------------------------

Query:  ----------------------PYPYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTA
                                 YLGHI+S  GV+ DPSK+ AM  WP P N++ LRG L L GYYRKFV  Y  IA PLT+QLKKD F WN EA T+
Subjt:  ----------------------PYPYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTA

Query:  FFKLKTAMVTVPVL
        F +LK  M +VPVL
Subjt:  FFKLKTAMVTVPVL

OIT36196.1 putative mitochondrial protein, partial [Nicotiana attenuata]1.9e-2743.6Show/hide
Query:  EEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPY----------------------------------LGHIVSADGVSVDPSK
        E+IPE I+ +L     IFQ   GL  +RE +H IEL+PG+ PV++ PY Y                                  LGHI+S  GVS D  K
Subjt:  EEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPY----------------------------------LGHIVSADGVSVDPSK

Query:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL
        I+AM  W  P +IKELRG L L GYYR+F+QN+ +I+ PL+  LKK +F+W  EAT AF +LK AMV  PVL
Subjt:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL

XP_039003082.1 uncharacterized protein LOC120129711 [Hibiscus syriacus]4.3e-2743.02Show/hide
Query:  ELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------PYLGHIVSADGVSVDPSKIAAMEQWP
        EL  + GA       W+        VL ++S +F    GL P+R  +H I L   +T V++ PY           YL HI+SA GVS DPSKI  M+ WP
Subjt:  ELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------PYLGHIVSADGVSVDPSKIAAMEQWP

Query:  IPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQKS
         P  +K LRG L L GYYR+F+++YG+I+ PLT+ LKK+ F WN  A  AF  LK AM T PVL    F K+
Subjt:  IPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQKS

TrEMBL top hitse value%identityAlignment
A0A087H2U0 Uncharacterized protein3.0e-2641.36Show/hide
Query:  GFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTP--------------------------VSVHPYPYLGHI
        G +VE N LQ    + ++    +P  ++ VL ++  +F    GL P R   H I L PG  P                                 YLGHI
Subjt:  GFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTP--------------------------VSVHPYPYLGHI

Query:  VSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQK
        +S DGV+V+P KI AM  W  P NIK LRG L L GYYRKFV+ YG IA PLT  LKKD F W+ EA  AF +LK AM TVPVL  + F +
Subjt:  VSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQK

A0A1U7YI19 uncharacterized protein LOC1042473107.9e-2740.62Show/hide
Query:  EELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------------PYLGHI
        ++L+  ++ LN    A ++ ++G  EI   I TVL Q    F     L P R  +H I L     P+S+ PY                       YLGHI
Subjt:  EELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPY----------------------PYLGHI

Query:  VSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQKS
         + +GVS DPSKI AM +WPIP ++K LRG L L+GYYR+F+++YG I+ PLT  L+K++F WN EA   FF+LK AM   PVL    F KS
Subjt:  VSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQKS

A0A314L3G5 Putative mitochondrial protein (Fragment)9.3e-2843.6Show/hide
Query:  EEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPY----------------------------------LGHIVSADGVSVDPSK
        E+IPE I+ +L     IFQ   GL  +RE +H IEL+PG+ PV++ PY Y                                  LGHI+S  GVS D  K
Subjt:  EEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPY----------------------------------LGHIVSADGVSVDPSK

Query:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL
        I+AM  W  P +IKELRG L L GYYR+F+QN+ +I+ PL+  LKK +F+W  EAT AF +LK AMV  PVL
Subjt:  IAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL

A0A6A3AWX9 Protein STRUBBELIG-RECEPTOR FAMILY 2-like1.8e-2652.85Show/hide
Query:  LPGLLPEREGNHRIELMPGTTPVSVHPY------------PYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQ
        LPGL P+R  +H I L P +TPV++ PY             YLGHI+SA GV+ +PSK+  M+ WP P  +K L G L L GYYR+F++NYGLI  PLTQ
Subjt:  LPGLLPEREGNHRIELMPGTTPVSVHPY------------PYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQ

Query:  QLKKDSFVWNDEATTAFFKLKTA
         LKKD F WN EA TAF  LK A
Subjt:  QLKKDSFVWNDEATTAFFKLKTA

A0A6L2J1B8 RT_RNaseH_2 domain-containing protein1.3e-2642.86Show/hide
Query:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPYLGHIVSADGVSVDPSKIAAM
        +IR+   E +GF++E+  L+  P   S     I   I  ++ ++  +F P   L P ++  + I L  GTTP       YL HIVS   V  DPSKI+AM
Subjt:  IIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSSIFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPYLGHIVSADGVSVDPSKIAAM

Query:  EQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQK
         +WPIP N+ EL G L L GYY+K V+ Y  IA  LT+QLKKD+F WN + T AF  L+ AM  V VL  L F K
Subjt:  EQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVLLYLIFQK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.1e-1340.59Show/hide
Query:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFV--WNDEATTAFFKLKTAMVTVPVLLYLIFQKSL
        +LGH+++ DG+  +P KI A++++PIP   KE++  L L GYYRKF+ N+  IA P+T+ LKK+  +   N E  +AF KLK  +   P+L    F K  
Subjt:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFV--WNDEATTAFFKLKTAMVTVPVLLYLIFQKSL

Query:  S
        +
Subjt:  S

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.5e-0833.94Show/hide
Query:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQL------------KKDSFVWNDEATTAFFKLKTAMVTVPV
        YLG IVS DG   DP K+ A++++P P  + ++R  L L  YYR F++++  IA P+T  L            KK    +N+    AF +L+  + +  V
Subjt:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQL------------KKDSFVWNDEATTAFFKLKTAMVTVPV

Query:  LL-YLIFQK
        +L Y  F+K
Subjt:  LL-YLIFQK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.6e-1343.88Show/hide
Query:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWND--EATTAFFKLKTAMVTVPVLLYLIFQK
        +LGHIV+ DG+  +P K+ A+  +PIP   KE+R  L L GYYRKF+ NY  IA P+T  LKK + +     E   AF KLK  ++  P+L    F+K
Subjt:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWND--EATTAFFKLKTAMVTVPVLLYLIFQK

P92523 Uncharacterized mitochondrial protein AtMg008604.8e-2157.14Show/hide
Query:  YLG--HIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL
        YLG  HI+S +GVS DP+K+ AM  WP P N  ELRG L L GYYR+FV+NYG I  PLT+ LKK+S  W + A  AF  LK A+ T+PVL
Subjt:  YLG--HIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.5e-1131.93Show/hide
Query:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLT------------QQLKKDSFVWNDEATTAFFKLKTAMVTVPV
        +LG+IV+ADG+  DP K+ A+ + P P ++KEL+  L +  YYRKF+Q+Y  +A PLT             Q  K     ++ A  +F  LK+ + +  +
Subjt:  YLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLT------------QQLKKDSFVWNDEATTAFFKLKTAMVTVPV

Query:  LLYLIFQKSLSWRLMLQGW
        L +  F K          W
Subjt:  LLYLIFQKSLSWRLMLQGW

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.4e-2257.14Show/hide
Query:  YLG--HIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL
        YLG  HI+S +GVS DP+K+ AM  WP P N  ELRG L L GYYR+FV+NYG I  PLT+ LKK+S  W + A  AF  LK A+ T+PVL
Subjt:  YLG--HIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFFKLKTAMVTVPVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCAAGTGTTCCTCCAAAAGGAGACAGAATCTGCAAGCCAAGGGGATAAGTACCAGCTTTTGAACTTCTTTTGGATTTTATCATCAGTATTTAGGCTACGTAAACT
GATCAAAAGTTGGGTAGTTGGCCACAATGTCCACCCAGATTTTGTTCCCTTTTTTCCCTCTAAAGAGATTAGTTTTCCAGCCATCGGGGGAAGCACCATAAATACTGATG
ATAATTTTCCTCCACAAGGAGTTCTTCTCTTGCGTGAACCTCCAAAGCCATTTAACCATCAGGGAGATATTTCTTTGAGCAAGGGATCCCACTCCCAGCCCTCCATGGAT
GCTCGACAAAGGGAAGCCAAGATAGTTGATGGGCAGGGAATCCACTTGGCATCCGAGGAGGTTGGCCCATTTGGTCGTCTGAGGGAGGTCTACGTTAATCCCTATCATAG
CGGATTTAGCCAAATTCAGAGAAAGCCTAGAGCCTGCCATTATCATCTTGAGAATATCCCACCACTTAATCAAATCATTCTCATTATCCGGGCAAAAAATGAGGAATTGC
AGGGGTTTTTGGTTGAACTCAACGTGCTTCAAGGAGCACCCGATGACATGTCAGATGGTTGGGAGGAAATCCCCGAATGCATTCGTACAGTCTTAATCCAACATTCCTCT
ATTTTTCAACCTTTACCTGGCTTGCTGCCCGAGCGTGAGGGCAATCATCGAATCGAACTAATGCCTGGTACGACCCCTGTTAGTGTACACCCATATCCATATTTGGGGCA
TATCGTGTCTGCTGATGGAGTGTCTGTTGACCCGTCTAAGATAGCTGCCATGGAACAATGGCCAATCCCTTGGAACATTAAGGAACTAAGAGGTTCTCTTGTGCTCATGG
GTTATTATCGTAAATTTGTTCAAAATTATGGCTTGATCGCCCATCCATTGACACAACAGCTTAAGAAAGACAGCTTTGTGTGGAACGACGAAGCCACAACAGCGTTCTTT
AAGCTGAAGACCGCCATGGTCACGGTTCCTGTTCTTCTTTACCTGATTTTTCAAAAGAGTTTGTCGTGGAGGTTGATGCTTCAGGGGTGGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCCAAGTGTTCCTCCAAAAGGAGACAGAATCTGCAAGCCAAGGGGATAAGTACCAGCTTTTGAACTTCTTTTGGATTTTATCATCAGTATTTAGGCTACGTAAACT
GATCAAAAGTTGGGTAGTTGGCCACAATGTCCACCCAGATTTTGTTCCCTTTTTTCCCTCTAAAGAGATTAGTTTTCCAGCCATCGGGGGAAGCACCATAAATACTGATG
ATAATTTTCCTCCACAAGGAGTTCTTCTCTTGCGTGAACCTCCAAAGCCATTTAACCATCAGGGAGATATTTCTTTGAGCAAGGGATCCCACTCCCAGCCCTCCATGGAT
GCTCGACAAAGGGAAGCCAAGATAGTTGATGGGCAGGGAATCCACTTGGCATCCGAGGAGGTTGGCCCATTTGGTCGTCTGAGGGAGGTCTACGTTAATCCCTATCATAG
CGGATTTAGCCAAATTCAGAGAAAGCCTAGAGCCTGCCATTATCATCTTGAGAATATCCCACCACTTAATCAAATCATTCTCATTATCCGGGCAAAAAATGAGGAATTGC
AGGGGTTTTTGGTTGAACTCAACGTGCTTCAAGGAGCACCCGATGACATGTCAGATGGTTGGGAGGAAATCCCCGAATGCATTCGTACAGTCTTAATCCAACATTCCTCT
ATTTTTCAACCTTTACCTGGCTTGCTGCCCGAGCGTGAGGGCAATCATCGAATCGAACTAATGCCTGGTACGACCCCTGTTAGTGTACACCCATATCCATATTTGGGGCA
TATCGTGTCTGCTGATGGAGTGTCTGTTGACCCGTCTAAGATAGCTGCCATGGAACAATGGCCAATCCCTTGGAACATTAAGGAACTAAGAGGTTCTCTTGTGCTCATGG
GTTATTATCGTAAATTTGTTCAAAATTATGGCTTGATCGCCCATCCATTGACACAACAGCTTAAGAAAGACAGCTTTGTGTGGAACGACGAAGCCACAACAGCGTTCTTT
AAGCTGAAGACCGCCATGGTCACGGTTCCTGTTCTTCTTTACCTGATTTTTCAAAAGAGTTTGTCGTGGAGGTTGATGCTTCAGGGGTGGGTGTAG
Protein sequenceShow/hide protein sequence
MIQVFLQKETESASQGDKYQLLNFFWILSSVFRLRKLIKSWVVGHNVHPDFVPFFPSKEISFPAIGGSTINTDDNFPPQGVLLLREPPKPFNHQGDISLSKGSHSQPSMD
ARQREAKIVDGQGIHLASEEVGPFGRLREVYVNPYHSGFSQIQRKPRACHYHLENIPPLNQIILIIRAKNEELQGFLVELNVLQGAPDDMSDGWEEIPECIRTVLIQHSS
IFQPLPGLLPEREGNHRIELMPGTTPVSVHPYPYLGHIVSADGVSVDPSKIAAMEQWPIPWNIKELRGSLVLMGYYRKFVQNYGLIAHPLTQQLKKDSFVWNDEATTAFF
KLKTAMVTVPVLLYLIFQKSLSWRLMLQGWV