; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025492 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025492
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE2
Genome locationchr10:13865111..13865807
RNA-Seq ExpressionLag0025492
SyntenyLag0025492
Gene Ontology termsGO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense]1.2e-1729.88Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL
        M +YL+ MK  A ++ +AG P     L + I++GL+ E +PI   +  +  + WQE +   L+++S+L  + ++  K   +S+PSA++AT+  +   ++ 
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL

Query:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT
         +S+ Q    G                                                       +   NS  NS    FVAT E V D  WYADSGAT
Subjt:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT

Query:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        NH T D  NL+ ++ Y+G+ESL VGNG  L I HVG   +P
Subjt:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense]1.4e-1629.05Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL
        M +YL+ MK  A ++ +AG P     L +  ++GL+ E +PI   +  +  + WQE +   L+++S+L  + ++  K   +S+PSA++AT+  +   ++ 
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL

Query:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT
         +S+ Q    G                                                       +   NS  NS    FVAT E V D  WYADSGAT
Subjt:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT

Query:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        +H T D  NL+ +++Y+G+ESL VGNG  L I HVG   +P
Subjt:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]3.6e-1729.46Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL
        M +YL+ MK  A ++ +AG P     L +  ++GL+ E +PI   +  +  + WQE +   L+++S+L  + ++  K   +S+PSA++AT+  +   ++ 
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL

Query:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT
         +S+ Q    G                                                       +   NS  NS    FVAT E V D  WYADSGAT
Subjt:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT

Query:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        NH T D  NL+ +++Y+G+ESL VGNG  L I HVG   +P
Subjt:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia]6.8e-2435.53Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMA------------
        M DYLS MK  A  + +AG+PIS   L+S +++GL  E L I CQ+N K +  WQE HA  + FE+ L  L+ +    D+S PSAN              
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMA------------

Query:  ------------------------TSSMDRPFSSLLSSSLQQKV---HGLKSE-IQNPNSGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNH
                                 S+  RP   +        V   H L  + + N   GGN    A++   E + DP W  DSGATNH T D  NL  
Subjt:  ------------------------TSSMDRPFSSLLSSSLQQKV---HGLKSE-IQNPNSGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNH

Query:  QTEYKGNESLSVGNGDLLTIEHVGYSVV
        Q EY+GNE+L+VGN   L I HVG +V+
Subjt:  QTEYKGNESLSVGNGDLLTIEHVGYSVV

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]2.5e-1830.59Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMATSSMDRPFSSLL
        M++YL LMK  A N+ +AG  +S++ LVS +++GL++E  PI   V  K +  W E HA  L +E +L   + +     I+            R F    
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMATSSMDRPFSSLL

Query:  SSSLQQKVHGLKSEIQNPNSGGNSKMGAF-----------------------------------VATLEFVADPMWYADSGATNHSTPDINNLNHQTEYK
          + Q+  +G  S   N + GG  + G+F                                   V T E V DP WYADSGAT+H T + NN+  + +Y 
Subjt:  SSSLQQKVHGLKSEIQNPNSGGNSKMGAF-----------------------------------VATLEFVADPMWYADSGATNHSTPDINNLNHQTEYK

Query:  GNESLSVGNGDLLTIEHVG
        G E++ V NG+ L+I H+G
Subjt:  GNESLSVGNGDLLTIEHVG

TrEMBL top hitse value%identityAlignment
A0A5C7HHE9 Uncharacterized protein6.0e-1829.88Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL
        M +YL+ MK  A ++ +AG P     L + I++GL+ E +PI   +  +  + WQE +   L+++S+L  + ++  K   +S+PSA++AT+  +   ++ 
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL

Query:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT
         +S+ Q    G                                                       +   NS  NS    FVAT E V D  WYADSGAT
Subjt:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT

Query:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        NH T D  NL+ ++ Y+G+ESL VGNG  L I HVG   +P
Subjt:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

A0A5C7IJ06 Uncharacterized protein1.7e-1729.46Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL
        M +YL+ MK  A ++ +AG P     L +  ++GL+ E +PI   +  +  + WQE +   L+++S+L  + ++  K   +S+PSA++AT+  +   ++ 
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQL-HIIPKTTDISAPSANMATSSMDRPFSSL

Query:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT
         +S+ Q    G                                                       +   NS  NS    FVAT E V D  WYADSGAT
Subjt:  LSSSLQQKVHG---------------------------------------------------LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGAT

Query:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        NH T D  NL+ +++Y+G+ESL VGNG  L I HVG   +P
Subjt:  NHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

A0A6J1CLV9 uncharacterized protein LOC1110128093.3e-2435.53Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMA------------
        M DYLS MK  A  + +AG+PIS   L+S +++GL  E L I CQ+N K +  WQE HA  + FE+ L  L+ +    D+S PSAN              
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMA------------

Query:  ------------------------TSSMDRPFSSLLSSSLQQKV---HGLKSE-IQNPNSGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNH
                                 S+  RP   +        V   H L  + + N   GGN    A++   E + DP W  DSGATNH T D  NL  
Subjt:  ------------------------TSSMDRPFSSLLSSSLQQKV---HGLKSE-IQNPNSGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNH

Query:  QTEYKGNESLSVGNGDLLTIEHVGYSVV
        Q EY+GNE+L+VGN   L I HVG +V+
Subjt:  QTEYKGNESLSVGNGDLLTIEHVGYSVV

A0A6J1DCW4 uncharacterized protein LOC1110195981.2e-1830.59Show/hide
Query:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMATSSMDRPFSSLL
        M++YL LMK  A N+ +AG  +S++ LVS +++GL++E  PI   V  K +  W E HA  L +E +L   + +     I+            R F    
Subjt:  MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMATSSMDRPFSSLL

Query:  SSSLQQKVHGLKSEIQNPNSGGNSKMGAF-----------------------------------VATLEFVADPMWYADSGATNHSTPDINNLNHQTEYK
          + Q+  +G  S   N + GG  + G+F                                   V T E V DP WYADSGAT+H T + NN+  + +Y 
Subjt:  SSSLQQKVHGLKSEIQNPNSGGNSKMGAF-----------------------------------VATLEFVADPMWYADSGATNHSTPDINNLNHQTEYK

Query:  GNESLSVGNGDLLTIEHVG
        G E++ V NG+ L+I H+G
Subjt:  GNESLSVGNGDLLTIEHVG

A0A803QCM5 Uncharacterized protein3.9e-1737.58Show/hide
Query:  KQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKT-----TDISAPSANMATSSMDRPFSSLLSSSLQQKVHGLKSEIQNPN
        KQLVS  +SGL+ E LPI  QV  +    WQE     L+ +S++ +L  +  T      +  +P+AN+AT+    P +S                    N
Subjt:  KQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKT-----TDISAPSANMATSSMDRPFSSLLSSSLQQKVHGLKSEIQNPN

Query:  SGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVG
         G    + AF+AT + V D  WYADSGA+NH T   NN+  + EY G E L+VGNG+ L I HVG
Subjt:  SGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVG

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-0438.78Show/hide
Query:  WYADSGATNHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP
        W  DSGAT+H T D NNL+    Y G + + + +G  + I H G + +P
Subjt:  WYADSGATNHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGACTATCTCAGCTTAATGAAGCACACTGCCGCAAATATGCATGTTGCAGGTAAACCTATTTCTCTTAAACAACTCGTATCTTTCATAGTTTCTGGACTTGAGGA
TGAATGTTTACCAATCACTTGCCAAGTTAATTGTAAAAGTGACTGGAAATGGCAAGAGTTTCATGCAAACACGCTAGCCTTTGAATCCCAATTGACACAACTTCATATTA
TACCCAAGACTACTGATATTTCTGCTCCTTCTGCCAACATGGCTACAAGTTCAATGGACAGGCCATTCAGCAGCCTACTGTCATCATCGCTTCAACAAAAAGTACATGGG
CTCAAATCAGAAATTCAGAATCCTAACAGTGGAGGTAACAGTAAAATGGGTGCCTTTGTTGCCACTCTAGAATTTGTTGCTGATCCAATGTGGTATGCCGACTCTGGTGC
TACAAATCATAGCACCCCAGATATCAACAATCTGAATCATCAAACTGAATACAAAGGTAATGAGTCACTTTCCGTGGGTAATGGTGATCTTTTGACAATTGAGCACGTTG
GGTATTCTGTTGTTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGACTATCTCAGCTTAATGAAGCACACTGCCGCAAATATGCATGTTGCAGGTAAACCTATTTCTCTTAAACAACTCGTATCTTTCATAGTTTCTGGACTTGAGGA
TGAATGTTTACCAATCACTTGCCAAGTTAATTGTAAAAGTGACTGGAAATGGCAAGAGTTTCATGCAAACACGCTAGCCTTTGAATCCCAATTGACACAACTTCATATTA
TACCCAAGACTACTGATATTTCTGCTCCTTCTGCCAACATGGCTACAAGTTCAATGGACAGGCCATTCAGCAGCCTACTGTCATCATCGCTTCAACAAAAAGTACATGGG
CTCAAATCAGAAATTCAGAATCCTAACAGTGGAGGTAACAGTAAAATGGGTGCCTTTGTTGCCACTCTAGAATTTGTTGCTGATCCAATGTGGTATGCCGACTCTGGTGC
TACAAATCATAGCACCCCAGATATCAACAATCTGAATCATCAAACTGAATACAAAGGTAATGAGTCACTTTCCGTGGGTAATGGTGATCTTTTGACAATTGAGCACGTTG
GGTATTCTGTTGTTCCATAA
Protein sequenceShow/hide protein sequence
MVDYLSLMKHTAANMHVAGKPISLKQLVSFIVSGLEDECLPITCQVNCKSDWKWQEFHANTLAFESQLTQLHIIPKTTDISAPSANMATSSMDRPFSSLLSSSLQQKVHG
LKSEIQNPNSGGNSKMGAFVATLEFVADPMWYADSGATNHSTPDINNLNHQTEYKGNESLSVGNGDLLTIEHVGYSVVP