; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001926 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001926
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:37123218..37126928
RNA-Seq ExpressionLag0001926
SyntenyLag0001926
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH00857.1 hypothetical protein Prudu_010958, partial [Prunus dulcis]1.9e-2239.43Show/hide
Query:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAI
        S+ +KGY C +  T ++ ISRHV+  E  FPF   FA+      +I P+ P  + L     +     PS      S + P+T    S  N   + S    
Subjt:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAI

Query:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
          P+ S L   ++F  WR AM  EF+AL+  G W LVP   + N+VGCKWVFR K  PDG++  YKARLVAKGF+
Subjt:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN

KAB2608903.1 hypothetical protein D8674_012071 [Pyrus ussuriensis x Pyrus communis]4.1e-2233.33Show/hide
Query:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIP-
        YKGY C +  T ++ +SRHVI +E  +P+  + A+     S + P D AF+ ++    S      ++  +  + ++    A  S + +    + + ++P 
Subjt:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIP-

Query:  --------------------PNESVLSA--PNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
                            P++++++A   +  P+WRQAM DEF+AL +Q  W+LVPP+   NIVGCKWVF+ K N DGS+  +KARLVAKGF+
Subjt:  --------------------PNESVLSA--PNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN

RVW26376.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.1e-2135.15Show/hide
Query:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMDPAFLAL----------VDKFLSETYFNPSVACADVSRNIPST
        +KGYLC N S  K+ IS+HVI  E  FPFA+                I  ++ ITP+     +L           D  +S + F  +        ++PS 
Subjt:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMDPAFLAL----------VDKFLSETYFNPSVACADVSRNIPST

Query:  CADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKG
        C   +      +   +  + P E++   L  PN F      P+W+QAM  EF AL     W LVPP  N NI+GC+WV++ KY P+G+V  YKARLVAKG
Subjt:  CADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKG

Query:  FN
        F+
Subjt:  FN

RVX14515.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.1e-2235.78Show/hide
Query:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVS
        +KGYLC N S  K+ ISRHVI +E  FPFA+                I   S ITP+    P+  +   + +  +  + S++ ++V+    +  + + VS
Subjt:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVS

Query:  NSLHVASVEA---------IIPPNESV---LSAPNEFP------KWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVA
        +  H  +  A          + PN+++   L  PN F       +W+QAM  EF AL     W LVPP  N NI+GC+WV++ KY PDG+V  YKARLVA
Subjt:  NSLHVASVEA---------IIPPNESV---LSAPNEFP------KWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVA

Query:  KGFN
        KGF+
Subjt:  KGFN

RVX23193.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.7e-2335.75Show/hide
Query:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADV---------SR
        S  +KGYLC N S  K+ ISRHVI  E  FPFA+                I   + ITP+    P+  +   + +  +  + S++ ++V         S 
Subjt:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADV---------SR

Query:  NIPSTCADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKAR
        ++PS C   +      +   +  + P E++   L  PN F      P+W+QAM  EF AL     W LVPP  N NI+GC+WV++ KY PDG+V  YKAR
Subjt:  NIPSTCADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKAR

Query:  LVAKGFN
        LVAKGF+
Subjt:  LVAKGFN

TrEMBL top hitse value%identityAlignment
A0A2N9GRJ0 Uncharacterized protein4.5e-2233.33Show/hide
Query:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN------------------------IGFMSNITPMD---PAFLALVD--------KFLSETY
        +++ KGYLC N  T K++ISRHV  HE  FPF    + +                        +G   ++ P+    P   +L D          L  T 
Subjt:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN------------------------IGFMSNITPMD---PAFLALVD--------KFLSETY

Query:  FNPSVACADVSRNIPS-------------TCADASVSNS---LHVASVEAI--IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVG
         +P ++C   S ++PS             T   + +S     LH  ++  +   PP+  V S   ++P+W+ AM+DE++AL+ Q  W+LVPP  N NIVG
Subjt:  FNPSVACADVSRNIPS-------------TCADASVSNS---LHVASVEAI--IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVG

Query:  CKWVFRTKYNPDGSVSYYKARLVAKGFN
        CKWV++ K  PDGSV+ YKARLVAKG++
Subjt:  CKWVFRTKYNPDGSVSYYKARLVAKGFN

A0A438JZY3 Retrovirus-related Pol polyprotein from transposon RE13.4e-2235.78Show/hide
Query:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVS
        +KGYLC N S  K+ ISRHVI +E  FPFA+                I   S ITP+    P+  +   + +  +  + S++ ++V+    +  + + VS
Subjt:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVS

Query:  NSLHVASVEA---------IIPPNESV---LSAPNEFP------KWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVA
        +  H  +  A          + PN+++   L  PN F       +W+QAM  EF AL     W LVPP  N NI+GC+WV++ KY PDG+V  YKARLVA
Subjt:  NSLHVASVEA---------IIPPNESV---LSAPNEFP------KWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVA

Query:  KGFN
        KGF+
Subjt:  KGFN

A0A438KPR1 Retrovirus-related Pol polyprotein from transposon RE11.8e-2335.75Show/hide
Query:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADV---------SR
        S  +KGYLC N S  K+ ISRHVI  E  FPFA+                I   + ITP+    P+  +   + +  +  + S++ ++V         S 
Subjt:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATN-----------IGFMSNITPMD---PAFLALVDKFLSETYFNPSVACADV---------SR

Query:  NIPSTCADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKAR
        ++PS C   +      +   +  + P E++   L  PN F      P+W+QAM  EF AL     W LVPP  N NI+GC+WV++ KY PDG+V  YKAR
Subjt:  NIPSTCADASVSNSLHVASVEAIIPPNESV---LSAPNEF------PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKAR

Query:  LVAKGFN
        LVAKGF+
Subjt:  LVAKGFN

A0A4Y1RA80 Integrase catalytic domain-containing protein (Fragment)9.0e-2339.43Show/hide
Query:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAI
        S+ +KGY C +  T ++ ISRHV+  E  FPF   FA+      +I P+ P  + L     +     PS      S + P+T    S  N   + S    
Subjt:  SVDYKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAI

Query:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
          P+ S L   ++F  WR AM  EF+AL+  G W LVP   + N+VGCKWVFR K  PDG++  YKARLVAKGF+
Subjt:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN

A0A5N5G0M4 CCHC-type domain-containing protein2.0e-2233.33Show/hide
Query:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIP-
        YKGY C +  T ++ +SRHVI +E  +P+  + A+     S + P D AF+ ++    S      ++  +  + ++    A  S + +    + + ++P 
Subjt:  YKGYLCYNQSTKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIP-

Query:  --------------------PNESVLSA--PNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
                            P++++++A   +  P+WRQAM DEF+AL +Q  W+LVPP+   NIVGCKWVF+ K N DGS+  +KARLVAKGF+
Subjt:  --------------------PNESVLSA--PNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.9e-0837.84Show/hide
Query:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGF
        +P +   +   ++   W +A+  E +A K    WT+    +N NIV  +WVF  KYN  G+   YKARLVA+GF
Subjt:  IPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.2e-0539.19Show/hide
Query:  ESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWK
        + VLS P E  +  +AM +E  +L++ G + LV        + CKWVF+ K + D  +  YKARLV KGF   K
Subjt:  ESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWK

P92520 Uncharacterized mitochondrial protein AtMg008209.6e-1457.38Show/hide
Query:  PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
        P W QAM +E  AL     W LVPP  N NI+GCKWVF+TK + DG++   KARLVAKGF+
Subjt:  PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-1243.01Show/hide
Query:  SVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLV-PPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWKARPTFLFAD
        S+ A   P  ++ +  +E  +WR AM  E +A      W LV PP  ++ IVGC+W+F  KYN DGS++ YKARLVAKG+N    RP   +A+
Subjt:  SVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLV-PPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWKARPTFLFAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-1240.62Show/hide
Query:  HVASVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLV-PPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWKARPTFLFAD
        +  S+ A   P  ++ +  ++  +WRQAM  E +A      W LV PP  ++ IVGC+W+F  K+N DGS++ YKARLVAKG+N    RP   +A+
Subjt:  HVASVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLV-PPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWKARPTFLFAD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.0e-1134.82Show/hide
Query:  ETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSV
        + Y+  SVA   +  +I    +   VS   H   V        S  +   EF  W  AM DE  A++    W +     N   +GCKWV++ KYN DG++
Subjt:  ETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIPPNESVLSAPNEFPKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSV

Query:  SYYKARLVAKGF
          YKARLVAKG+
Subjt:  SYYKARLVAKGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.8e-1557.38Show/hide
Query:  PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN
        P W QAM +E  AL     W LVPP  N NI+GCKWVF+TK + DG++   KARLVAKGF+
Subjt:  PKWRQAMIDEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATGCTTGCCCCCGCTGTTGGAGCTTGCTGTCGGTCGCCATCGTCGTCGCCGCTAGAGCTTGCTATCGGGTATGTGTTTTGTCTGGGCTTTCGCCGCCATAGGAG
CTTGCTGTCTGGTTTAAGTCGCGTCGTCGTCGGAGCTTTTTACTTTTCGTGGGTGTTGCAGAAGCAGAAGAAAAAGAAAAAAAATCTGCAGTTGACCCGACCCAACTCGA
ACCGATTCGGGTCAATTCTTAATCGGGTTGGGTTGGTTTGGGGCGGTTTGTCCATACAACCGACCTGGGCCCAGTCAGTTGATTACAAGGGATATCTTTGCTATAATCAG
TCTACTAAAAAGATGATTATCTCTCGACATGTTATCATCCATGAGTATGGGTTTCCATTTGCTGAAATGTTTGCTACTAATATTGGTTTTATGTCTAATATTACTCCAAT
GGATCCTGCTTTTCTTGCTTTGGTTGATAAATTTCTCTCTGAGACTTATTTCAATCCATCTGTTGCTTGTGCTGATGTTAGTCGCAATATCCCAAGTACTTGTGCTGATG
CTAGTGTTTCTAATTCCTTGCATGTTGCATCTGTTGAAGCTATTATTCCACCTAATGAGTCCGTCTTATCTGCTCCTAATGAGTTTCCTAAATGGAGACAAGCTATGATT
GATGAATTTTCGGCTCTCAAAGAACAAGGGGCTTGGACTCTAGTACCTCCCATTGATAATATGAATATAGTAGGATGTAAATGGGTGTTTCGCACTAAATATAACCCTGA
TGGTTCAGTTTCTTATTATAAGGCCCGACTAGTTGCTAAAGGGTTCAATAGTTGGAAGGCTAGGCCGACGTTTCTTTTCGCCGACGCAACACAATACGTCGGTTATTGGG
CCTTTCCTCCGACGAAATTGGGCCGCGTCGGAGCCGCCCCCTTTTCTTCTTCGTGTAGCAGCGTAGCCCCTCGATTTTTATTCCCATTCTCCGTTCGTGTAGCAGCACCT
TCCCCTTGGTCTTCTCTCCTTCCCATTGACGACGGCGTTGAAGCTCACGCCGGCACAGCGCCCTCCCCCCCTTGGAGCTTCGGCGGCCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCATGCTTGCCCCCGCTGTTGGAGCTTGCTGTCGGTCGCCATCGTCGTCGCCGCTAGAGCTTGCTATCGGGTATGTGTTTTGTCTGGGCTTTCGCCGCCATAGGAG
CTTGCTGTCTGGTTTAAGTCGCGTCGTCGTCGGAGCTTTTTACTTTTCGTGGGTGTTGCAGAAGCAGAAGAAAAAGAAAAAAAATCTGCAGTTGACCCGACCCAACTCGA
ACCGATTCGGGTCAATTCTTAATCGGGTTGGGTTGGTTTGGGGCGGTTTGTCCATACAACCGACCTGGGCCCAGTCAGTTGATTACAAGGGATATCTTTGCTATAATCAG
TCTACTAAAAAGATGATTATCTCTCGACATGTTATCATCCATGAGTATGGGTTTCCATTTGCTGAAATGTTTGCTACTAATATTGGTTTTATGTCTAATATTACTCCAAT
GGATCCTGCTTTTCTTGCTTTGGTTGATAAATTTCTCTCTGAGACTTATTTCAATCCATCTGTTGCTTGTGCTGATGTTAGTCGCAATATCCCAAGTACTTGTGCTGATG
CTAGTGTTTCTAATTCCTTGCATGTTGCATCTGTTGAAGCTATTATTCCACCTAATGAGTCCGTCTTATCTGCTCCTAATGAGTTTCCTAAATGGAGACAAGCTATGATT
GATGAATTTTCGGCTCTCAAAGAACAAGGGGCTTGGACTCTAGTACCTCCCATTGATAATATGAATATAGTAGGATGTAAATGGGTGTTTCGCACTAAATATAACCCTGA
TGGTTCAGTTTCTTATTATAAGGCCCGACTAGTTGCTAAAGGGTTCAATAGTTGGAAGGCTAGGCCGACGTTTCTTTTCGCCGACGCAACACAATACGTCGGTTATTGGG
CCTTTCCTCCGACGAAATTGGGCCGCGTCGGAGCCGCCCCCTTTTCTTCTTCGTGTAGCAGCGTAGCCCCTCGATTTTTATTCCCATTCTCCGTTCGTGTAGCAGCACCT
TCCCCTTGGTCTTCTCTCCTTCCCATTGACGACGGCGTTGAAGCTCACGCCGGCACAGCGCCCTCCCCCCCTTGGAGCTTCGGCGGCCATTAA
Protein sequenceShow/hide protein sequence
MLMLAPAVGACCRSPSSSPLELAIGYVFCLGFRRHRSLLSGLSRVVVGAFYFSWVLQKQKKKKKNLQLTRPNSNRFGSILNRVGLVWGGLSIQPTWAQSVDYKGYLCYNQ
STKKMIISRHVIIHEYGFPFAEMFATNIGFMSNITPMDPAFLALVDKFLSETYFNPSVACADVSRNIPSTCADASVSNSLHVASVEAIIPPNESVLSAPNEFPKWRQAMI
DEFSALKEQGAWTLVPPIDNMNIVGCKWVFRTKYNPDGSVSYYKARLVAKGFNSWKARPTFLFADATQYVGYWAFPPTKLGRVGAAPFSSSCSSVAPRFLFPFSVRVAAP
SPWSSLLPIDDGVEAHAGTAPSPPWSFGGH