CuGenDBv2

Lag0040984 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040984
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr13:10561832..10567147
RNA-Seq Expression: Lag0040984
Synteny: Lag0040984
Gene Ontology terms:
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains: NA
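
The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record does not state its coordinate convention.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr13:10561832..10567147")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 5316 bp on chr13.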


Homology
GenBank top hits (e value, % identity, alignment)
KAB5516879.1 hypothetical protein DKX38_027527 [Salix brachista] (e value: 8.7e-63; % identity: 63.08)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+LR E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

KAB5519281.1 hypothetical protein DKX38_023600 [Salix brachista] (e value: 8.7e-63; % identity: 63.08)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+LR E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

KAF9681460.1 hypothetical protein SADUNF_Sadunf05G0003800 [Salix dunnii] (e value: 8.7e-63; % identity: 64.4)
Query:  VPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQVNRM
        VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  ++RM
Subjt:  VPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQVNRM

Query:  YDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        YDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E   AKSQILS  EI +L EVFSR+LR E
Subjt:  YDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] (e value: 5.1e-79; % identity: 72.55)
Query:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE
        MA++K LV+SN +PL+SK TEHKLNGSNYY+W RTI FYLRST+MDDH+ EDPP+D  +KK WLRDDARLYL IKNSIE+EI+GL +HC+SVKELL F +
Subjt:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE

Query:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL
        FLYSGKEQV+RM++V M FF A QK +SVT+YFM+ KK  AELG L PF+ D+KVQQ QREKMAVMIFLNGL PEFGMAK+QILS S+IP+LD+ F+RVL
Subjt:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL

Query:  RIES
        RIES
Subjt:  RIES

XP_038898187.1 uncharacterized protein LOC120085933 [Benincasa hispida] (e value: 7.9e-72; % identity: 68.63)
Query:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE
        M E+K +V+SN +PL+SK T+HKLNGSNYY+W RTI+FYL ST MDDH++E+P +D+ KKK WL DDARLYL IKNSIE+E++GL +HCD VKELL F E
Subjt:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE

Query:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL
        FLYSGKE V RM+DV M FF   QK + VT+YFM+ KKT AEL  L P++ D+KVQQAQREKM VMIFLNGL  EFGMAK+QILS SEIP+L+E FSRVL
Subjt:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL

Query:  RIES
         IES
Subjt:  RIES

TrEMBL top hits (e value, % identity, alignment)
A0A5N5JC74 Uncharacterized protein (e value: 4.2e-63; % identity: 63.08)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+LR E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

A0A5N5JJ99 Uncharacterized protein (e value: 4.2e-63; % identity: 63.08)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+LR E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

A0A5N5KU30 Uncharacterized protein (e value: 1.2e-62; % identity: 62.56)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS + DDH+ E+PP+D+TKKK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+LR E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

A0A5N5M9B2 Uncharacterized protein (e value: 1.4e-61; % identity: 61.54)
Query:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ
        I+  VP  SK T+HKLNG+NY EWS+TIR YLRS E DDH+ ++PP+D+T+KK W+RDDARL+L I+NSI++EI+GL NHC+ VKEL+ + EFLYSGK  
Subjt:  ISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQ

Query:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE
        ++RMYDV   F+ A ++ +S+T YFM FKKT  EL  L PF+TDIKVQQ QREKMAVM FL GL  E    KSQILS  EI +L EVFSR+L  E
Subjt:  VNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIE

Q6L3Q0 Polyprotein, putative (e value: 1.2e-60; % identity: 61.27)
Query:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE
        MAE K   ++  VP  SK T+ KLNGSNY +WSR IR YLRS E DDH+I+DPP DD  KK WLRDDARL L I NSI+NE++GL NHC+ VKEL+ + E
Subjt:  MAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTEMDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTE

Query:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL
        +LYSGK  ++R+Y+VS  F+ + ++ +S+T YFM+FKKT  EL  L PF+TDIKVQQAQRE+MA+M FL GL  EF  AKSQILS SEI +L +VFS+VL
Subjt:  FLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIKVQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVL

Query:  RIES
        R ES
Subjt:  RIES

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G40316.1 FUNCTIONS IN: molecular_function unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Autophagy-related protein 27 (InterPro:IPR018939); Has 138 Blast hits to 138 proteins in 57 species: Archae - 0; Bacteria - 0; Metazoa - 32; Fungi - 62; Plants - 33; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). (e value: 4.2e-31; % identity: 54.9)
Query:  FMILHQILPFNSVFSVCEFSYANNGKMYNFNLALPSSKFPHGVLSEDGFYRVAVNDTIVWFQLCDGMIFNHDPPMCVDCLDCGGPSHCGMGCSALVANKI
        F+++  +   + V S CEFS+    K+++FNLA     +PHGVLSEDGFY+V  N +++WFQLCD +IFNHDPP CV C DCGGPSHCG  CSALV+  +
Subjt:  FMILHQILPFNSVFSVCEFSYANNGKMYNFNLALPSSKFPHGVLSEDGFYRVAVNDTIVWFQLCDGMIFNHDPPMCVDCLDCGGPSHCGMGCSALVANKI

Query:  EG
         G
Subjt:  EG

AT2G40316.2 FUNCTIONS IN: molecular_function unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Autophagy-related protein 27 (InterPro:IPR018939); Has 135 Blast hits to 135 proteins in 58 species: Archae - 0; Bacteria - 0; Metazoa - 32; Fungi - 63; Plants - 31; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). (e value: 2.0e-12; % identity: 61.7)
Query:  QLCDGMIFNHDPPMCVDCLDCGGPSHCGMGCSALVANKIEGAPPSGE
        QLCD +IFNHDPP CV C DCGGPSHCG  CSALV+  +      G+
Subjt:  QLCDGMIFNHDPPMCVDCLDCGGPSHCGMGCSALVANKIEGAPPSGE


Sequences
CDS sequence
ATGATGCGTAGAATCAATTCTGTGAAATGGGCTACCATTTTCTTCATGATTCTACACCAAATTCTGCCCTTCAATTCGGTCTTTTCCGTTTGCGAGTTCAGTTACGCTAA
CAATGGCAAGATGTACAACTTCAATCTGGCGTTGCCTAGCTCCAAATTCCCTCATGGAGTCCTCAGCGAAGACGGGTTTTACAGAGTAGCAGTAAATGATACCATTGTCT
GGTTTCAGCTATGTGATGGGATGATCTTCAATCACGATCCACCTATGTGTGTTGATTGCTTGGATTGTGGGGGACCTTCGCATTGTGGGATGGGTTGTAGTGCACTTGTA
GCGAACAAGATAGAAGGAGCGCCGCCGTCAGGAGAGTGGAAGCACCGTCGCCGATCGTCGGAAAACGCCGCGCGTGAGTCACACGCGCCGCCGAAGGAGTCTGCGTCCGC
CGCCACGCGCCGGCGCGTGAGGGCGCGTGAGCCGTCCTTTTCTGTTTCCGCCGCCTGGGTTCGTCTCGCCTCCGGTTTTAAGCTACCAATGGCTGAGCTGAAACCTTTGG
TCATATCGAATGGAGTTCCTCTTTCTTCAAAGTTCACAGAACATAAGCTAAATGGCTCCAATTACTATGAATGGAGTCGTACAATTCGGTTTTATCTACGAAGCACTGAG
ATGGATGACCATATCATTGAGGATCCGCCAGAGGATGACACAAAGAAGAAGATTTGGCTTAGAGATGATGCTCGCTTGTACTTGGGAATAAAGAATTCTATAGAAAACGA
GATACTTGGTTTGTTTAATCATTGTGATTCGGTGAAAGAACTACTAGCATTCACAGAGTTTTTATACTCAGGTAAAGAGCAAGTTAACAGAATGTATGATGTTTCCATGA
CCTTCTTTCATGCAGCTCAGAAAGAACAATCTGTGACGAATTACTTCATGCAGTTTAAGAAGACATGTGCAGAGCTTGGTACGTTATTCCCATTTACCACTGATATAAAG
GTTCAACAAGCTCAGCGAGAAAAGATGGCCGTCATGATATTTTTGAATGGACTTCAACCTGAATTTGGGATGGCGAAATCACAAATTCTCTCTGGTTCTGAAATTCCTAC
TTTGGATGAAGTCTTCAGTAGAGTTCTCCGTATTGAAAGCCCTTAA
mRNA sequence
ATGATGCGTAGAATCAATTCTGTGAAATGGGCTACCATTTTCTTCATGATTCTACACCAAATTCTGCCCTTCAATTCGGTCTTTTCCGTTTGCGAGTTCAGTTACGCTAA
CAATGGCAAGATGTACAACTTCAATCTGGCGTTGCCTAGCTCCAAATTCCCTCATGGAGTCCTCAGCGAAGACGGGTTTTACAGAGTAGCAGTAAATGATACCATTGTCT
GGTTTCAGCTATGTGATGGGATGATCTTCAATCACGATCCACCTATGTGTGTTGATTGCTTGGATTGTGGGGGACCTTCGCATTGTGGGATGGGTTGTAGTGCACTTGTA
GCGAACAAGATAGAAGGAGCGCCGCCGTCAGGAGAGTGGAAGCACCGTCGCCGATCGTCGGAAAACGCCGCGCGTGAGTCACACGCGCCGCCGAAGGAGTCTGCGTCCGC
CGCCACGCGCCGGCGCGTGAGGGCGCGTGAGCCGTCCTTTTCTGTTTCCGCCGCCTGGGTTCGTCTCGCCTCCGGTTTTAAGCTACCAATGGCTGAGCTGAAACCTTTGG
TCATATCGAATGGAGTTCCTCTTTCTTCAAAGTTCACAGAACATAAGCTAAATGGCTCCAATTACTATGAATGGAGTCGTACAATTCGGTTTTATCTACGAAGCACTGAG
ATGGATGACCATATCATTGAGGATCCGCCAGAGGATGACACAAAGAAGAAGATTTGGCTTAGAGATGATGCTCGCTTGTACTTGGGAATAAAGAATTCTATAGAAAACGA
GATACTTGGTTTGTTTAATCATTGTGATTCGGTGAAAGAACTACTAGCATTCACAGAGTTTTTATACTCAGGTAAAGAGCAAGTTAACAGAATGTATGATGTTTCCATGA
CCTTCTTTCATGCAGCTCAGAAAGAACAATCTGTGACGAATTACTTCATGCAGTTTAAGAAGACATGTGCAGAGCTTGGTACGTTATTCCCATTTACCACTGATATAAAG
GTTCAACAAGCTCAGCGAGAAAAGATGGCCGTCATGATATTTTTGAATGGACTTCAACCTGAATTTGGGATGGCGAAATCACAAATTCTCTCTGGTTCTGAAATTCCTAC
TTTGGATGAAGTCTTCAGTAGAGTTCTCCGTATTGAAAGCCCTTAA
Protein sequence
MMRRINSVKWATIFFMILHQILPFNSVFSVCEFSYANNGKMYNFNLALPSSKFPHGVLSEDGFYRVAVNDTIVWFQLCDGMIFNHDPPMCVDCLDCGGPSHCGMGCSALV
ANKIEGAPPSGEWKHRRRSSENAARESHAPPKESASAATRRRVRAREPSFSVSAAWVRLASGFKLPMAELKPLVISNGVPLSSKFTEHKLNGSNYYEWSRTIRFYLRSTE
MDDHIIEDPPEDDTKKKIWLRDDARLYLGIKNSIENEILGLFNHCDSVKELLAFTEFLYSGKEQVNRMYDVSMTFFHAAQKEQSVTNYFMQFKKTCAELGTLFPFTTDIK
VQQAQREKMAVMIFLNGLQPEFGMAKSQILSGSEIPTLDEVFSRVLRIESP
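
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the `translate` helper is illustrative and is not part of the database. Only the first and last codons of the listed CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a multi-line sequence block
    can be pasted in directly. Ambiguous bases (e.g. N) are not handled.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons and last 4 codons of the CDS listed above.
print(translate("ATGATGCGTAGAATCAATTCTGTGAAATGGGCTACC"))
print(translate("GAAAGCCCTTAA"))
```

The two fragments translate to MMRRINSVKWAT and ESP, matching the start and end of the listed protein sequence.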