; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033593 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033593
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:454594..459095
RNA-Seq ExpressionLag0033593
SyntenyLag0033593
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8483306.1 hypothetical protein CXB51_022295 [Gossypium anomalum]1.0e-2336.71Show/hide
Query:  PPAEIQSQMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVL
        P       + +++    WYLDSGAT+H+T  + + ++  P  G   + + NG S+ I++ GSS  T G+R   L+ +LHVP + KN +SV QF +DN V 
Subjt:  PPAEIQSQMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVL

Query:  FEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFVEEGEDVACKTE
        FEFH   C++KD Q+G+I+  G M  GLYR   + P  +  P  +F   G  + C  +
Subjt:  FEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFVEEGEDVACKTE

KAG8491907.1 hypothetical protein CXB51_015260 [Gossypium anomalum]7.2e-2244.72Show/hide
Query:  QMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALS
        Q  +++    WY DSGAT+H+T    N SS     G Q V + NG+S+ I + GSS +  G+R   L+ +LHVP++ KN +SV QF KDN V FEFH L 
Subjt:  QMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALS

Query:  CYIKDRQSGQIISVGEMFGGLYR
        C++KD Q+ + + VG M  GLY+
Subjt:  CYIKDRQSGQIISVGEMFGGLYR

KAG8501432.1 hypothetical protein CXB51_003747 [Gossypium anomalum]4.2e-2247.79Show/hide
Query:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ
        WY DSGAT+H+T  + N S+  P  G   V + NG+S+ I H GSS +  G+R   L+ +LHVP + KN +SV QF K N V FEFH   C++KD Q+G 
Subjt:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ

Query:  IISVGEMFGGLYR
        I+ VG +  GLYR
Subjt:  IISVGEMFGGLYR

KAG8503713.1 hypothetical protein CXB51_001715 [Gossypium anomalum]4.2e-2239.19Show/hide
Query:  LISPQIQIPPAEIQSQMQANLEPH--WWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISV
        L++  + +  A + S  Q +   H   WYLDSGAT+H+T  +   S+  P  G + + + NG+S+ I + GSS +  G++   L+ +LHV  + KN +SV
Subjt:  LISPQIQIPPAEIQSQMQANLEPH--WWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISV

Query:  SQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDL
        +QF KDN V FEFH L C++KD Q+G+ + VG M  GLY+     P L
Subjt:  SQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDL

OMO79651.1 hypothetical protein CCACVL1_13538 [Corchorus capsularis]9.5e-2248.25Show/hide
Query:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ
        WY +SGA++H+T  L N S      G   V++ NG+ L I+H GSS+L   NR   L  +LHVP +TKN ISVSQF  DN+V FEFH+  C++KD  SGQ
Subjt:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ

Query:  IISVGEMFGGLYRI
        ++  G +  GLYR+
Subjt:  IISVGEMFGGLYRI

TrEMBL top hitse value%identityAlignment
A0A1R3IAV5 Retrotran_gag_3 domain-containing protein4.6e-2248.25Show/hide
Query:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ
        WY +SGA++H+T  L N S      G   V++ NG+ L I+H GSS+L   NR   L  +LHVP +TKN ISVSQF  DN+V FEFH+  C++KD  SGQ
Subjt:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ

Query:  IISVGEMFGGLYRI
        ++  G +  GLYR+
Subjt:  IISVGEMFGGLYRI

A0A1U8IHB6 uncharacterized protein LOC1078945142.7e-2247.79Show/hide
Query:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ
        WY DSGAT+H+T  + N S+  P  G   V + NG+S+ I H G+S +  G+R   L+ +LHVP + KN +SV QF KDN V FEFH   C++KD Q+  
Subjt:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ

Query:  IISVGEMFGGLYR
        I+ VG M  GLYR
Subjt:  IISVGEMFGGLYR

A0A1U8P8H3 uncharacterized protein LOC1079563221.1e-2039.42Show/hide
Query:  ANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYI
        ++L  + WY DSGA++HVT  L N     P  G   + + NG  + + H GS R T       L+ +LHVP + KN +SV+QF KDN V FEFH + C++
Subjt:  ANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYI

Query:  KDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFV
        KD ++  I+ VG +  GLY+      DL   P V  V
Subjt:  KDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFV

A0A314KNU6 Uncharacterized protein5.1e-2137.11Show/hide
Query:  PPLISPQIQIPPAEIQSQMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISV
        PP   P  +  P +      + +    WY DSGAT HVT  + N S      G   + + NG  L I H GSS L+   R   L  +LHVP ++KN +SV
Subjt:  PPLISPQIQIPPAEIQSQMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISV

Query:  SQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFVEE
        S+ T+DN+V  +FH   CY+KD Q  +++  G +  GLYR+Q +SP     P   F+ E
Subjt:  SQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFVEE

A0A6P4N3Y9 uncharacterized protein LOC1084627861.3e-2141.46Show/hide
Query:  LEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKD
        + P  WYLDSG ++HVT  L N     P  G   + + NG+S+ + H GS  L   +R   L  +LHVP + KN +SV+QF KDN V FEFH + C++KD
Subjt:  LEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKD

Query:  RQSGQIISVGEMFGGLYRIQRNS
         ++G ++ VG +  GLYR   ++
Subjt:  RQSGQIISVGEMFGGLYRIQRNS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1336.81Show/hide
Query:  LISPQIQIPPAEIQS-QMQANL------EPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTK
        L S   Q PP+     Q +ANL        + W LDSGAT H+T    N S   P  G   V +A+GS++ I HTGS+ L+  +R   L  +L+VP++ K
Subjt:  LISPQIQIPPAEIQS-QMQANL------EPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTK

Query:  NFISVSQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLY
        N ISV +    N V  EF   S  +KD  +G  +  G+    LY
Subjt:  NFISVSQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1440.18Show/hide
Query:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ
        W LDSGAT H+T    N S   P  G   V +A+GS++ I HTGS+ L   +R   L K+L+VP++ KN ISV +    N V  EF   S  +KD  +G 
Subjt:  WYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANGSSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQ

Query:  IISVGEMFGGLY
         +  G+    LY
Subjt:  IISVGEMFGGLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGTTCAAACCCTCGTGAGAGCAATGATCAAGATCAAGAAAACGAGTGTGCATGTGGTCCAAGAATTAACCCTGGCGATCAAATCGAGACGATTTCAGAAACAGA
AAGTCCAACATTTGGTGTGATGAACTCGAACAATCCTCCCTTAATTTCCCCCCAAATTCAAATCCCTCCAGCGGAGATTCAGTCTCAAATGCAGGCTAATCTGGAGCCTC
ATTGGTGGTATCTTGATTCTGGTGCGACCGATCATGTGACGAAATATCTTGTGAATTTTTCAAGCTTCTTCCCTTGTAACGGTCGTCAAGTGGTTCGACTTGCTAATGGA
TCGTCTTTGCAGATCGAGCATACTGGTTCTTCTCGTTTGACATTCGGTAATCGTTGTTGCATTCTGAGAAAATTGTTGCATGTTCCTGATATGACCAAGAATTTTATCAG
TGTCAGCCAATTTACCAAAGATAACGATGTCTTGTTTGAGTTTCACGCGTTGTCGTGTTATATAAAGGATCGCCAAAGCGGACAAATTATTAGCGTAGGAGAAATGTTTG
GTGGACTGTACAGAATTCAAAGGAACTCACCTGACCTTCACGTGCTACCAACGGTAAAATTTGTAGAAGAGGGGGAAGACGTGGCCTGCAAGACAGAAAACCTGCACACC
GGTGTGGTGTTCGCCACACTGACTCCGATGCTTAAGTCAGCAAGCAGAACGGTGGGGAGTGGAAAAAGTTCGGATCCCTTCTTCAGTGGTGAAGAGGGGTATAAATACCT
GCTCATCCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGGCAAATCATGCGAAACCGGGGCGACCAGAGACGGTAGGGACAGAACGGAGTCGGGAGAACTCGACCC
GCGCGAGCAGGACGAGGCTGACCATGGGCCTCGGCCTCGGCCTTAGGCCGAGGCCGAGCACGGGGTCGGGCCAAAAGCTCGACCCCTTTGGTCTTGGCTCGTCCTGCTTG
TCGGTCTCGCCTTTGGAGTCCACCCCTCAGTCCTATTTCTGTCCATTGTCCTCGTCAGCTCCTTGTACATCGGAGTGGTCCAAAATCACCTATAACACTCATGAATCACC
TTATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGTTCAAACCCTCGTGAGAGCAATGATCAAGATCAAGAAAACGAGTGTGCATGTGGTCCAAGAATTAACCCTGGCGATCAAATCGAGACGATTTCAGAAACAGA
AAGTCCAACATTTGGTGTGATGAACTCGAACAATCCTCCCTTAATTTCCCCCCAAATTCAAATCCCTCCAGCGGAGATTCAGTCTCAAATGCAGGCTAATCTGGAGCCTC
ATTGGTGGTATCTTGATTCTGGTGCGACCGATCATGTGACGAAATATCTTGTGAATTTTTCAAGCTTCTTCCCTTGTAACGGTCGTCAAGTGGTTCGACTTGCTAATGGA
TCGTCTTTGCAGATCGAGCATACTGGTTCTTCTCGTTTGACATTCGGTAATCGTTGTTGCATTCTGAGAAAATTGTTGCATGTTCCTGATATGACCAAGAATTTTATCAG
TGTCAGCCAATTTACCAAAGATAACGATGTCTTGTTTGAGTTTCACGCGTTGTCGTGTTATATAAAGGATCGCCAAAGCGGACAAATTATTAGCGTAGGAGAAATGTTTG
GTGGACTGTACAGAATTCAAAGGAACTCACCTGACCTTCACGTGCTACCAACGGTAAAATTTGTAGAAGAGGGGGAAGACGTGGCCTGCAAGACAGAAAACCTGCACACC
GGTGTGGTGTTCGCCACACTGACTCCGATGCTTAAGTCAGCAAGCAGAACGGTGGGGAGTGGAAAAAGTTCGGATCCCTTCTTCAGTGGTGAAGAGGGGTATAAATACCT
GCTCATCCTCCTAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGGCAAATCATGCGAAACCGGGGCGACCAGAGACGGTAGGGACAGAACGGAGTCGGGAGAACTCGACCC
GCGCGAGCAGGACGAGGCTGACCATGGGCCTCGGCCTCGGCCTTAGGCCGAGGCCGAGCACGGGGTCGGGCCAAAAGCTCGACCCCTTTGGTCTTGGCTCGTCCTGCTTG
TCGGTCTCGCCTTTGGAGTCCACCCCTCAGTCCTATTTCTGTCCATTGTCCTCGTCAGCTCCTTGTACATCGGAGTGGTCCAAAATCACCTATAACACTCATGAATCACC
TTATGCTTAA
Protein sequenceShow/hide protein sequence
MDGSNPRESNDQDQENECACGPRINPGDQIETISETESPTFGVMNSNNPPLISPQIQIPPAEIQSQMQANLEPHWWYLDSGATDHVTKYLVNFSSFFPCNGRQVVRLANG
SSLQIEHTGSSRLTFGNRCCILRKLLHVPDMTKNFISVSQFTKDNDVLFEFHALSCYIKDRQSGQIISVGEMFGGLYRIQRNSPDLHVLPTVKFVEEGEDVACKTENLHT
GVVFATLTPMLKSASRTVGSGKSSDPFFSGEEGYKYLLILLGFLGIRRRFGANHAKPGRPETVGTERSRENSTRASRTRLTMGLGLGLRPRPSTGSGQKLDPFGLGSSCL
SVSPLESTPQSYFCPLSSSAPCTSEWSKITYNTHESPYA