CuGenDBv2

Lag0032540 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032540
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:34303229..34303887
RNA-Seq Expression: Lag0032540
Synteny: Lag0032540
Gene Ontology terms: NA
InterPro domains: NA
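
The genome location string can be split into a chromosome and a coordinate pair to get the length of the annotated locus. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:34303229..34303887")
print(chrom, span_length(start, end))  # chr11 659
```

Under that assumption the locus spans 659 bp, which is longer than the 450-nt CDS listed under Sequences; the difference may reflect intronic sequence, although the record itself does not say so.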


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAA0051442.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e-value 4.1e-24; identity 33.82%
Query:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKK---------
        M+++EK F+F+ + NK+L ENLDEFK++T+     GEK+G ENEA +L+NS+ + YKEVKT LKYGRE+IT + + + +++KELEL ++ K         
Subjt:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKK---------

Query:  ----------------------------------------------EITEDALATTDQCNNQQSSSECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMY
                                                      E TE   AT  +    ++  E  D ++D GC++HMT +K WF  Y+  +G+ +Y
Subjt:  ----------------------------------------------EITEDALATTDQCNNQQSSSECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMY

Query:  MRNN
        M NN
Subjt:  MRNN

KAA0065687.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e-value 1.5e-26; identity 68.13%
Query:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE
        Y+RE+FFTF+ D NKSL ENL EFK+++S+FK LG+KIGDENE+F+LLNSL EAYKEVK AL+YGR+ ITT G+ SAIRT+EL+L SQ+++
Subjt:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE

XP_038880322.1 uncharacterized protein LOC120071961 [Benincasa hispida]; e-value 3.1e-24; identity 54.01%
Query:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSSSE
        D  K L  NLDEFKR+  EFK+L EKIGDENEAFVLLNSL E YKEVK ALKYGRES+T D I SA+RT+ELEL S KKE                    
Subjt:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSSSE

Query:  CHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN
               PG        K WFSTY+  DGE +YM NN
Subjt:  CHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]; e-value 5.0e-22; identity 60.44%
Query:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE
        ++RE+FFT++ DP KSL +NL+EFK ++S+F+++G+ IG+ENEAF+LLNSL E +K+VKTALKYGRE ITT  I SA+  KELEL   KK+
Subjt:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida]; e-value 2.9e-22; identity 58.7%
Query:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEI
        ++RE+FFT++ DP KSL +NL+EFKR++SEF+++G+ IG+ENEAF+L NSL E +K+VKTALKY R+ IT D I SA+R KELEL    +E+
Subjt:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEI

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 2.0e-24; identity 33.82%
Query:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKK---------
        M+++EK F+F+ + NK+L ENLDEFK++T+     GEK+G ENEA +L+NS+ + YKEVKT LKYGRE+IT + + + +++KELEL ++ K         
Subjt:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKK---------

Query:  ----------------------------------------------EITEDALATTDQCNNQQSSSECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMY
                                                      E TE   AT  +    ++  E  D ++D GC++HMT +K WF  Y+  +G+ +Y
Subjt:  ----------------------------------------------EITEDALATTDQCNNQQSSSECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMY

Query:  MRNN
        M NN
Subjt:  MRNN

A0A5D3BP49 Uncharacterized protein; e-value 1.5e-19; identity 38.73%
Query:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSSSE
        D +KSL ENL++F+++  +  N+ EKI DEN+A +LLNSL E Y+EVK A+KYGR+S+T   +  A++T+ LE+  ++K+  E  +A   + ++ ++   
Subjt:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSSSE

Query:  CH-----DWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN
         H      WI D GC++HMTP + +F  ++K DG  + + +N
Subjt:  CH-----DWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN

A0A5D3CAI4 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 7.3e-27; identity 68.13%
Query:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE
        Y+RE+FFTF+ D NKSL ENL EFK+++S+FK LG+KIGDENE+F+LLNSL EAYKEVK AL+YGR+ ITT G+ SAIRT+EL+L SQ+++
Subjt:  YMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE

A0A5D3DNU1 Putative gag-pol polyprotein; e-value 1.9e-19; identity 29.15%
Query:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE--------
        +Y++EKFF ++ D +KSL ENLDEF+++  +  N+GEK+ DEN+A +LLNSL E Y+EVK A+KYGR+S+T   +  A++T+ LE+  ++K+        
Subjt:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKE--------

Query:  ---------------------------------------ITEDALATTDQCN--NQQSSSECHD-------------------------WIIDPGCSFHM
                                               + +   A+T + N  +  +S+E  D                         WI+D GC+FHM
Subjt:  ---------------------------------------ITEDALATTDQCN--NQQSSSECHD-------------------------WIIDPGCSFHM

Query:  TPEKGWFSTYRKWDGEIMYMRNN
        TP + + + ++K DG  + + +N
Subjt:  TPEKGWFSTYRKWDGEIMYMRNN

A0A6J1DGM8 uncharacterized protein LOC111019900; e-value 1.3e-20; identity 71.62%
Query:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELEL
        D +K+L +NLD+FK+++SEF +LGEKIG ENEAF+LLNSL E+Y+EVK ALKYGRESITTD I SA++TKELEL
Subjt:  DPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELEL
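
The % identity reported for this hit appears to be the number of identical columns in the alignment midline divided by the alignment length: here 53 of 74 columns, i.e. 71.62. The following is a minimal Python sketch of that calculation, assuming the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); `midline_identity` is a hypothetical helper name.

```python
def midline_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    The denominator is the full alignment length (all columns).
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the A0A6J1DGM8 alignment, without the leading indentation.
midline = (
    "D +K+L +NLD+FK+++SEF +LGEKIG ENEAF+LLNSL E+Y+EVK "
    "ALKYGRESITTD I SA++TKELEL"
)
print(f"{midline_identity(midline):.2f}")  # 71.62
```

For alignments that wrap over several blocks, the midlines would need to be concatenated (with their exact column widths preserved) before calling the function.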

SwissProt top hits (hit; e-value; % identity; alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value 8.9e-06; identity 27.93%
Query:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALAT
        +Y++++ +        +   +L+ F  + ++  NLG KI +E++A +LLNSL  +Y  + T + +G+ +I    +TSA+     E + +K E    AL T
Subjt:  MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALAT

Query:  TDQCNNQQSSS
          +  + Q SS
Subjt:  TDQCNNQQSSS

Arabidopsis top hits (hit; e-value; % identity; alignment)
No hits found

Sequences
CDS sequence
ATGTATATGAGGGAGAAGTTCTTTACCTTCAGGACGGATCCAAACAAATCACTGTTTGAGAATCTTGATGAATTCAAAAGAATGACCTCAGAATTCAAGAATTTAGGTGA
GAAAATAGGAGATGAGAATGAAGCATTTGTTCTCTTAAACTCACTAGATGAAGCATACAAAGAGGTAAAGACGGCGTTGAAATATGGAAGAGAATCCATCACAACCGATG
GAATTACCTCAGCAATTAGAACTAAAGAACTTGAACTGTTGTCTCAAAAAAAGGAGATAACTGAAGATGCATTAGCCACAACTGATCAGTGTAACAACCAACAAAGTTCT
TCAGAATGTCATGACTGGATAATAGACCCAGGTTGTTCCTTTCACATGACACCAGAGAAGGGCTGGTTTAGCACCTACAGAAAATGGGATGGTGAAATTATGTACATGAG
AAATAATTAG
mRNA sequence
ATGTATATGAGGGAGAAGTTCTTTACCTTCAGGACGGATCCAAACAAATCACTGTTTGAGAATCTTGATGAATTCAAAAGAATGACCTCAGAATTCAAGAATTTAGGTGA
GAAAATAGGAGATGAGAATGAAGCATTTGTTCTCTTAAACTCACTAGATGAAGCATACAAAGAGGTAAAGACGGCGTTGAAATATGGAAGAGAATCCATCACAACCGATG
GAATTACCTCAGCAATTAGAACTAAAGAACTTGAACTGTTGTCTCAAAAAAAGGAGATAACTGAAGATGCATTAGCCACAACTGATCAGTGTAACAACCAACAAAGTTCT
TCAGAATGTCATGACTGGATAATAGACCCAGGTTGTTCCTTTCACATGACACCAGAGAAGGGCTGGTTTAGCACCTACAGAAAATGGGATGGTGAAATTATGTACATGAG
AAATAATTAG
Protein sequence
MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSS
SECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN
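
As a consistency check, the CDS above can be translated and compared with the listed protein. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate`, `CDS` and `PROTEIN` are names introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

CDS = (
    "ATGTATATGAGGGAGAAGTTCTTTACCTTCAGGACGGATCCAAACAAATCACTGTTTGAGAATCTTGATGAATTCAAAAGAATGACCTCAGAATTCAAGAATTTAGGTGA"
    "GAAAATAGGAGATGAGAATGAAGCATTTGTTCTCTTAAACTCACTAGATGAAGCATACAAAGAGGTAAAGACGGCGTTGAAATATGGAAGAGAATCCATCACAACCGATG"
    "GAATTACCTCAGCAATTAGAACTAAAGAACTTGAACTGTTGTCTCAAAAAAAGGAGATAACTGAAGATGCATTAGCCACAACTGATCAGTGTAACAACCAACAAAGTTCT"
    "TCAGAATGTCATGACTGGATAATAGACCCAGGTTGTTCCTTTCACATGACACCAGAGAAGGGCTGGTTTAGCACCTACAGAAAATGGGATGGTGAAATTATGTACATGAG"
    "AAATAATTAG"
)

PROTEIN = (
    "MYMREKFFTFRTDPNKSLFENLDEFKRMTSEFKNLGEKIGDENEAFVLLNSLDEAYKEVKTALKYGRESITTDGITSAIRTKELELLSQKKEITEDALATTDQCNNQQSS"
    "SECHDWIIDPGCSFHMTPEKGWFSTYRKWDGEIMYMRNN"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
print(len(CDS), len(PROTEIN))          # 450 149
print(translated == PROTEIN + "*")     # True
```

The 450-nt CDS gives 149 residues plus a terminal TAG stop codon, matching the protein sequence listed above.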