CuGenDBv2

Lag0028805 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028805
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr8:30916204..30917028
RNA-Seq Expression: Lag0028805
Synteny: Lag0028805
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
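
The genome location above can be split into its chromosome, start, and end fields. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF convention), which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("chr8:30916204..30917028")
    print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 825 bp on chr8.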


Homology
GenBank top hits (accession and description | e-value | % identity)
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] | e-value: 2.9e-17 | identity: 30.88%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------------------------
        MT  VA  V+   T+  +WKALE ++GA +K++ N +R  +Q T+ GS  M EYL  MK  +++L +A   Y                            
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------------------------

Query:  --------------------------SGHKQFSP---------------NRGSNSQ--GNGPSNFNNNSGFRGGNNNNRGRGRGRNNQRGGGPKPTCQLC
                                   G+   SP               N+ SN Q    G +   N  GFRGG    RGRG GRNN      +PTCQ+C
Subjt:  --------------------------SGHKQFSP---------------NRGSNSQ--GNGPSNFNNNSGFRGGNNNNRGRGRGRNNQRGGGPKPTCQLC

Query:  GKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        GK+GH A  CY R+++ +       +    NS  +S + ++ +PE ++D  W AD+GATNHVT D GN+  K
Subjt:  GKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia] | e-value: 4.0e-22 | identity: 36.61%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA--------------TGQYSGHKQ---------
        M+P +A DV++  TSR+VWKALE++Y   NKAR+N L+  LQ T+   +KM +YL+ MKQ ++ L LA              TG  + + Q         
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA--------------TGQYSGHKQ---------

Query:  ------------------FSPNRGSNSQGNGPS-NFNNNSGF------------RGGNNNNRGRGRGR--NNQRGGGPKPTCQLCGKYGHLAPYCYSRFE
                             N  S +  +GPS N+  N               RG   N+RGR RG     QR    +PTCQ+CGK GHLA  CY R  
Subjt:  ------------------FSPNRGSNSQGNGPS-NFNNNSGF------------RGGNNNNRGRGRGR--NNQRGGGPKPTCQLCGKYGHLAPYCYSRFE

Query:  EEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNI
           N  Y  N    GN A +   AYIT PE++ DP WL D+GATNH T D  N+
Subjt:  EEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNI

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] | e-value: 1.8e-38 | identity: 40.94%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY
        MTP++A DVV+FR+SREVWKALE++YGAT+KAR+N LR +LQNTK  S+KM EYL +MKQASE+L+LA                             G+ 
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY

Query:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT
        S                                              G++QF  ++    QG G  N N+        NN RGRGRGR +  RG   KP+
Subjt:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT

Query:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        CQLCGKYGH+A  CY RF+E FNN  S+NNN         ++AY+  PEI+ +P WLAD+GAT+HVT+DL N+  K
Subjt:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] | e-value: 1.8e-38 | identity: 40.94%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY
        MTP++A DVV+FR+SREVWKALE++YGAT+KAR+N LR +LQNTK  S+KM EYL +MKQASE+L+LA                             G+ 
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY

Query:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT
        S                                              G++QF  ++    QG G  N N+        NN RGRGRGR +  RG   KP+
Subjt:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT

Query:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        CQLCGKYGH+A  CY RF+E FNN  S+NNN         ++AY+  PEI+ +P WLAD+GAT+HVT+DL N+  K
Subjt:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

XP_030492910.1 uncharacterized protein LOC115709020 isoform X2 [Cannabis sativa] | e-value: 2.7e-18 | identity: 30.11%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------------------------
        MT A+AT+V+   T+  +WKALE +YGA +K++++  R  +Q T+ G+  M++YL   K  S+ L LA   Y                            
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------------------------

Query:  ---------------------------------------SGHKQFSPNRGSNS-QGNGPSNFNNNSGFRGGNNNNRGRGRGRNNQRGGGPKPTCQLCGKY
                                               + ++  S  RG+NS     PS  N +   RG  N +RGRGRGR+N      KPTCQ+CGK+
Subjt:  ---------------------------------------SGHKQFSPNRGSNS-QGNGPSNFNNNSGFRGGNNNNRGRGRGRNNQRGGGPKPTCQLCGKY

Query:  GHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        GH A  CY+R+ E F     ++ N   N  ++  +A+  SPE+++   W AD+GA++HVT+D  N++ K
Subjt:  GHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

TrEMBL top hits (accession and description | e-value | % identity)
A0A6J1CLV9 uncharacterized protein LOC111012809 | e-value: 1.9e-22 | identity: 36.61%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA--------------TGQYSGHKQ---------
        M+P +A DV++  TSR+VWKALE++Y   NKAR+N L+  LQ T+   +KM +YL+ MKQ ++ L LA              TG  + + Q         
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA--------------TGQYSGHKQ---------

Query:  ------------------FSPNRGSNSQGNGPS-NFNNNSGF------------RGGNNNNRGRGRGR--NNQRGGGPKPTCQLCGKYGHLAPYCYSRFE
                             N  S +  +GPS N+  N               RG   N+RGR RG     QR    +PTCQ+CGK GHLA  CY R  
Subjt:  ------------------FSPNRGSNSQGNGPS-NFNNNSGF------------RGGNNNNRGRGRGR--NNQRGGGPKPTCQLCGKYGHLAPYCYSRFE

Query:  EEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNI
           N  Y  N    GN A +   AYIT PE++ DP WL D+GATNH T D  N+
Subjt:  EEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNI

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 | e-value: 8.6e-39 | identity: 40.94%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY
        MTP++A DVV+FR+SREVWKALE++YGAT+KAR+N LR +LQNTK  S+KM EYL +MKQASE+L+LA                             G+ 
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY

Query:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT
        S                                              G++QF  ++    QG G  N N+        NN RGRGRGR +  RG   KP+
Subjt:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT

Query:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        CQLCGKYGH+A  CY RF+E FNN  S+NNN         ++AY+  PEI+ +P WLAD+GAT+HVT+DL N+  K
Subjt:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 | e-value: 8.6e-39 | identity: 40.94%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY
        MTP++A DVV+FR+SREVWKALE++YGAT+KAR+N LR +LQNTK  S+KM EYL +MKQASE+L+LA                             G+ 
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLA----------------------------TGQY

Query:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT
        S                                              G++QF  ++    QG G  N N+        NN RGRGRGR +  RG   KP+
Subjt:  S----------------------------------------------GHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGRGRGR-NNQRGGGPKPT

Query:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        CQLCGKYGH+A  CY RF+E FNN  S+NNN         ++AY+  PEI+ +P WLAD+GAT+HVT+DL N+  K
Subjt:  CQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

A0A803PHM7 Uncharacterized protein | e-value: 5.8e-27 | identity: 38.43%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQYSGHKQFSPNRGSNSQ-------------
        MT ++AT+V+   TS ++W +LE ++GA  K+R++  R  +Q  + GSM M  +L   KQ ++ L LA   Y      S    +  Q             
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQYSGHKQFSPNRGSNSQ-------------

Query:  GNGPSNFNNNSGFRGGNNNNRGRGR--GRNNQRGGGPKPTCQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADN
        GNG +    N+  RGG NNN  RGR  GR     GGPKPTCQ+CG+YGH A YCY+R+ E F       N    N    ++AA++ +PE+L D  W AD+
Subjt:  GNGPSNFNNNSGFRGGNNNNRGRGR--GRNNQRGGGPKPTCQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADN

Query:  GATNHVTADLGNIATK
        GA+NHVT++  N+  K
Subjt:  GATNHVTADLGNIATK

A0A803QD97 Uncharacterized protein | e-value: 4.1e-25 | identity: 33.85%
Query:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------SG----------------
        MT  +AT+++   +S E+W +LE ++GA +KA+++  R  +Q  + GSM M++YL   KQ S+ L LA   Y          SG                
Subjt:  MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQY----------SG----------------

Query:  --------------------HKQFSPNRGSNSQGNGPSNFNNNSG---FRGGNNNNRGRG------RGRNNQRG----GGPKPTCQLCGKYGHLAPYCYS
                                S N   ++  +  +N  N SG   +  GNNNN+GRG      RGR N RG    GGPKPTCQ+CG+YGH A YCY+
Subjt:  --------------------HKQFSPNRGSNSQGNGPSNFNNNSG---FRGGNNNNRGRG------RGRNNQRG----GGPKPTCQLCGKYGHLAPYCYS

Query:  RFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK
        RF+E F       N    N +  ++ A++ +PE+L D  W A++GA+NHVT++  N+  K
Subjt:  RFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATK

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found
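
The %identity column in the hit tables can be related to the alignment rows shown under each hit. The sketch below assumes the usual BLAST-style midline convention (an identical residue is repeated as its letter, a conservative substitution is shown as `+`, and anything else is a space) and defines identity as identical columns divided by all aligned columns, gaps included. The function name `percent_identity` is hypothetical, and the example covers a single alignment segment only, so it is not expected to reproduce the whole-alignment figures reported in the tables.

```python
def percent_identity(query_row: str, midline_row: str) -> float:
    """Percent of aligned columns whose midline character is a residue letter.

    Both arguments are the aligned strings only, without the 'Query:' label
    or its leading padding. Trailing spaces that were stripped from the
    midline are restored by padding it to the query length.
    """
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    padded = midline_row.ljust(columns)[:columns]
    # Letters mark identities; '+' and ' ' mark non-identical columns.
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


if __name__ == "__main__":
    # Final segment of the A0A803PHM7 alignment shown above.
    query = "GATNHVTADLGNIATK"
    midline = "GA+NHVT++  N+  K"
    print(percent_identity(query, midline))  # 50.0
```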

Sequences
CDS sequence
ATGACACCAGCCGTAGCTACCGATGTGGTTAACTTCAGAACCTCAAGAGAGGTGTGGAAGGCTTTAGAGGAAGTTTATGGAGCAACCAATAAGGCAAGAGTGAATCACCT
CCGTGGGATTCTTCAAAATACCAAAAATGGCTCAATGAAAATGATCGAGTACCTCGCGATAATGAAGCAGGCATCCGAAAATCTCCAGTTGGCAACAGGACAATACTCTG
GACATAAGCAATTTAGTCCGAACAGAGGCAGCAACAGCCAGGGTAATGGCCCATCTAACTTTAATAATAACTCTGGCTTTCGAGGTGGTAACAACAACAATCGGGGCCGT
GGTAGAGGAAGGAACAACCAGCGAGGAGGTGGACCCAAGCCAACATGCCAGTTATGTGGAAAATACGGCCACTTAGCACCATACTGCTATTCTCGCTTTGAGGAAGAATT
CAACAATCCTTACTCTGCAAACAACAACACCCAAGGTAACTCTGCCAGAAGCTCCTCAGCAGCATACATTACCTCTCCTGAAATCCTCAACGATCCGAAGTGGCTAGCAG
ACAATGGAGCCACAAACCATGTCACTGCTGATCTGGGGAATATTGCCACAAAAAAATGA
mRNA sequence
ATGACACCAGCCGTAGCTACCGATGTGGTTAACTTCAGAACCTCAAGAGAGGTGTGGAAGGCTTTAGAGGAAGTTTATGGAGCAACCAATAAGGCAAGAGTGAATCACCT
CCGTGGGATTCTTCAAAATACCAAAAATGGCTCAATGAAAATGATCGAGTACCTCGCGATAATGAAGCAGGCATCCGAAAATCTCCAGTTGGCAACAGGACAATACTCTG
GACATAAGCAATTTAGTCCGAACAGAGGCAGCAACAGCCAGGGTAATGGCCCATCTAACTTTAATAATAACTCTGGCTTTCGAGGTGGTAACAACAACAATCGGGGCCGT
GGTAGAGGAAGGAACAACCAGCGAGGAGGTGGACCCAAGCCAACATGCCAGTTATGTGGAAAATACGGCCACTTAGCACCATACTGCTATTCTCGCTTTGAGGAAGAATT
CAACAATCCTTACTCTGCAAACAACAACACCCAAGGTAACTCTGCCAGAAGCTCCTCAGCAGCATACATTACCTCTCCTGAAATCCTCAACGATCCGAAGTGGCTAGCAG
ACAATGGAGCCACAAACCATGTCACTGCTGATCTGGGGAATATTGCCACAAAAAAATGA
Protein sequence
MTPAVATDVVNFRTSREVWKALEEVYGATNKARVNHLRGILQNTKNGSMKMIEYLAIMKQASENLQLATGQYSGHKQFSPNRGSNSQGNGPSNFNNNSGFRGGNNNNRGR
GRGRNNQRGGGPKPTCQLCGKYGHLAPYCYSRFEEEFNNPYSANNNTQGNSARSSSAAYITSPEILNDPKWLADNGATNHVTADLGNIATKK
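
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the function name `translate` and the table construction are illustrative, not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGACACCAGCCGTAGCTACCGATGTGGTT"))  # MTPAVATDVV
    # Last 12 nucleotides of the CDS, ending in the TGA stop codon.
    print(translate("ACAAAAAAATGA"))  # TKK*
```

Both fragments agree with the start (MTPAVATDVV) and end (TKK) of the protein sequence listed above.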