CuGenDBv2

Lag0028473 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028473
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon opus
Genome location: chr8:22784011..22792651
RNA-Seq Expression: Lag0028473
Synteny: Lag0028473
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
XP_030477929.1 uncharacterized protein LOC115694963 [Cannabis sativa] (e value: 1.1e-06; %identity: 33.58)
Query:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIG
        D   ++   ++   P +A  R+LIDVQ+ ELT++V+DQKV F++F+AM++P++ + CS I  +D  V + F KE    E     F E+    + ED ++ 
Subjt:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIG

Query:  GLEYEHEDAGEISSCKRTFESFEPIDKKSKPIEP
         +    E    I + K+ FES E  +   KP +P
Subjt:  GLEYEHEDAGEISSCKRTFESFEPIDKKSKPIEP

XP_030478180.1 uncharacterized protein LOC115695237 [Cannabis sativa] (e value: 4.3e-06; %identity: 49.09)
Query:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKEL
        R+LIDV++ ELT++V+DQKV F++F+AM++PN+ + CSC+  +D  V + F KE+
Subjt:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKEL

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa] (e value: 1.5e-06; %identity: 36.84)
Query:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIGGLEYEHEDAGEISSCKRTFE
        R+LIDVQ+EELT++V+DQKV F++F+AM++P++ + CS I  +D  V + F KE    E     F E+    + ED ++  +    E    +   K+ FE
Subjt:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIGGLEYEHEDAGEISSCKRTFE

Query:  SFEPIDKKSKPIEP
        S E  +   KP +P
Subjt:  SFEPIDKKSKPIEP

XP_030504461.1 uncharacterized protein LOC115719521 [Cannabis sativa] (e value: 3.3e-06; %identity: 32.33)
Query:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEH
        D   ++   ++   P +A  R+LIDVQ+ ELT++V+DQKV F++F+AM++P++ + CS +  +D  V + F K++ +    +     +EDLE    + E 
Subjt:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEH

Query:  -----EDAGEISSCKRTFESFEPIDKKSKPIEP
             E        KR FES E  +   KP++P
Subjt:  -----EDAGEISSCKRTFESFEPIDKKSKPIEP

XP_030504924.1 uncharacterized protein LOC115719886 [Cannabis sativa] (e value: 1.1e-06; %identity: 33.58)
Query:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIG
        D   ++   ++   P +A  R+LIDVQ+ ELT++V+DQKV F++F+AM++P++ + CS I  +D  V + F KE    E     F E+    + ED ++ 
Subjt:  DICCEKSATVVHKVPPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELM--EYHTQKFGEI----QIEDLEIG

Query:  GLEYEHEDAGEISSCKRTFESFEPIDKKSKPIEP
         +    E    I + K+ FES E  +   KP +P
Subjt:  GLEYEHEDAGEISSCKRTFESFEPIDKKSKPIEP

TrEMBL top hits
A0A151QZB8 Transposon Ty3-G Gag-Pol polyprotein (Fragment) (e value: 3.9e-05; %identity: 31.03)
Query:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAG-EISSCKRTF
        P +   R LIDV   +L ++V+D++V FD+FDAM YPND   C  +  +D  VED   ++ + + T       +E + I  +EY HE+   EI +C ++ 
Subjt:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAG-EISSCKRTF

Query:  ESFEPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAV
        +S++ +      IE      L ++ +++K  L+ ++F  +H+K V
Subjt:  ESFEPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAV

A0A5B6VBU6 Reverse transcriptase-like protein (e value: 4.7e-06; %identity: 44.44)
Query:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLE
        R++IDVQ  ELTI+V+DQ++ F++F A+KY +D   C  +  LD  VE+ FEK   EYH +K  E    D++
Subjt:  RSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLE

A0A5E4GQP1 PREDICTED: LOW QUALITY PROTEIN (Fragment) (e value: 5.2e-05; %identity: 33.94)
Query:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTFE
        P + +  + IDV    LT+++  + VKF +FDAM+YP+DF  C  I   D FV+D F K + + + +K   + +  +  G L Y      E+     T E
Subjt:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTFE

Query:  SFEPIDKKS
        S  PI  KS
Subjt:  SFEPIDKKS

A0A6J1DTH7 uncharacterized protein LOC111022971 (e value: 5.2e-05; %identity: 25.99)
Query:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTFE
        P +A   ++ +V+  E+T+KV++++VKF++ DAMK P D +     +W    V  + +   ++Y    F +   +   I  +    E   E+  CK T  
Subjt:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTFE

Query:  SF-EPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAVKRPWYDDFSNYLDFGNLPPGLSKREMKDFFHE
           + + +   P++         Q++IR+ F+DEQL  + ++      WY D +NYL    +P   S++++K F H+
Subjt:  SF-EPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAVKRPWYDDFSNYLDFGNLPPGLSKREMKDFFHE

A0A6P4D2E7 uncharacterized protein LOC107484929 (e value: 2.3e-05; %identity: 26.55)
Query:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPND-FDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTF
        P +A I  +IDVQ  +L +++H++K+ F++F AM YP D    C  +  ++  V++ FE+E  E+   K G              E++ A  +S      
Subjt:  PPVAQIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPND-FDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTF

Query:  ESFEPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAVKRPWYDDFSNYLDFGNLPPGLSKREMKDFFHE
            P +K +           +  + + + F DEQL  V      K PW+ D +N+   G+LPPG++K + +   ++
Subjt:  ESFEPIDKKSKPIEPYYSLTLLQQSKIRKSFLDEQLFTVAHIKAVKRPWYDDFSNYLDFGNLPPGLSKREMKDFFHE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGAGAGACTCCAAACCCAAAGGGATGATAGGGAAGTAGGATCCACTTCCATAGAGTGTCTAGTGAGTCATGTGGCCACTGATTCATTAGCGCCAATTCAGGAGCCTAA
TCCTTTCAATTTTGCTGAGTTTGAGCATATAGGGGTGAGCTGTATAGAAAAAGAGGAAGAATTGGATAGTTCTAGCACCCACCAAGAACAAGTTTGTGAGGAAGAAAAAG
AAAATGAGCATGTAGTGTTTGAGGAGTTTCTAGAGATAGAGTTTAAAGTTGAAAGGCCTTTGTTTGTTCTATCCTTCACATCCTCTCTTGATAATGAGTGTTCTTTTTCT
TGTTCTCACGAGTCAGAGTCAGACTGCGTGGAAAATTCTGGAATCGCGATAGCGTCTCAACGCTGTCTATGTAGCGTCTCGACGCTGCGACGTCTTGGAGCCCAAGAAAA
GATGACGTCAGAGCGTCTCGACGCTGTCCTTGTAGCGTCTCGACGCTGCGCCTGCTGGGAGCCTATAAAAGGAGCCTTTTGCATCCATATTTGTCCATCATTGGGGTTAC
AAACGCACGTGAAGGATTCACTTGCTGTTGATGGTCGATATCCGTGGACACAGAAAGATATCTGCTGTGAGAAGAGTGCAACTGTGGTCCATAAGGTCCCACCAGTAGCT
CAGATAAGATCTTTGATAGATGTCCAACATGAGGAGCTTACAATAAAGGTGCATGACCAAAAGGTGAAGTTTGATATGTTTGATGCAATGAAATATCCAAATGATTTTGA
TTTTTGCTCGTGCATTCAGTGGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGG
AAATAGGTGGATTGGAGTATGAGCATGAAGATGCAGGTGAGATTTCTAGTTGTAAGAGGACTTTTGAATCCTTTGAGCCAATAGATAAGAAATCCAAGCCTATTGAACCT
TATTATTCATTGACATTGCTCCAGCAATCTAAGATTAGGAAATCCTTCCTTGATGAGCAGTTATTTACTGTAGCTCATATTAAGGCAGTGAAAAGACCTTGGTATGATGA
CTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAAGAGAGATGAAAGATTTTTTTCATGAGGATTTCAGTACGGGAGGTTCATATCAAGAAGTCC
TGAAGCGAAATGTTCTAATGGAGCGCGGCTTCGATGATGGCAATGAGTTCCCACATTTTCCTTCAGTGGACATGACTAATCACAGTTGGGAACGATTGTGCACAAAGTCG
GAGCCCACTGTTGATCAACTAGAGTGGGAGTTCTACGCCAACATCGATGAAAATGAAGGATTCTTGATTATTCTTCGTAGAGTTGTTGTCTACTGGAGCCATGTAGTGAT
TAATTCTCTGTTTAATTTGAAAGACTTCCCCCACGCTGTTTTCAATGAAATGTTAGTTGCTTCCTCAAATGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGGATGAGGG
CAAGAACATTCCAGTCAACATACCTGAAGTGTTCAGAGTCCCAAAAGAGTTCCAATGATATAAACTTGTTTGATAAGGGGATTATTGACACTGCCAATCTGGCTAGGCTT
CAAAAAGCAGTGCTTGGTTTTGCAGGATGCTCAGAAAATACTATTGAGCGACGTGAGGGAGCAAAATCTGTGCTGCAGCAAACCTGGGAGCAAAACTGCCACGTCACAGC
TCGATTTAGAGACCGAATCAATCGGTCAAAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGTGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTT
GGTGGTTTGGACCAGCCGTTTCATCCCCATTTGCTGAGAAGGATTAG
mRNA sequence
ATGGAGAGACTCCAAACCCAAAGGGATGATAGGGAAGTAGGATCCACTTCCATAGAGTGTCTAGTGAGTCATGTGGCCACTGATTCATTAGCGCCAATTCAGGAGCCTAA
TCCTTTCAATTTTGCTGAGTTTGAGCATATAGGGGTGAGCTGTATAGAAAAAGAGGAAGAATTGGATAGTTCTAGCACCCACCAAGAACAAGTTTGTGAGGAAGAAAAAG
AAAATGAGCATGTAGTGTTTGAGGAGTTTCTAGAGATAGAGTTTAAAGTTGAAAGGCCTTTGTTTGTTCTATCCTTCACATCCTCTCTTGATAATGAGTGTTCTTTTTCT
TGTTCTCACGAGTCAGAGTCAGACTGCGTGGAAAATTCTGGAATCGCGATAGCGTCTCAACGCTGTCTATGTAGCGTCTCGACGCTGCGACGTCTTGGAGCCCAAGAAAA
GATGACGTCAGAGCGTCTCGACGCTGTCCTTGTAGCGTCTCGACGCTGCGCCTGCTGGGAGCCTATAAAAGGAGCCTTTTGCATCCATATTTGTCCATCATTGGGGTTAC
AAACGCACGTGAAGGATTCACTTGCTGTTGATGGTCGATATCCGTGGACACAGAAAGATATCTGCTGTGAGAAGAGTGCAACTGTGGTCCATAAGGTCCCACCAGTAGCT
CAGATAAGATCTTTGATAGATGTCCAACATGAGGAGCTTACAATAAAGGTGCATGACCAAAAGGTGAAGTTTGATATGTTTGATGCAATGAAATATCCAAATGATTTTGA
TTTTTGCTCGTGCATTCAGTGGTTGGATGAGTTTGTTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGG
AAATAGGTGGATTGGAGTATGAGCATGAAGATGCAGGTGAGATTTCTAGTTGTAAGAGGACTTTTGAATCCTTTGAGCCAATAGATAAGAAATCCAAGCCTATTGAACCT
TATTATTCATTGACATTGCTCCAGCAATCTAAGATTAGGAAATCCTTCCTTGATGAGCAGTTATTTACTGTAGCTCATATTAAGGCAGTGAAAAGACCTTGGTATGATGA
CTTTTCCAATTACCTTGATTTTGGAAATTTGCCTCCTGGTTTATCAAAAAGAGAGATGAAAGATTTTTTTCATGAGGATTTCAGTACGGGAGGTTCATATCAAGAAGTCC
TGAAGCGAAATGTTCTAATGGAGCGCGGCTTCGATGATGGCAATGAGTTCCCACATTTTCCTTCAGTGGACATGACTAATCACAGTTGGGAACGATTGTGCACAAAGTCG
GAGCCCACTGTTGATCAACTAGAGTGGGAGTTCTACGCCAACATCGATGAAAATGAAGGATTCTTGATTATTCTTCGTAGAGTTGTTGTCTACTGGAGCCATGTAGTGAT
TAATTCTCTGTTTAATTTGAAAGACTTCCCCCACGCTGTTTTCAATGAAATGTTAGTTGCTTCCTCAAATGAGCAACTAAATGCCCAGTGGAGGTTGTCTAGGATGAGGG
CAAGAACATTCCAGTCAACATACCTGAAGTGTTCAGAGTCCCAAAAGAGTTCCAATGATATAAACTTGTTTGATAAGGGGATTATTGACACTGCCAATCTGGCTAGGCTT
CAAAAAGCAGTGCTTGGTTTTGCAGGATGCTCAGAAAATACTATTGAGCGACGTGAGGGAGCAAAATCTGTGCTGCAGCAAACCTGGGAGCAAAACTGCCACGTCACAGC
TCGATTTAGAGACCGAATCAATCGGTCAAAGGCCGAATTGACGGCGGAATGGAGATCTAAGAAGACCATTTGTGATTCCACTCAAGGAGACGAAGTTCTTCCATGTTCTT
GGTGGTTTGGACCAGCCGTTTCATCCCCATTTGCTGAGAAGGATTAG
Protein sequence
MERLQTQRDDREVGSTSIECLVSHVATDSLAPIQEPNPFNFAEFEHIGVSCIEKEEELDSSSTHQEQVCEEEKENEHVVFEEFLEIEFKVERPLFVLSFTSSLDNECSFS
CSHESESDCVENSGIAIASQRCLCSVSTLRRLGAQEKMTSERLDAVLVASRRCACWEPIKGAFCIHICPSLGLQTHVKDSLAVDGRYPWTQKDICCEKSATVVHKVPPVA
QIRSLIDVQHEELTIKVHDQKVKFDMFDAMKYPNDFDFCSCIQWLDEFVEDHFEKELMEYHTQKFGEIQIEDLEIGGLEYEHEDAGEISSCKRTFESFEPIDKKSKPIEP
YYSLTLLQQSKIRKSFLDEQLFTVAHIKAVKRPWYDDFSNYLDFGNLPPGLSKREMKDFFHEDFSTGGSYQEVLKRNVLMERGFDDGNEFPHFPSVDMTNHSWERLCTKS
EPTVDQLEWEFYANIDENEGFLIILRRVVVYWSHVVINSLFNLKDFPHAVFNEMLVASSNEQLNAQWRLSRMRARTFQSTYLKCSESQKSSNDINLFDKGIIDTANLARL
QKAVLGFAGCSENTIERREGAKSVLQQTWEQNCHVTARFRDRINRSKAELTAEWRSKKTICDSTQGDEVLPCSWWFGPAVSSPFAEKD