; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028471 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028471
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:22707851..22709325
RNA-Seq ExpressionLag0028471
SyntenyLag0028471
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]1.6e-2738.4Show/hide
Query:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE
        E E  + +++ LM  SD    +D  ++EV  +  + +EL++ FE++  DLEKL SKYV L KKY  LS ENKSL+++ AC K                  
Subjt:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE

Query:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL
                                         N N++IE  ++     I                  N+K AL+DK++F+E DS EK+NL+ VLKE EL
Subjt:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL

Query:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMP
           ++L  AKE+IKKLTIGAQ LDKII++GK   DKR LGY+DES T S SKT FVKAS  +P
Subjt:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]4.2e-2575.32Show/hide
Query:  SRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV
        +RIMGLKT+LQ ++KDG SVSQYL++IK+IADKF+ +GEP SYRDHL H+LDGLGSEYNAFVTSI NR+D+P+LEDV
Subjt:  SRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]3.3e-2535.51Show/hide
Query:  GVEKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDC
        G E E  + ++ C MAHSD    +D  ++EVN   L+YDEL++AFE+M  +LEKLGSKYV L  K    + ENKSL ++ ACLK                
Subjt:  GVEKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDC

Query:  DEKNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEK
                                            KNE                                                H+ +NL+ +LK+ 
Subjt:  DEKNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEK

Query:  ELLANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKS
        E  A  EL  AK+ IK+LTIGAQ LDKII+ GKP  DKRGLGY++E  TPS+SKTIFVKAS +MP  + P V LK+
Subjt:  ELLANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKS

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]9.4e-3341.64Show/hide
Query:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE
        E E  + +++ LMAHSD    DD  +++V  + L+ DEL++ FESM  DLEKL SKYV L KKY  L  ENKSL++  AC                    
Subjt:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE

Query:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL
                    KENE+ + +      E N   +K+  IE                              K AL+DK++F+E DS EK+NL+ VLKE EL
Subjt:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL

Query:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPN
           +EL  AKE+IKKLTIGAQ LDKII++GK   DKRGLGY+DES TPS+SKT FVKAS  +P   + N
Subjt:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPN

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]2.6e-2277.03Show/hide
Query:  MGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV
        M LK +LQ I+KD LS+SQYLSQIKD+ADKFSV+GE  SYRDHL HILDGLGSEYNAFVTSIQN  DN ++EDV
Subjt:  MGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV

TrEMBL top hitse value%identityAlignment
A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein7.6e-2838.4Show/hide
Query:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE
        E E  + +++ LM  SD    +D  ++EV  +  + +EL++ FE++  DLEKL SKYV L KKY  LS ENKSL+++ AC K                  
Subjt:  EKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDCDE

Query:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL
                                         N N++IE  ++     I                  N+K AL+DK++F+E DS EK+NL+ VLKE EL
Subjt:  KNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEKEL

Query:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMP
           ++L  AKE+IKKLTIGAQ LDKII++GK   DKR LGY+DES T S SKT FVKAS  +P
Subjt:  LANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMP

A0A5A7U923 Zf-CCHC domain-containing protein/UBN2 domain-containing protein1.3e-1943.64Show/hide
Query:  EKENEDLKSLSCE--LKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLE-----NDKLALIDKIKFIECDSHEKNNLLHVLKEKELLANK
        +KE++D KS   E  +K  ++D      ++E     +   + +  NK  + +  N S +     N+K AL+DK++F+E D  EK+NL+ VLKE EL   +
Subjt:  EKENEDLKSLSCE--LKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLE-----NDKLALIDKIKFIECDSHEKNNLLHVLKEKELLANK

Query:  ELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPN
        +L  AKE+IKKLTI AQ L +II++GK   DKRGLGY+DE  TPS+SKT FVKAS  +P+  +PN
Subjt:  ELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPN

A0A5C7IHH0 Uncharacterized protein1.3e-1638.41Show/hide
Query:  KELGIAKESIKKLTIGAQNLDKII-DMGKPL---NDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKSRIMGLKTQLQCIKKDGLSVSQYLSQ
        K + +A   + + T     + +++ D+ + L    D   +G+  ES     SK     +S ++ S+ LP        +  ++QL  +KK+G +++QYL Q
Subjt:  KELGIAKESIKKLTIGAQNLDKII-DMGKPL---NDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKSRIMGLKTQLQCIKKDGLSVSQYLSQ

Query:  IKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV
         K+I DKF+ IGEP SYRDHL ++L+GLG EY+AFVTSI+NR D P++EDV
Subjt:  IKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV

A0A6J1DQX7 uncharacterized protein LOC1110223152.1e-2575.32Show/hide
Query:  SRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV
        +RIMGLKT+LQ ++KDG SVSQYL++IK+IADKF+ +GEP SYRDHL H+LDGLGSEYNAFVTSI NR+D+P+LEDV
Subjt:  SRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV

A0A6J1DS74 uncharacterized protein LOC1110238061.6e-2535.51Show/hide
Query:  GVEKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDC
        G E E  + ++ C MAHSD    +D  ++EVN   L+YDEL++AFE+M  +LEKLGSKYV L  K    + ENKSL ++ ACLK                
Subjt:  GVEKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINENACLKNGTNNVLLHNEVDSDC

Query:  DEKNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEK
                                            KNE                                                H+ +NL+ +LK+ 
Subjt:  DEKNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEKNNLLHVLKEK

Query:  ELLANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKS
        E  A  EL  AK+ IK+LTIGAQ LDKII+ GKP  DKRGLGY++E  TPS+SKTIFVKAS +MP  + P V LK+
Subjt:  ELLANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-0425.97Show/hide
Query:  KSRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALED
        ++R +  + +L+    D LSV +Y  ++K ++D  + +  P S R  ++H+L+GL  +Y+  +  I+++S  P+  +
Subjt:  KSRIMGLKTQLQCIKKDGLSVSQYLSQIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAATGTTGAAAATTCTCTTTGTACTACTAATGACTTCTTTGATGTGAAGGAGAATTATAATTCTCAAAATCAAGAAAAGTTCTATGGAGTAGAAAAAGAAAAATG
TGATGGATCCTCCGTATGCTTGATGGCTCATTCGGATAATGATTCCAGTGATGATAGTGACGAAAATGAGGTAAATCACAAATCTCTTACTTATGATGAACTTTATGATG
CTTTTGAAAGTATGCATCAAGATTTGGAAAAACTTGGTTCAAAATATGTGAAATTATCAAAGAAATATAAGACATTGTCTATTGAGAATAAATCTTTGATAAATGAAAAT
GCATGTTTGAAAAATGGAACTAATAATGTTTTGTTGCATAATGAGGTTGATAGTGATTGTGATGAGAAAAATGCTAAGGTTAATAGAATTAAATTTCTTGAGAAAGAGAA
TGAGGATCTCAAATCTTTATCTTGTGAATTGAAACTTGAATTTAATGATTTGAGAAACAAGAATGAGAAAATTGAATCTTTTTCATTAGAATTGAAAGATGAAATTGCTA
GCTTGAAAAATAAAATTATTGATTTGGAAACTAGCAATACTTCTTTAGAGAATGATAAACTTGCCTTGATTGACAAGATTAAATTCATTGAATGTGATAGTCATGAGAAA
AATAATCTTTTGCATGTGCTTAAAGAAAAAGAATTGTTAGCTAATAAAGAGCTTGGAATTGCAAAAGAGTCCATTAAGAAATTGACTATAGGTGCACAAAATTTGGATAA
AATTATTGACATGGGTAAACCGTTAAATGACAAAAGAGGTCTTGGTTATGTTGATGAGAGTGTTACTCCATCAAATTCCAAAACTATTTTTGTTAAAGCATCTTCTAGTA
TGCCTAGTGACATTTTGCCGAATGTTTTGCTAAAATCTAGAATAATGGGTCTTAAAACTCAGCTCCAATGTATTAAGAAAGATGGTCTTTCGGTTAGTCAATATCTTTCT
CAGATTAAGGACATTGCTGATAAGTTTTCTGTAATAGGAGAGCCTTTCTCATATAGAGATCATTTGGTGCATATTTTAGATGGACTTGGTAGTGAATATAACGCTTTTGT
AACTTCAATCCAAAATCGGTCTGATAACCCTGCACTTGAGGATGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATAATGTTGAAAATTCTCTTTGTACTACTAATGACTTCTTTGATGTGAAGGAGAATTATAATTCTCAAAATCAAGAAAAGTTCTATGGAGTAGAAAAAGAAAAATG
TGATGGATCCTCCGTATGCTTGATGGCTCATTCGGATAATGATTCCAGTGATGATAGTGACGAAAATGAGGTAAATCACAAATCTCTTACTTATGATGAACTTTATGATG
CTTTTGAAAGTATGCATCAAGATTTGGAAAAACTTGGTTCAAAATATGTGAAATTATCAAAGAAATATAAGACATTGTCTATTGAGAATAAATCTTTGATAAATGAAAAT
GCATGTTTGAAAAATGGAACTAATAATGTTTTGTTGCATAATGAGGTTGATAGTGATTGTGATGAGAAAAATGCTAAGGTTAATAGAATTAAATTTCTTGAGAAAGAGAA
TGAGGATCTCAAATCTTTATCTTGTGAATTGAAACTTGAATTTAATGATTTGAGAAACAAGAATGAGAAAATTGAATCTTTTTCATTAGAATTGAAAGATGAAATTGCTA
GCTTGAAAAATAAAATTATTGATTTGGAAACTAGCAATACTTCTTTAGAGAATGATAAACTTGCCTTGATTGACAAGATTAAATTCATTGAATGTGATAGTCATGAGAAA
AATAATCTTTTGCATGTGCTTAAAGAAAAAGAATTGTTAGCTAATAAAGAGCTTGGAATTGCAAAAGAGTCCATTAAGAAATTGACTATAGGTGCACAAAATTTGGATAA
AATTATTGACATGGGTAAACCGTTAAATGACAAAAGAGGTCTTGGTTATGTTGATGAGAGTGTTACTCCATCAAATTCCAAAACTATTTTTGTTAAAGCATCTTCTAGTA
TGCCTAGTGACATTTTGCCGAATGTTTTGCTAAAATCTAGAATAATGGGTCTTAAAACTCAGCTCCAATGTATTAAGAAAGATGGTCTTTCGGTTAGTCAATATCTTTCT
CAGATTAAGGACATTGCTGATAAGTTTTCTGTAATAGGAGAGCCTTTCTCATATAGAGATCATTTGGTGCATATTTTAGATGGACTTGGTAGTGAATATAACGCTTTTGT
AACTTCAATCCAAAATCGGTCTGATAACCCTGCACTTGAGGATGTCTGA
Protein sequenceShow/hide protein sequence
MYNVENSLCTTNDFFDVKENYNSQNQEKFYGVEKEKCDGSSVCLMAHSDNDSSDDSDENEVNHKSLTYDELYDAFESMHQDLEKLGSKYVKLSKKYKTLSIENKSLINEN
ACLKNGTNNVLLHNEVDSDCDEKNAKVNRIKFLEKENEDLKSLSCELKLEFNDLRNKNEKIESFSLELKDEIASLKNKIIDLETSNTSLENDKLALIDKIKFIECDSHEK
NNLLHVLKEKELLANKELGIAKESIKKLTIGAQNLDKIIDMGKPLNDKRGLGYVDESVTPSNSKTIFVKASSSMPSDILPNVLLKSRIMGLKTQLQCIKKDGLSVSQYLS
QIKDIADKFSVIGEPFSYRDHLVHILDGLGSEYNAFVTSIQNRSDNPALEDV