CuGenDBv2

Lag0035324 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035324
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:18903181..18905685
RNA-Seq Expression: Lag0035324
Synteny: Lag0035324
Gene Ontology terms: NA
InterPro domains: NA
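The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:18903181..18905685")
print(chrom, span_length(start, end))  # chr3 2505
```

If the coordinates turned out to be 0-based half-open instead, the length would be `end - start`, one base shorter.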


Homology
GenBank top hits (e value, % identity, alignment)
KAG7553295.1 hypothetical protein ISN45_Aa06g038290 [Arabidopsis thaliana x Arabidopsis arenosa] | e value: 8.3e-38 | % identity: 67.77
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +NGIFI+QEGYAKE+LK+          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGIIGH
        TTTH+K  KRIL YIKG I +
Subjt:  TTTHWKMEKRILCYIKGIIGH

XP_020262368.1 uncharacterized protein LOC109838328 [Asparagus officinalis] | e value: 6.4e-38 | % identity: 69.75
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEV+Q +NGIFI+QEGYAKE+LKK          T MECG K++KQ+ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG I
Subjt:  TTTHWKMEKRILCYIKGII

XP_038889191.1 uncharacterized mitochondrial protein AtMg00810-like [Benincasa hispida] | e value: 7.3e-42 | % identity: 77.31
Query:  DIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTT
        DIGLMSY+LGIEVKQGE+GIFISQEGY K++LKK          T MECGVKITKQDGGEKV+STYFKSLV SLRYLTCTRPDI YSVGIISR+MEEPTT
Subjt:  DIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTT

Query:  THWKMEKRILCYIKGIIGH
        TH KM K I  YIKG IG+
Subjt:  THWKMEKRILCYIKGIIGH

XP_038895793.1 secreted RxLR effector protein 161-like [Benincasa hispida] | e value: 5.6e-42 | % identity: 73.23
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSYFLGI+VKQGE+GIFISQEGYAKE+LKK          T MECGVKITKQDGGEKV+STYFKSLVGSLRYLTCTR DI+YSVGIIS+++EEP
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGIIGHLHTHLS
        T TH KM KRIL Y+K  IG+  +++S
Subjt:  TTTHWKMEKRILCYIKGIIGHLHTHLS

XP_038896505.1 secreted RxLR effector protein 161-like [Benincasa hispida] | e value: 1.4e-40 | % identity: 70.08
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M +IGLMSY+LGIEVKQGE+ IFISQEGYAK++LKK          T MECGVKI KQDGGEK++STYFKSLV SLRYLTCTRPDI+YSVG ++R+MEEP
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGIIGHLHTHLS
        T TH KM KRIL YIKG+IG+  +++S
Subjt:  TTTHWKMEKRILCYIKGIIGHLHTHLS

TrEMBL top hits (e value, % identity, alignment)
Q9C536 Copia-type polyprotein, putative | e value: 5.3e-38 | % identity: 68.91
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +NGIFI+QEGYAKE+LKK          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG +
Subjt:  TTTHWKMEKRILCYIKGII

Q9C739 Copia-type polyprotein, putative | e value: 5.3e-38 | % identity: 68.91
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +NGIFI+QEGYAKE+LKK          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG +
Subjt:  TTTHWKMEKRILCYIKGII

Q9M2D1 Copia-type polyprotein | e value: 5.3e-38 | % identity: 68.91
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +NGIFI+QEGYAKE+LKK          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG +
Subjt:  TTTHWKMEKRILCYIKGII

Q9SFE1 T26F17.17 | e value: 4.5e-37 | % identity: 68.07
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +N IFI+QEGYAKE+LKK          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG +
Subjt:  TTTHWKMEKRILCYIKGII

Q9SXB2 T28P6.8 protein | e value: 5.3e-38 | % identity: 68.91
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        M DIGLMSY+LGIEVKQ +NGIFI+QEGYAKE+LKK          T MECG+K++K++ GE VD T FKSLVGSLRYLTCTRPDI+Y+VG++SRYME P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKL---------TWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
        TTTH+K  KRIL YIKG +
Subjt:  TTTHWKMEKRILCYIKGII

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 2.1e-12 | % identity: 32.77
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMECGVKITKQD--------GGEKVDSTYFKSLVGSLRY-LTCTRPDIVYSVGIISRYMEEP
        M D+  + +F+GI ++  E+ I++SQ  Y K++L K     C    T             ++  +T  +SL+G L Y + CTRPD+  +V I+SRY  + 
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMECGVKITKQD--------GGEKVDSTYFKSLVGSLRY-LTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGII
         +  W+  KR+L Y+KG I
Subjt:  TTTHWKMEKRILCYIKGII

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 9.0e-11 | % identity: 31.06
Query:  MMDIGLMSYFLGIEV--KQGENGIFISQEGYAKEMLK------------------KLTWMECGVKITKQDGGEKVDSTYFKSLVGSLRY-LTCTRPDIVY
        M D+G     LG+++  ++    +++SQE Y + +L+                  KL+   C   + ++    KV    + S VGSL Y + CTRPDI +
Subjt:  MMDIGLMSYFLGIEV--KQGENGIFISQEGYAKEMLK------------------KLTWMECGVKITKQDGGEKVDSTYFKSLVGSLRY-LTCTRPDIVY

Query:  SVGIISRYMEEPTTTHWKMEKRILCYIKGIIG
        +VG++SR++E P   HW+  K IL Y++G  G
Subjt:  SVGIISRYMEEPTTTHWKMEKRILCYIKGIIG

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 1.5e-18 | % identity: 40
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC-------GVKITKQDGGEKV-DSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPT
        M D+G + YFLGI++K   +G+F+SQ  YA+++L     ++C        +K+       K  D + F+S+VG+L+YLT TRPDI Y+V I+ + M EPT
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC-------GVKITKQDGGEKV-DSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPT

Query:  TTHWKMEKRILCYIKGIIGH-LHTH
           + + KR+L Y+KG I H L+ H
Subjt:  TTHWKMEKRILCYIKGIIGH-LHTH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.9e-14 | % identity: 37.39
Query:  MSYFLGIEVKQGENGIFISQEGYAKEMLKK---------LTWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTTTHWK
        + YFLGIE K+   G+ +SQ  Y  ++L +          T M    K++   G +  D T ++ +VGSL+YL  TRPDI Y+V  +S++M  PT  H +
Subjt:  MSYFLGIEVKQGENGIFISQEGYAKEMLKK---------LTWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTTTHWK

Query:  MEKRILCYIKGIIGH
          KRIL Y+ G   H
Subjt:  MEKRILCYIKGIIGH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 3.2e-16 | % identity: 38.26
Query:  MSYFLGIEVKQGENGIFISQEGYAKEMLKK---------LTWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTTTHWK
        + YFLGIE K+   G+ +SQ  Y  ++L +          T M    K+T   G +  D T ++ +VGSL+YL  TRPD+ Y+V  +S+YM  PT  HW 
Subjt:  MSYFLGIEVKQGENGIFISQEGYAKEMLKK---------LTWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTTTHWK

Query:  MEKRILCYIKGIIGH
          KR+L Y+ G   H
Subjt:  MEKRILCYIKGIIGH

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 4.7e-15 | % identity: 34.17
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC---------GVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP
        + D+G + YFLG+E+ +   GI I Q  YA ++L +   + C          V  +   GG+ VD+  ++ L+G L YL  TR DI ++V  +S++ E P
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC---------GVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEP

Query:  TTTHWKMEKRILCYIKGIIG
           H +   +IL YIKG +G
Subjt:  TTTHWKMEKRILCYIKGIIG

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 1.1e-19 | % identity: 40
Query:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC-------GVKITKQDGGEKV-DSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPT
        M D+G + YFLGI++K   +G+F+SQ  YA+++L     ++C        +K+       K  D + F+S+VG+L+YLT TRPDI Y+V I+ + M EPT
Subjt:  MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMEC-------GVKITKQDGGEKV-DSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPT

Query:  TTHWKMEKRILCYIKGIIGH-LHTH
           + + KR+L Y+KG I H L+ H
Subjt:  TTHWKMEKRILCYIKGIIGH-LHTH
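The % identity figures in the hit lists can be related to the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `midline_stats` is a hypothetical helper, and the example input is the final segment of the first GenBank hit (KAG7553295.1).

```python
def midline_stats(midline: str, width: int) -> dict:
    """Count identities and positives on a BLAST-style alignment midline.

    `width` is the number of alignment columns (the length of the Query
    segment); the midline is padded in case trailing spaces were stripped.
    """
    padded = midline.ljust(width)[:width]
    identities = sum(ch.isalpha() for ch in padded)  # letters = identical residues
    positives = identities + padded.count("+")       # '+' = conservative change
    return {
        "columns": width,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / width, 2),
    }


query = "TTTHWKMEKRILCYIKGIIGH"
midline = "TTTH+K  KRIL YIKG I +"
print(midline_stats(midline, len(query)))
# {'columns': 21, 'identities': 14, 'positives': 16, 'percent_identity': 66.67}
```

A whole-hit figure would sum identities and columns over all segments of that hit before dividing, so a single segment need not match the value reported for the hit.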


Sequences
CDS sequence
ATGATGGACATTGGTCTCATGAGTTACTTTCTTGGGATTGAGGTCAAACAAGGTGAAAATGGGATTTTTATATCCCAAGAGGGCTATGCTAAAGAAATGCTGAAAAAGTT
AACGTGGATGGAATGTGGAGTCAAAATCACCAAGCAAGATGGAGGAGAAAAGGTAGATTCTACATATTTCAAAAGCTTGGTTGGGAGTTTGAGATACTTGACATGCACAA
GACCGGATATTGTTTATTCAGTTGGAATTATTAGTCGATATATGGAGGAGCCGACAACAACACATTGGAAAATGGAAAAAAGGATACTTTGCTACATCAAAGGGATAATT
GGCCATCTCCACACACATCTTTCTATTTTTCCCCCTCCAACCCCGACCCCCATCACCTCCGCTTGCTCTGACCCCAACATCTCGTTGTCGTCGTGGTCGTCGTCGTCTTC
CGGTGAGGAGAAGCAGCTGTATTTTACAGATGTTACTCCAAGAGGTCGAGTTGATGGAGGAGAGACTATTCTTCCATTGTTGCACCATTCCAATCTCTTTTCTCTTTCAC
CTCCTCGGTGTTTAAGCAATGGCGGTCATTGTAAGGACAATGTTGTTGCAACTTGTTGGACCTTCCAGGCAATGATTGAATTGAATGTTCTACCCTATATACTGATTGCA
TCATCAGAATCTCAATACATGATGGAAGTGTGCTTGGATGGGTTCTCCTCTCAACTTTCAGATGAGAATTGTTGCAAGAGGGGAAACAAGTAA
mRNA sequence
ATGATGGACATTGGTCTCATGAGTTACTTTCTTGGGATTGAGGTCAAACAAGGTGAAAATGGGATTTTTATATCCCAAGAGGGCTATGCTAAAGAAATGCTGAAAAAGTT
AACGTGGATGGAATGTGGAGTCAAAATCACCAAGCAAGATGGAGGAGAAAAGGTAGATTCTACATATTTCAAAAGCTTGGTTGGGAGTTTGAGATACTTGACATGCACAA
GACCGGATATTGTTTATTCAGTTGGAATTATTAGTCGATATATGGAGGAGCCGACAACAACACATTGGAAAATGGAAAAAAGGATACTTTGCTACATCAAAGGGATAATT
GGCCATCTCCACACACATCTTTCTATTTTTCCCCCTCCAACCCCGACCCCCATCACCTCCGCTTGCTCTGACCCCAACATCTCGTTGTCGTCGTGGTCGTCGTCGTCTTC
CGGTGAGGAGAAGCAGCTGTATTTTACAGATGTTACTCCAAGAGGTCGAGTTGATGGAGGAGAGACTATTCTTCCATTGTTGCACCATTCCAATCTCTTTTCTCTTTCAC
CTCCTCGGTGTTTAAGCAATGGCGGTCATTGTAAGGACAATGTTGTTGCAACTTGTTGGACCTTCCAGGCAATGATTGAATTGAATGTTCTACCCTATATACTGATTGCA
TCATCAGAATCTCAATACATGATGGAAGTGTGCTTGGATGGGTTCTCCTCTCAACTTTCAGATGAGAATTGTTGCAAGAGGGGAAACAAGTAA
Protein sequence
MMDIGLMSYFLGIEVKQGENGIFISQEGYAKEMLKKLTWMECGVKITKQDGGEKVDSTYFKSLVGSLRYLTCTRPDIVYSVGIISRYMEEPTTTHWKMEKRILCYIKGII
GHLHTHLSIFPPPTPTPITSACSDPNISLSSWSSSSSGEEKQLYFTDVTPRGRVDGGETILPLLHHSNLFSLSPPRCLSNGGHCKDNVVATCWTFQAMIELNVLPYILIA
SSESQYMMEVCLDGFSSQLSDENCCKRGNK
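
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code (the page does not say which table was used); `translate` is a hypothetical helper, and only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: the standard nuclear table applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_prefix = "ATGATGGACATTGGTCTCATGAGTTACTTT"  # first 30 bases of the CDS above
print(translate(cds_prefix))  # MMDIGLMSYF
```

Passing the full CDS, with its line breaks joined, in place of `cds_prefix` would allow a comparison against the complete protein sequence listed above.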