CuGenDBv2

Lag0036668 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036668
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon Tf2-1 polyprotein isoform X1
Genome location: chr2:291426..295952
RNA-Seq Expression: Lag0036668
Synteny: Lag0036668
Gene Ontology terms: GO:0006259 - DNA metabolic process (biological process)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
GEX17340.1 putative mitochondrial protein [Tanacetum cinerariifolium]    9.0e-07    30.19
Query:  TNLKWTYDSASQTDGQTEVRRIEMETYLRCLQCIVLASG-------------------------NDEEVWMVLRTNLQTGLEA-----PIDRRSTSGGTL
        T LK++     QTDGQTEV    +ETYLR L+  +L S                          +   V+ V +  +  G+++     P           
Subjt:  TNLKWTYDSASQTDGQTEVRRIEMETYLRCLQCIVLASG-------------------------NDEEVWMVLRTNLQTGLEA-----PIDRRSTSGGTL

Query:  EDFRHSKSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG-VGK
        E     +  A  ++ E+LI W D+   ++TWE  V I  QFP+FHLEDK+  W+G +GK
Subjt:  EDFRHSKSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG-VGK

GEY30190.1 hypothetical protein [Tanacetum cinerariifolium]    7.6e-06    28.52
Query:  RLLTKLMGYDFDIVYKPRVENKVVMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFH-SPVGGHGGC-------LY--------I
        R ++KL+GYDF+I Y P  EN V        +   L         G + V   ++VLP  SP I  L   FH S VGGH G        LY         
Subjt:  RLLTKLMGYDFDIVYKPRVENKVVMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFH-SPVGGHGGC-------LY--------I

Query:  LKTEIWCS---PVCFGKNYSNG----KGVVTNLKWTYDSASQTDGQTEVRRIEMETYLRCLQCIVLASGNDEEVWMVLRTNLQTGLEAPIDRRSTSGGTL
        L T+IW         G   S+G      VV  L        +    T +      T+   L   +      ++V   L+ +L    E  + R +      
Subjt:  LKTEIWCS---PVCFGKNYSNG----KGVVTNLKWTYDSASQTDGQTEVRRIEMETYLRCLQCIVLASGNDEEVWMVLRTNLQTGLEAPIDRRSTSGGTL

Query:  EDFRHSKSPADPSKL--EVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWD
         +F   +     S++  EVLI W D+   ++TWE  V I  QF +FHLEDK+  W+
Subjt:  EDFRHSKSPADPSKL--EVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWD

PNX82350.1 retrotransposon protein [Trifolium pratense]    3.4e-06    44.05
Query:  LTKLMGYDFDIVYKPRVENKV---VMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHS-PVGGHGGCL
        L KL+GY F++ YKP +ENKV   +  C   + +    P  WL  EG  +++ G++V+  DSP+I LLL+ FHS P+GGH G L
Subjt:  LTKLMGYDFDIVYKPRVENKV---VMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHS-PVGGHGGCL

TXG60193.1 hypothetical protein EZV62_014766 [Acer yangbiense]    5.8e-06    57.14
Query:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG
        KS A P  +EVLI W  +   +ATWED + IVNQFP+FHLEDK+  W G
Subjt:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG

TXG69438.1 hypothetical protein EZV62_004373 [Acer yangbiense]    5.8e-06    57.14
Query:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG
        KS A P  +EVLI W  +   +ATWED + IVNQFP+FHLEDK+  W G
Subjt:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG

TrEMBL top hits    e value    % identity    Alignment
A0A087GAS3 Uncharacterized protein    2.4e-05    28.65
Query:  GSGAVLMQDGHPIAFFS-------------------------HRLLTKLMGYDFDIVYKPRVENKV--------------VMPCPEFLMLWSLQPS----
        G G VLMQ   PIA+FS                          R LTK++G+DF+I YKPR+ENK                +  P  + L  +  +    
Subjt:  GSGAVLMQDGHPIAFFS-------------------------HRLLTKLMGYDFDIVYKPRVENKV--------------VMPCPEFLMLWSLQPS----

Query:  ------------------GWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFH-SPVGGHGGCLYILKTEIWCSPVCFGK
                           + +V G  ++  G++VLP  SP + L+L+ FH   +GGHGG   +LKT+   S + F K
Subjt:  ------------------GWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFH-SPVGGHGGCLYILKTEIWCSPVCFGK

A0A2K3LUZ6 Retrotransposon protein    1.7e-06    44.05
Query:  LTKLMGYDFDIVYKPRVENKV---VMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHS-PVGGHGGCL
        L KL+GY F++ YKP +ENKV   +  C   + +    P  WL  EG  +++ G++V+  DSP+I LLL+ FHS P+GGH G L
Subjt:  LTKLMGYDFDIVYKPRVENKV---VMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHS-PVGGHGGCL

A0A5C7HSW3 Chromo domain-containing protein    2.8e-06    57.14
Query:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG
        KS A P  +EVLI W  +   +ATWED + IVNQFP+FHLEDK+  W G
Subjt:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG

A0A5C7IJS7 Uncharacterized protein    2.8e-06    57.14
Query:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG
        KS A P  +EVLI W  +   +ATWED + IVNQFP+FHLEDK+  W G
Subjt:  KSPADPSKLEVLIPWTDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDG

A0A6D2HTW2 Uncharacterized protein    1.4e-05    31.19
Query:  MHRDGSGAVLMQDGHPIAFFSHRLLTKLMGYDFDIVYKPRVENKVVMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHSPV-GG
        +H D      + +   ++    + L KL+G++FDI+YKP VENK        +     +   + +VEG  +  +G++V+P +S  I L+L+ FH+ V G 
Subjt:  MHRDGSGAVLMQDGHPIAFFSHRLLTKLMGYDFDIVYKPRVENKVVMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHSPV-GG

Query:  HGGCLYILK
        H G L  +K
Subjt:  HGGCLYILK

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences

CDS sequence
ATGCATCGGGATGGGAGTGGGGCTGTCTTAATGCAAGATGGGCATCCTATTGCTTTCTTTAGCCATAGGCTTTTGACCAAGCTGATGGGGTATGACTTTGACATTGTATA
TAAGCCTCGTGTGGAGAACAAGGTTGTGATGCCTTGTCCAGAATTCCTAATGTTGTGGAGTTTGCAACCTTCCGGTTGGTTACTCGTTGAAGGGCTCGTTGTTGTATTAT
CGGGGAAAATTGTTCTCCCTGAGGACTCTCCTACTATTTCTCTTTTGTTAGAAGCTTTTCACTCTCCTGTTGGGGGCCATGGTGGGTGCTTGTACATATTAAAGACCGAG
ATCTGGTGTTCACCAGTTTGTTTTGGGAAGAATTATTCAAATGGCAAAGGCGTGGTTACCAACCTAAAATGGACGTACGACTCTGCTTCTCAAACCGATGGACAAACGGA
AGTGCGTCGAATAGAAATGGAAACCTATTTGAGATGTTTGCAATGCATTGTCCTAGCGAGTGGTAATGATGAGGAAGTTTGGATGGTCTTAAGAACAAACTTGCAAACTG
GGCTGGAAGCACCTATTGATAGAAGATCTACTTCTGGTGGAACCCTCGAGGATTTTAGGCATTCGAAGTCACCTGCTGATCCTTCAAAATTAGAGGTGTTGATCCCGTGG
ACTGATATGGAGATTTCTAAAGCTACATGGGAAGATGCAGTGTGGATTGTAAATCAGTTTCCTAAATTTCATCTTGAGGACAAGATGGTTTTTTGGGACGGTGTTGGGAA
GCAAGCTGAAGTTCCATCTAAGAAACTGACTAATTTAGAGCTTCAAATTAGATGTGAAAAGGGACTATTTGGGTTCGATTCCTTTCTTTCTACTCAGACCAACATTGGAT
CAATTCCTTCGATGCTTGCTCAGACCGAATCGACCCCATATCGAGAGACTAAACATCGAGGACTTTATTGCACCTTTATGTCTCCATCTCTTGAGTCGAGCTCAATCTTT
GATAGGTGGGCCAAGCCCTTACCTTCTGTTCTCCCTGGGTCGAGTCAGAGGCGAAGACTGGCAAAGTTCATCAGTGGTGTGGTGTTCATGGAAAAGAAGGCATCGCCAAT
CGAGTGGTGGATTTGGATGGAGTCTCGCTTTGATGCTGGGGAGATCCATAAATTTGTATAG

mRNA sequence
ATGCATCGGGATGGGAGTGGGGCTGTCTTAATGCAAGATGGGCATCCTATTGCTTTCTTTAGCCATAGGCTTTTGACCAAGCTGATGGGGTATGACTTTGACATTGTATA
TAAGCCTCGTGTGGAGAACAAGGTTGTGATGCCTTGTCCAGAATTCCTAATGTTGTGGAGTTTGCAACCTTCCGGTTGGTTACTCGTTGAAGGGCTCGTTGTTGTATTAT
CGGGGAAAATTGTTCTCCCTGAGGACTCTCCTACTATTTCTCTTTTGTTAGAAGCTTTTCACTCTCCTGTTGGGGGCCATGGTGGGTGCTTGTACATATTAAAGACCGAG
ATCTGGTGTTCACCAGTTTGTTTTGGGAAGAATTATTCAAATGGCAAAGGCGTGGTTACCAACCTAAAATGGACGTACGACTCTGCTTCTCAAACCGATGGACAAACGGA
AGTGCGTCGAATAGAAATGGAAACCTATTTGAGATGTTTGCAATGCATTGTCCTAGCGAGTGGTAATGATGAGGAAGTTTGGATGGTCTTAAGAACAAACTTGCAAACTG
GGCTGGAAGCACCTATTGATAGAAGATCTACTTCTGGTGGAACCCTCGAGGATTTTAGGCATTCGAAGTCACCTGCTGATCCTTCAAAATTAGAGGTGTTGATCCCGTGG
ACTGATATGGAGATTTCTAAAGCTACATGGGAAGATGCAGTGTGGATTGTAAATCAGTTTCCTAAATTTCATCTTGAGGACAAGATGGTTTTTTGGGACGGTGTTGGGAA
GCAAGCTGAAGTTCCATCTAAGAAACTGACTAATTTAGAGCTTCAAATTAGATGTGAAAAGGGACTATTTGGGTTCGATTCCTTTCTTTCTACTCAGACCAACATTGGAT
CAATTCCTTCGATGCTTGCTCAGACCGAATCGACCCCATATCGAGAGACTAAACATCGAGGACTTTATTGCACCTTTATGTCTCCATCTCTTGAGTCGAGCTCAATCTTT
GATAGGTGGGCCAAGCCCTTACCTTCTGTTCTCCCTGGGTCGAGTCAGAGGCGAAGACTGGCAAAGTTCATCAGTGGTGTGGTGTTCATGGAAAAGAAGGCATCGCCAAT
CGAGTGGTGGATTTGGATGGAGTCTCGCTTTGATGCTGGGGAGATCCATAAATTTGTATAG

Protein sequence
MHRDGSGAVLMQDGHPIAFFSHRLLTKLMGYDFDIVYKPRVENKVVMPCPEFLMLWSLQPSGWLLVEGLVVVLSGKIVLPEDSPTISLLLEAFHSPVGGHGGCLYILKTE
IWCSPVCFGKNYSNGKGVVTNLKWTYDSASQTDGQTEVRRIEMETYLRCLQCIVLASGNDEEVWMVLRTNLQTGLEAPIDRRSTSGGTLEDFRHSKSPADPSKLEVLIPW
TDMEISKATWEDAVWIVNQFPKFHLEDKMVFWDGVGKQAEVPSKKLTNLELQIRCEKGLFGFDSFLSTQTNIGSIPSMLAQTESTPYRETKHRGLYCTFMSPSLESSSIF
DRWAKPLPSVLPGSSQRRRLAKFISGVVFMEKKASPIEWWIWMESRFDAGEIHKFV
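
Consistency check (illustrative): the CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal example, not part of the database record; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here, and it assumes the standard nuclear codon table applies. Only the first 45 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Assumes an uppercase sequence over A/C/G/T in reading frame 1;
    trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nucleotides of the Lag0036668 CDS listed above.
cds_prefix = "ATGCATCGGGATGGGAGTGGGGCTGTCTTAATGCAAGATGGGCAT"
print(translate(cds_prefix))  # MHRDGSGAVLMQDGH
```

The printed peptide matches the first 15 residues of the protein sequence listed above. Passing the full CDS (with line breaks removed) to the same function would extend the comparison to the whole record.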