CuGenDBv2

Spg015759 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015759
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein FAR1-RELATED SEQUENCE 3-like
Genome location: scaffold10:20271409..20275147
RNA-Seq Expression: Spg015759
Synteny: Spg015759
Gene Ontology terms:
    GO:0006313 - transposition, DNA-mediated (biological process)
    GO:0003677 - DNA binding (molecular function)
    GO:0004803 - transposase activity (molecular function)
InterPro domains: IPR001207 - Transposase, mutator type
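
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, the span can be parsed and its length computed as below. The helper name `parse_location` is hypothetical, and the length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The record itself does not say this.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("scaffold10:20271409..20275147")
print(seqid, start, end, length)
```

Under that assumption the gene span is considerably longer than the 726-nucleotide CDS listed further down the record.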


Homology
GenBank top hits
KAA0051992.1 protein FAR1-RELATED SEQUENCE 3-like [Cucumis melo var. makuwa] | e value: 7.8e-17 | % identity: 34.07
Query:  IFALGCAIVDSENNLSWEWFFVQLKATIGVR--EDLVFVSDRHKSILKIIPKVFPTAFHG--------------VRIVHLLRNL--RWFFERKNDVDYQA
        IF L   +VDS+ + SW WF  QLK  IG R  E  V V +     L  I  V  T  +                 ++ +LR +  RWFFER+ND DYQ 
Subjt:  IFALGCAIVDSENNLSWEWFFVQLKATIGVR--EDLVFVSDRHKSILKIIPKVFPTAFHG--------------VRIVHLLRNL--RWFFERKNDVDYQA

Query:  TYLTKSAELELREMINNGHSMQ-------------------------------------------------------------NIKVMPPNVKRPVGRPK
        T  TK+    LRE I    SM+                                                             +I ++PPNVKR VGRPK
Subjt:  TYLTKSAELELREMINNGHSMQ-------------------------------------------------------------NIKVMPPNVKRPVGRPK

Query:  KVWIPSRMEFKRRVKCGCCGRAGHNR
        K  I SR+EFKRRVKCG CGR GHNR
Subjt:  KVWIPSRMEFKRRVKCGCCGRAGHNR

XP_008463110.1 PREDICTED: uncharacterized protein LOC103501336 [Cucumis melo] | e value: 1.0e-16 | % identity: 33.63
Query:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHK--SILKIIPKVFPTAF-------------------------------HGVRIVHLLRN
        IF L   +VDS+N+ SW WF  Q+K  IG R ++V VS+RHK  S   I  ++ P  F                                 + I  +L  
Subjt:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHK--SILKIIPKVFPTAF-------------------------------HGVRIVHLLRN

Query:  LR-----WFFERKNDVDYQATYLTKSAELELREMINNGHSMQ----------------------------------NIKVMPPNVKRP------VGRPKK
        LR     WFFER+N+VDYQ T  TK+ E  LR+ I    SM+                                  +IK++   +         VGRPKK
Subjt:  LR-----WFFERKNDVDYQATYLTKSAELELREMINNGHSMQ----------------------------------NIKVMPPNVKRP------VGRPKK

Query:  VWIPSRMEFKRRVKCGCCGRAGHNRK
        V IPSRM+FKRR+KCG  GR GHN K
Subjt:  VWIPSRMEFKRRVKCGCCGRAGHNRK

XP_022155207.1 uncharacterized protein LOC111022347 [Momordica charantia] | e value: 7.3e-23 | % identity: 30.6
Query:  GIHISYQKAWRAREAALNEIRG-----LLIFALGCAIVDSENNLS-----------WEWFFV-------------QLKATIGVREDLVFVSDRHKSILKI
        GI I+YQKAWR R++A+ EI+G       +    C ++  +N  S           + + F+              LK  IG R+DLV V DRHKSI+K 
Subjt:  GIHISYQKAWRAREAALNEIRG-----LLIFALGCAIVDSENNLS-----------WEWFFV-------------QLKATIGVREDLVFVSDRHKSILKI

Query:  IPKVFPTAFHGVRIVHLLRN--------------------------------------------------------LRWFFERKNDVDYQATYLTKSAEL
          KVF TA H +   +  R+                                                         RWF++R+NDVD+Q T  TKSAE 
Subjt:  IPKVFPTAFHGVRIVHLLRN--------------------------------------------------------LRWFFERKNDVDYQATYLTKSAEL

Query:  ELREMI-------------------------------NN-------------GH--------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGC
        +L E I                               NN             GH         ++ IK++ PNVKRP GRPKK+ IPS +EFK+RVKC  
Subjt:  ELREMI-------------------------------NN-------------GH--------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGC

Query:  CGRAGHNRKTCMLPLSQ
        CGR GHNRK+C   L+Q
Subjt:  CGRAGHNRKTCMLPLSQ

XP_022159005.1 uncharacterized protein LOC111025451 [Momordica charantia] | e value: 1.3e-16 | % identity: 38.46
Query:  ISEFGIHISYQKAWRAREAALNEIRGL----------------------------------------LIFALGCAIVDSENNLSWEWFFVQLKATIGVRE
        + E G  I+Y K WRA+E A+ EIRG                                          IF L   +VDSEN+ SW WFF  LK  IGVR+
Subjt:  ISEFGIHISYQKAWRAREAALNEIRGL----------------------------------------LIFALGCAIVDSENNLSWEWFFVQLKATIGVRE

Query:  DLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLRWFFERK
        +LV VS+RHKSI+K + KVF TAFH +   HL +NL+  ++ K
Subjt:  DLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLRWFFERK

XP_024029591.1 uncharacterized protein LOC112094003 [Morus notabilis] | e value: 1.9e-23 | % identity: 37.78
Query:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNL--------------------------------
        IF LG AIVDSE + SWEWFF +LK  IG RE+LV V DR  SILK + KVF  A HG  + HLLRNL                                
Subjt:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNL--------------------------------

Query:  ----------------------------------RWFFERKNDVDYQATYLTKSAELELREMINNGHSMQNIK---VMPPNVKRPVGRPKKVWIPSRME-
                                          RWF+ER+N VD   TYLTK A   LR+           K   V+P   KR  GRP+K  I S  E 
Subjt:  ----------------------------------RWFFERKNDVDYQATYLTKSAELELREMINNGHSMQNIK---VMPPNVKRPVGRPKKVWIPSRME-

Query:  FKRRVKCGCCGRAGHNRKTCMLPLS
         KR VKC  C + GHNR+TC  P S
Subjt:  FKRRVKCGCCGRAGHNRKTCMLPLS

TrEMBL top hits
A0A1S3CIV7 uncharacterized protein LOC103501336 | e value: 4.9e-17 | % identity: 33.63
Query:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHK--SILKIIPKVFPTAF-------------------------------HGVRIVHLLRN
        IF L   +VDS+N+ SW WF  Q+K  IG R ++V VS+RHK  S   I  ++ P  F                                 + I  +L  
Subjt:  IFALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHK--SILKIIPKVFPTAF-------------------------------HGVRIVHLLRN

Query:  LR-----WFFERKNDVDYQATYLTKSAELELREMINNGHSMQ----------------------------------NIKVMPPNVKRP------VGRPKK
        LR     WFFER+N+VDYQ T  TK+ E  LR+ I    SM+                                  +IK++   +         VGRPKK
Subjt:  LR-----WFFERKNDVDYQATYLTKSAELELREMINNGHSMQ----------------------------------NIKVMPPNVKRP------VGRPKK

Query:  VWIPSRMEFKRRVKCGCCGRAGHNRK
        V IPSRM+FKRR+KCG  GR GHN K
Subjt:  VWIPSRMEFKRRVKCGCCGRAGHNRK

A0A5D3C2C0 Protein FAR1-RELATED SEQUENCE 3-like | e value: 3.8e-17 | % identity: 34.07
Query:  IFALGCAIVDSENNLSWEWFFVQLKATIGVR--EDLVFVSDRHKSILKIIPKVFPTAFHG--------------VRIVHLLRNL--RWFFERKNDVDYQA
        IF L   +VDS+ + SW WF  QLK  IG R  E  V V +     L  I  V  T  +                 ++ +LR +  RWFFER+ND DYQ 
Subjt:  IFALGCAIVDSENNLSWEWFFVQLKATIGVR--EDLVFVSDRHKSILKIIPKVFPTAFHG--------------VRIVHLLRNL--RWFFERKNDVDYQA

Query:  TYLTKSAELELREMINNGHSMQ-------------------------------------------------------------NIKVMPPNVKRPVGRPK
        T  TK+    LRE I    SM+                                                             +I ++PPNVKR VGRPK
Subjt:  TYLTKSAELELREMINNGHSMQ-------------------------------------------------------------NIKVMPPNVKRPVGRPK

Query:  KVWIPSRMEFKRRVKCGCCGRAGHNR
        K  I SR+EFKRRVKCG CGR GHNR
Subjt:  KVWIPSRMEFKRRVKCGCCGRAGHNR

A0A6J1DNQ8 uncharacterized protein LOC111022347 | e value: 3.5e-23 | % identity: 30.6
Query:  GIHISYQKAWRAREAALNEIRG-----LLIFALGCAIVDSENNLS-----------WEWFFV-------------QLKATIGVREDLVFVSDRHKSILKI
        GI I+YQKAWR R++A+ EI+G       +    C ++  +N  S           + + F+              LK  IG R+DLV V DRHKSI+K 
Subjt:  GIHISYQKAWRAREAALNEIRG-----LLIFALGCAIVDSENNLS-----------WEWFFV-------------QLKATIGVREDLVFVSDRHKSILKI

Query:  IPKVFPTAFHGVRIVHLLRN--------------------------------------------------------LRWFFERKNDVDYQATYLTKSAEL
          KVF TA H +   +  R+                                                         RWF++R+NDVD+Q T  TKSAE 
Subjt:  IPKVFPTAFHGVRIVHLLRN--------------------------------------------------------LRWFFERKNDVDYQATYLTKSAEL

Query:  ELREMI-------------------------------NN-------------GH--------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGC
        +L E I                               NN             GH         ++ IK++ PNVKRP GRPKK+ IPS +EFK+RVKC  
Subjt:  ELREMI-------------------------------NN-------------GH--------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGC

Query:  CGRAGHNRKTCMLPLSQ
        CGR GHNRK+C   L+Q
Subjt:  CGRAGHNRKTCMLPLSQ

A0A6J1DXF3 uncharacterized protein LOC111025451 | e value: 6.5e-17 | % identity: 38.46
Query:  ISEFGIHISYQKAWRAREAALNEIRGL----------------------------------------LIFALGCAIVDSENNLSWEWFFVQLKATIGVRE
        + E G  I+Y K WRA+E A+ EIRG                                          IF L   +VDSEN+ SW WFF  LK  IGVR+
Subjt:  ISEFGIHISYQKAWRAREAALNEIRGL----------------------------------------LIFALGCAIVDSENNLSWEWFFVQLKATIGVRE

Query:  DLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLRWFFERK
        +LV VS+RHKSI+K + KVF TAFH +   HL +NL+  ++ K
Subjt:  DLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLRWFFERK

A0A7J7FWV1 MULE domain-containing protein | e value: 1.6e-15 | % identity: 31.52
Query:  FALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLR------------------WFFERKNDVDYQAT
        F L  AIV+ E   SWEWFF+ L   +     + F+SDR+  +L+ +PKV PTA+H   + HL  NLR                  W  + ++      T
Subjt:  FALGCAIVDSENNLSWEWFFVQLKATIGVREDLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLR------------------WFFERKNDVDYQAT

Query:  YLTKSAELELREMINNGH-----------------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGCCGRAGHNRKTC
         +  S  L++ ++++                    +M ++ V+PP  K+P GR +K  IP   +  RRV+CG C + GHNRKTC
Subjt:  YLTKSAELELREMINNGH-----------------SMQNIKVMPPNVKRPVGRPKKVWIPSRMEFKRRVKCGCCGRAGHNRKTC

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
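
In the alignment blocks above, the unlabelled middle line marks identical residues with a letter, conservative substitutions with `+`, and everything else with a space. The sketch below counts identities for a single block from that middle line. The function name `block_identity` is hypothetical, and the result is per block only; it is not claimed to reproduce the whole-alignment % identity reported for each hit.

```python
def block_identity(query, midline):
    """Return (identical_columns, total_columns) for one alignment block.

    `query` is the text after 'Query:' and `midline` is the text of the
    unlabelled line beneath it, both with the leading label or padding
    removed. A letter in the midline is counted as an identity; '+' and
    spaces are not. Gap columns in the query count towards the total.
    """
    columns = len(query)
    padded = midline.ljust(columns)[:columns]  # trailing spaces may be lost
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, columns

# Final block of the first GenBank hit (KAA0051992.1) in this record.
query = "KVWIPSRMEFKRRVKCGCCGRAGHNR"
midline = "K  I SR+EFKRRVKCG CGR GHNR"
identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical columns in this block")
```

Summing the two counts over every block of a hit before dividing would give a whole-alignment figure.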

Sequences
CDS sequence
ATGGAGCAGCCAGAAATCCCGAGAGGACAGCGTCTCGATGCTGCCCTCAAGCGTCTCGACGCTCTGCAGCGCAATTCCCAGCGAGCAGTCCCAAGGGCACAACGT
TCCGACGTTATTCCTGTACGCATGCTATTTGGTGGCAATTGGAATGAGAGGAACAATTATTTAGGAATTTCAGAATTTGGCATCCACATTAGTTACCAAAAAGCA
TGGCGTGCTCGTGAAGCAGCTCTTAATGAAATTAGAGGATTACTTATATTCGCATTGGGTTGTGCAATAGTTGATTCAGAGAACAACTTATCATGGGAATGGTTT
TTTGTTCAACTAAAGGCGACCATTGGTGTGCGAGAAGATCTTGTTTTCGTGTCTGATAGACACAAGAGTATATTGAAAATCATTCCCAAGGTATTTCCTACTGCT
TTTCATGGCGTTCGTATTGTTCACTTGTTGAGGAACTTGAGGTGGTTTTTTGAACGTAAGAACGATGTTGACTATCAAGCTACCTATCTGACAAAGTCCGCTGAA
TTAGAATTGCGTGAAATGATCAACAATGGACACTCAATGCAGAACATAAAAGTAATGCCCCCAAATGTCAAACGTCCAGTTGGTAGACCCAAGAAGGTATGGATT
CCCTCAAGAATGGAGTTTAAAAGGAGGGTAAAATGTGGTTGTTGTGGAAGAGCAGGTCACAATAGGAAGACTTGCATGTTACCCCTTAGCCAGTAG
mRNA sequence
ATGGAGCAGCCAGAAATCCCGAGAGGACAGCGTCTCGATGCTGCCCTCAAGCGTCTCGACGCTCTGCAGCGCAATTCCCAGCGAGCAGTCCCAAGGGCACAACGT
TCCGACGTTATTCCTGTACGCATGCTATTTGGTGGCAATTGGAATGAGAGGAACAATTATTTAGGAATTTCAGAATTTGGCATCCACATTAGTTACCAAAAAGCA
TGGCGTGCTCGTGAAGCAGCTCTTAATGAAATTAGAGGATTACTTATATTCGCATTGGGTTGTGCAATAGTTGATTCAGAGAACAACTTATCATGGGAATGGTTT
TTTGTTCAACTAAAGGCGACCATTGGTGTGCGAGAAGATCTTGTTTTCGTGTCTGATAGACACAAGAGTATATTGAAAATCATTCCCAAGGTATTTCCTACTGCT
TTTCATGGCGTTCGTATTGTTCACTTGTTGAGGAACTTGAGGTGGTTTTTTGAACGTAAGAACGATGTTGACTATCAAGCTACCTATCTGACAAAGTCCGCTGAA
TTAGAATTGCGTGAAATGATCAACAATGGACACTCAATGCAGAACATAAAAGTAATGCCCCCAAATGTCAAACGTCCAGTTGGTAGACCCAAGAAGGTATGGATT
CCCTCAAGAATGGAGTTTAAAAGGAGGGTAAAATGTGGTTGTTGTGGAAGAGCAGGTCACAATAGGAAGACTTGCATGTTACCCCTTAGCCAGTAG
Protein sequence
MEQPEIPRGQRLDAALKRLDALQRNSQRAVPRAQRSDVIPVRMLFGGNWNERNNYLGISEFGIHISYQKAWRAREAALNEIRGLLIFALGCAIVDSENNLSWEWF
FVQLKATIGVREDLVFVSDRHKSILKIIPKVFPTAFHGVRIVHLLRNLRWFFERKNDVDYQATYLTKSAELELREMINNGHSMQNIKVMPPNVKRPVGRPKKVWI
PSRMEFKRRVKCGCCGRAGHNRKTCMLPLSQ
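
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (the page does not name a translation table); `translate` is a hypothetical helper, not a CuGenDB function. The usage example takes the final CDS line of this record, which starts on a codon boundary because the preceding lines total 630 nucleotides.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Last line of the CDS sequence listed in this record.
last_cds_line = ("CCCTCAAGAATGGAGTTTAAAAGGAGGGTAAAATGTGGTTGTTGTGGAAGAGCAGGTCAC"
                 "AATAGGAAGACTTGCATGTTACCCCTTAGCCAGTAG")
print(translate(last_cds_line))
```

The printed peptide should match the last line of the protein sequence above; passing the full CDS to `translate` extends the same check to the whole protein.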