; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0018786 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0018786
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr01:17677875..17679517
RNA-Seq ExpressionPI0018786
SyntenyPI0018786
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG59671.1 hypothetical protein EZV62_014244 [Acer yangbiense]8.0e-1931.34Show/hide
Query:  SPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV----------------TSSKW
        +P++    +  +S+T   + +G    +KLD  NY+LW            L+ ++ GT+P P EFI    E T +G T                      W
Subjt:  SPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV----------------TSSKW

Query:  LMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDL--KGEPVTLSYLMSCAISGLEVEYLPIVCTI
        L GSMT S+A  ++   TS  + TALENLY + S++++N  R  +Q  +KG   M +YL  +K  ++ L   G+P     L+S  +SGL+ +Y+PIV  I
Subjt:  LMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDL--KGEPVTLSYLMSCAISGLEVEYLPIVCTI

Query:  EGKENSTWYDVHSPLLT
        E +E+ +W ++   LL+
Subjt:  EGKENSTWYDVHSPLLT

TXG72772.1 hypothetical protein EZV62_001351 [Acer yangbiense]1.4e-1832.14Show/hide
Query:  TPQPQVVSPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV--------------
        T QP VVS +    N+           +G    +KLD  NY+LW            L+ ++ GT+P P EFI    E   +G T                
Subjt:  TPQPQVVSPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV--------------

Query:  --TSSKWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEY
              WL GSMT S+A  ++   TS  + TALENLY + S++++N  R  +Q T+KG   M +YL  +K  ++ L   G+P     L+S  +SGL+ +Y
Subjt:  --TSSKWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEY

Query:  LPIVCTIEGKENSTWYDVHSPLLT
        +PIV  IE +E+ +W ++   LL+
Subjt:  LPIVCTIEGKENSTWYDVHSPLLT

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia]1.0e-3451.11Show/hide
Query:  HLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDET---------TREGGTKVTSS--KWLMGSMTSSIAHDIIDFKTSREVS
        H L T LTVKLDDKNY+LW            ++ +VL TK  P+++ +   +T           E  + V  +   WL GSMT SIA D+++ +TS EV 
Subjt:  HLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDET---------TREGGTKVTSS--KWLMGSMTSSIAHDIIDFKTSREVS

Query:  TALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEYLPIVCTIEGK
        TALE L+ STS+AR+NQ RN +QNTKKG  +M  YL+F+KQTSEDLK  GEPVTLSYL SC ++G E EYLPI+CTIE K
Subjt:  TALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEYLPIVCTIEGK

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]2.5e-4448.26Show/hide
Query:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-
        M  E  ENS+  PQVV+ V +   N SP   TS  H LGT LTVKLDDKNYSLW             + +VLGT  KP +F+   +        +V    
Subjt:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-

Query:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS
                    WL GSMT SIA D++DF++SREV  ALE+LY +TS+AR+NQ RNV+QNTKK   +M +YL  +KQ SE LK  GEPV  +YLMSC +S
Subjt:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS

Query:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT
        GLE EYLPIVC IEGK++++W ++ + L+T
Subjt:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]2.5e-4448.26Show/hide
Query:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-
        M  E  ENS+  PQVV+ V +   N SP   TS  H LGT LTVKLDDKNYSLW             + +VLGT  KP +F+   +        +V    
Subjt:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-

Query:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS
                    WL GSMT SIA D++DF++SREV  ALE+LY +TS+AR+NQ RNV+QNTKK   +M +YL  +KQ SE LK  GEPV  +YLMSC +S
Subjt:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS

Query:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT
        GLE EYLPIVC IEGK++++W ++ + L+T
Subjt:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT

TrEMBL top hitse value%identityAlignment
A0A5C7HRI6 Uncharacterized protein3.9e-1931.34Show/hide
Query:  SPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV----------------TSSKW
        +P++    +  +S+T   + +G    +KLD  NY+LW            L+ ++ GT+P P EFI    E T +G T                      W
Subjt:  SPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV----------------TSSKW

Query:  LMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDL--KGEPVTLSYLMSCAISGLEVEYLPIVCTI
        L GSMT S+A  ++   TS  + TALENLY + S++++N  R  +Q  +KG   M +YL  +K  ++ L   G+P     L+S  +SGL+ +Y+PIV  I
Subjt:  LMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDL--KGEPVTLSYLMSCAISGLEVEYLPIVCTI

Query:  EGKENSTWYDVHSPLLT
        E +E+ +W ++   LL+
Subjt:  EGKENSTWYDVHSPLLT

A0A5C7IW79 Chitin-binding type-1 domain-containing protein6.6e-1932.14Show/hide
Query:  TPQPQVVSPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV--------------
        T QP VVS +    N+           +G    +KLD  NY+LW            L+ ++ GT+P P EFI    E   +G T                
Subjt:  TPQPQVVSPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKV--------------

Query:  --TSSKWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEY
              WL GSMT S+A  ++   TS  + TALENLY + S++++N  R  +Q T+KG   M +YL  +K  ++ L   G+P     L+S  +SGL+ +Y
Subjt:  --TSSKWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEY

Query:  LPIVCTIEGKENSTWYDVHSPLLT
        +PIV  IE +E+ +W ++   LL+
Subjt:  LPIVCTIEGKENSTWYDVHSPLLT

A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like5.0e-3551.11Show/hide
Query:  HLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDET---------TREGGTKVTSS--KWLMGSMTSSIAHDIIDFKTSREVS
        H L T LTVKLDDKNY+LW            ++ +VL TK  P+++ +   +T           E  + V  +   WL GSMT SIA D+++ +TS EV 
Subjt:  HLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDET---------TREGGTKVTSS--KWLMGSMTSSIAHDIIDFKTSREVS

Query:  TALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEYLPIVCTIEGK
        TALE L+ STS+AR+NQ RN +QNTKKG  +M  YL+F+KQTSEDLK  GEPVTLSYL SC ++G E EYLPI+CTIE K
Subjt:  TALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAISGLEVEYLPIVCTIEGK

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X21.2e-4448.26Show/hide
Query:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-
        M  E  ENS+  PQVV+ V +   N SP   TS  H LGT LTVKLDDKNYSLW             + +VLGT  KP +F+   +        +V    
Subjt:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-

Query:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS
                    WL GSMT SIA D++DF++SREV  ALE+LY +TS+AR+NQ RNV+QNTKK   +M +YL  +KQ SE LK  GEPV  +YLMSC +S
Subjt:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS

Query:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT
        GLE EYLPIVC IEGK++++W ++ + L+T
Subjt:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X11.2e-4448.26Show/hide
Query:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-
        M  E  ENS+  PQVV+ V +   N SP   TS  H LGT LTVKLDDKNYSLW             + +VLGT  KP +F+   +        +V    
Subjt:  MDFESVENSTPQPQVVSPV-LANANASPASLTSKTHLLGTGLTVKLDDKNYSLW------------LESFVLGTKPKPAEFISFFDETTREGGTKVTSS-

Query:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS
                    WL GSMT SIA D++DF++SREV  ALE+LY +TS+AR+NQ RNV+QNTKK   +M +YL  +KQ SE LK  GEPV  +YLMSC +S
Subjt:  -----------KWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLK--GEPVTLSYLMSCAIS

Query:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT
        GLE EYLPIVC IEGK++++W ++ + L+T
Subjt:  GLEVEYLPIVCTIEGKENSTWYDVHSPLLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.3e-0628.48Show/hide
Query:  LTVKLDDKNYSLWLESF--------VLG-----TKPKPAEFISFFDETTREGGTKVTSSKWLMGSMTSSIAHDIIDFK-TSREVSTALENLYRSTSRARV
        +T+ L+  NY +W E F        VLG     + P P     + +   R+G  K+    W+ G++T S+   II    T+R++  +LENL+R    AR 
Subjt:  LTVKLDDKNYSLWLESF--------VLG-----TKPKPAEFISFFDETTREGGTKVTSSKWLMGSMTSSIAHDIIDFK-TSREVSTALENLYRSTSRARV

Query:  NQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLKG--EPVTLSYLMSCAISGLEVEYLPIVCTIEGK
         QF N ++ T   D  + +Y   +K  S+ L     P++   L+   ++GL  +Y  I+  I+ K
Subjt:  NQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLKG--EPVTLSYLMSCAISGLEVEYLPIVCTIEGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAACATTCTGTCGTATACAAGAATCTTCAATCTTTTTATTTAATATATAATAAGAAAACAATTTCTCACATTATGGATATGGATTTTGAAAGTGTAGAAAATTC
TACTCCACAACCACAGGTTGTTTCCCCCGTTTTAGCCAATGCAAATGCATCCCCTGCCTCGCTAACTTCAAAAACCCATCTTTTGGGAACTGGTCTCACAGTCAAATTGG
ATGACAAAAATTATTCTCTATGGCTTGAAAGTTTTGTTCTTGGAACAAAACCTAAGCCTGCTGAATTCATCTCTTTCTTTGATGAGACAACGAGGGAAGGTGGAACAAAG
GTTACTTCATCCAAGTGGTTAATGGGATCAATGACATCATCAATTGCCCATGATATCATTGACTTCAAAACTTCTCGAGAGGTTTCAACTGCATTAGAGAACTTGTATAG
ATCCACCAGCAGAGCACGTGTTAATCAATTCAGAAATGTCATGCAAAACACAAAGAAAGGAGACTCGAGAATGGGTGACTATCTGAGCTTTATAAAACAAACTTCAGAAG
ATTTAAAAGGTGAACCAGTAACATTAAGTTATTTAATGTCTTGCGCAATATCAGGATTAGAAGTTGAATACCTACCTATAGTTTGTACTATTGAAGGAAAAGAAAATTCA
ACTTGGTATGATGTGCATTCACCACTTCTTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAACATTCTGTCGTATACAAGAATCTTCAATCTTTTTATTTAATATATAATAAGAAAACAATTTCTCACATTATGGATATGGATTTTGAAAGTGTAGAAAATTC
TACTCCACAACCACAGGTTGTTTCCCCCGTTTTAGCCAATGCAAATGCATCCCCTGCCTCGCTAACTTCAAAAACCCATCTTTTGGGAACTGGTCTCACAGTCAAATTGG
ATGACAAAAATTATTCTCTATGGCTTGAAAGTTTTGTTCTTGGAACAAAACCTAAGCCTGCTGAATTCATCTCTTTCTTTGATGAGACAACGAGGGAAGGTGGAACAAAG
GTTACTTCATCCAAGTGGTTAATGGGATCAATGACATCATCAATTGCCCATGATATCATTGACTTCAAAACTTCTCGAGAGGTTTCAACTGCATTAGAGAACTTGTATAG
ATCCACCAGCAGAGCACGTGTTAATCAATTCAGAAATGTCATGCAAAACACAAAGAAAGGAGACTCGAGAATGGGTGACTATCTGAGCTTTATAAAACAAACTTCAGAAG
ATTTAAAAGGTGAACCAGTAACATTAAGTTATTTAATGTCTTGCGCAATATCAGGATTAGAAGTTGAATACCTACCTATAGTTTGTACTATTGAAGGAAAAGAAAATTCA
ACTTGGTATGATGTGCATTCACCACTTCTTACTTAA
Protein sequenceShow/hide protein sequence
MDEHSVVYKNLQSFYLIYNKKTISHIMDMDFESVENSTPQPQVVSPVLANANASPASLTSKTHLLGTGLTVKLDDKNYSLWLESFVLGTKPKPAEFISFFDETTREGGTK
VTSSKWLMGSMTSSIAHDIIDFKTSREVSTALENLYRSTSRARVNQFRNVMQNTKKGDSRMGDYLSFIKQTSEDLKGEPVTLSYLMSCAISGLEVEYLPIVCTIEGKENS
TWYDVHSPLLT