; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000814 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000814
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionUBN2 domain-containing protein
Genome locationchr05:1102497..1104321
RNA-Seq ExpressionPay0000814
SyntenyPay0000814
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025314 - Domain of unknown function DUF4219


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046865.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]9.3e-11868.78Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL
        MGSNGN M T QPLI IFKGEGYEFWSI MKTLL+SQDLWDLVEQGY DPDD+GKLREN+KKDSKALVIIQQAVHDS+FSRI  ATT             
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL

Query:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF
            RDFETLMMKNGESIA+FLSRAT I               I+EKVL  LTPKFDHVV  IEESKNLSTFTFIELM SLEAHESRINRSMERNEEKAF
Subjt:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF

Query:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG
        Q  D   K    +     +     G  +N+      +       +         H+     LKPVFKELNEGEKLKVEL NGKELQVEGKGTVGIETHHG
Subjt:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG

Query:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL
        NRI TNVQYVPDIGYNLLSVGQLME+G SILFDDE+KSETFEKFKHFKAKVE QSGMFIKSL
Subjt:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL

KAA0055915.1 copia protein [Cucumis melo var. makuwa]1.6e-13063Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ
        M +NGN M T QPLI IFKGEGYEFWSIRMKTLL SQDLWDLVEQGY DPDDEGKL+ENR+KD KALVI+QQAVHD++FSRIA ATT          AFQ
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR
        GDSRVLVVKLQSL+RDFETLMMKNGESIA+FLSRATTI               I+EKVLR LTPKFDHVVA IEESK+LSTFTFIELM SL+AHESRIN 
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR

Query:  SMERNEEKAFQVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------
        SME+N+EKAF+VKDVVPKYNDSD           YR R RGTGKGCNQNEEQRQF VQSSNKANIQCY CKKFGHVK DC                    
Subjt:  SMERNEEKAFQVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------

Query:  ---------------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQ-----------YVPDI-GYN
                                         LKPVFKELNEGEKLKVELGN KELQVEGK  +GIETH+GNRI TNVQ            V ++  + 
Subjt:  ---------------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQ-----------YVPDI-GYN

Query:  LLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL
        L +         S L+   ++SETFEKFKHFKAKVE QSGMFIKSL
Subjt:  LLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL

KAE8650579.1 hypothetical protein Csa_010963 [Cucumis sativus]2.6e-13673.26Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ
        M +NGNAM T QPLI IFKGEGYEFWS+RMKTLL+SQDLWDLVE  YADPDDEGKLRE R+KDSKALVIIQQAVHDS FSRI   TT          AF+
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR
        GD RVLVVKLQSLR++FETLMMKN ESIANFLSRATTI               I+EKVLR LT KFD VVA IEESK+LSTFTFIELM SL+AHESRINR
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR

Query:  SMERNEEKAFQVKDVVPKYNDSDR----------YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------
        SMERNEEKAFQVKDVVPKYN+SDR          YRG+ RGT KGC QNEE+ QFRVQSSNKANIQCY  KKFGHVK DCC                   
Subjt:  SMERNEEKAFQVKDVVPKYNDSDR----------YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------

Query:  -LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDD
         LKP+F ELNEGEKLKVELGN KELQVE KGTVGIETHHGNRI TNVQYVPDIGYNLLSVGQLME+G+SILFDD
Subjt:  -LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDD

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.3e-12765.66Show/hide
Query:  MKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQGDSRVLVVKLQSLRRDFETLMMKNGESIA
        +KTLL+SQDLWDLVEQGY DPDDEGKLRENRKKDSKALVIIQQAVHDS+FSRIA ATT          AFQGDSRVL+VKLQSLRRDFETLMMKNGESIA
Subjt:  MKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQGDSRVLVVKLQSLRRDFETLMMKNGESIA

Query:  NFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAFQVKDVVPKYNDSDR-----
        +FLSRATTI               I+EKVLR LTPKFDHVVA IEESKNL TFTFIELM SLEAHESRINRSMERNEEKAFQVKD VPKYNDSDR     
Subjt:  NFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAFQVKDVVPKYNDSDR-----

Query:  -----YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDC----------------------------------------------C--
             YRGR  GT KGCN+NE QRQF VQSSNKANIQCY CKKFGHVK DC                                              C  
Subjt:  -----YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDC----------------------------------------------C--

Query:  ----LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVE-NQSG
            LKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRI TNVQYVPDIGYNLLSVGQLME+GYSILFDD       ++     AKV+  QS 
Subjt:  ----LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVE-NQSG

Query:  MFIKSLPVIEVENFCPTTLTIFARNTTSIES
        MF   L V  VE+F    LT  A NTT   S
Subjt:  MFIKSLPVIEVENFCPTTLTIFARNTTSIES

TYK28117.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.9e-13163.56Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL
        M +NGN M TAQPLI IFKGEGYEFWSIRMKTLL SQDLWDLVEQGY DPDDEGKL+ENR+KDSKALVIIQQAVHD++FSRIA ATT             
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL

Query:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF
            RDFETLMMKNGESIA+FLSRATTI               I+EKVLR LTPKFDHVV  IEESK+LSTFTFIELM SL+AHESRIN SME+NEEKAF
Subjt:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF

Query:  QVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-----------------------------
        +VKDVVPKYNDSD           YR R RGTGKGCNQNEEQRQF VQSSNKANIQCY CKKFGHVK DC                              
Subjt:  QVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-----------------------------

Query:  -----------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKS
                               LKPVFKELNEGEKLKVELGNGKELQVEGK T+GIETH+GNRI TNVQYVPDIGYNLLSVGQLME+G+SILFDD    
Subjt:  -----------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKS

Query:  ETFEKFKHFKAKVE-NQSGMFIKSLPVIEVENFCPTTLTIFARNTTSIES
           ++     AKV+  QS MF   L V  VENF    LT  A NTT   S
Subjt:  ETFEKFKHFKAKVE-NQSGMFIKSLPVIEVENFCPTTLTIFARNTTSIES

TrEMBL top hitse value%identityAlignment
A0A5A7TZP7 Putative gag-pol polyprotein, identical4.5e-11868.78Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL
        MGSNGN M T QPLI IFKGEGYEFWSI MKTLL+SQDLWDLVEQGY DPDD+GKLREN+KKDSKALVIIQQAVHDS+FSRI  ATT             
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL

Query:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF
            RDFETLMMKNGESIA+FLSRAT I               I+EKVL  LTPKFDHVV  IEESKNLSTFTFIELM SLEAHESRINRSMERNEEKAF
Subjt:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF

Query:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG
        Q  D   K    +     +     G  +N+      +       +         H+     LKPVFKELNEGEKLKVEL NGKELQVEGKGTVGIETHHG
Subjt:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG

Query:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL
        NRI TNVQYVPDIGYNLLSVGQLME+G SILFDDE+KSETFEKFKHFKAKVE QSGMFIKSL
Subjt:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL

A0A5A7UQM0 Copia protein7.9e-13163Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ
        M +NGN M T QPLI IFKGEGYEFWSIRMKTLL SQDLWDLVEQGY DPDDEGKL+ENR+KD KALVI+QQAVHD++FSRIA ATT          AFQ
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR
        GDSRVLVVKLQSL+RDFETLMMKNGESIA+FLSRATTI               I+EKVLR LTPKFDHVVA IEESK+LSTFTFIELM SL+AHESRIN 
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINR

Query:  SMERNEEKAFQVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------
        SME+N+EKAF+VKDVVPKYNDSD           YR R RGTGKGCNQNEEQRQF VQSSNKANIQCY CKKFGHVK DC                    
Subjt:  SMERNEEKAFQVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-------------------

Query:  ---------------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQ-----------YVPDI-GYN
                                         LKPVFKELNEGEKLKVELGN KELQVEGK  +GIETH+GNRI TNVQ            V ++  + 
Subjt:  ---------------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQ-----------YVPDI-GYN

Query:  LLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL
        L +         S L+   ++SETFEKFKHFKAKVE QSGMFIKSL
Subjt:  LLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVENQSGMFIKSL

A0A5D3BU79 Putative gag-pol polyprotein, identical2.1e-11566.58Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL
        MGSNGN M T QPLI IFKGEGYEFWSI MKTLL+SQDLWDLVEQGY DPDD+GKLREN+KKDSKALVIIQQAVHDS+FSRI  ATT             
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL

Query:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF
            RDFETLMMKNGESIA+FLSRAT I               I+EKVL  LTPKFDHVV  IEESKNLSTFTFIELM SLEAHESRINRSMERNEEKAF
Subjt:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF

Query:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG
        Q  D   K    +     +     G  +N+      +       +         H+     LKPVFKELNEGEKLKVEL NGKELQVEGKGTVGIETHHG
Subjt:  QVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHG

Query:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDD------------ETKSETFEKFKHFKAKVENQSGMFIKSL
        NRI TNVQYVPDIGYNLLSVGQLME+G SILFDD            E+KSETFEKFKHFKAKVE QSGMFIKSL
Subjt:  NRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDD------------ETKSETFEKFKHFKAKVENQSGMFIKSL

A0A5D3DWC7 Putative gag-pol polyprotein, identical9.3e-13263.56Show/hide
Query:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL
        M +NGN M TAQPLI IFKGEGYEFWSIRMKTLL SQDLWDLVEQGY DPDDEGKL+ENR+KDSKALVIIQQAVHD++FSRIA ATT             
Subjt:  MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKL

Query:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF
            RDFETLMMKNGESIA+FLSRATTI               I+EKVLR LTPKFDHVV  IEESK+LSTFTFIELM SL+AHESRIN SME+NEEKAF
Subjt:  QSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAF

Query:  QVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-----------------------------
        +VKDVVPKYNDSD           YR R RGTGKGCNQNEEQRQF VQSSNKANIQCY CKKFGHVK DC                              
Subjt:  QVKDVVPKYNDSD----------RYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCC-----------------------------

Query:  -----------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKS
                               LKPVFKELNEGEKLKVELGNGKELQVEGK T+GIETH+GNRI TNVQYVPDIGYNLLSVGQLME+G+SILFDD    
Subjt:  -----------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKS

Query:  ETFEKFKHFKAKVE-NQSGMFIKSLPVIEVENFCPTTLTIFARNTTSIES
           ++     AKV+  QS MF   L V  VENF    LT  A NTT   S
Subjt:  ETFEKFKHFKAKVE-NQSGMFIKSLPVIEVENFCPTTLTIFARNTTSIES

A0A5D3DWP2 Putative gag-pol polyprotein, identical6.3e-12865.66Show/hide
Query:  MKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQGDSRVLVVKLQSLRRDFETLMMKNGESIA
        +KTLL+SQDLWDLVEQGY DPDDEGKLRENRKKDSKALVIIQQAVHDS+FSRIA ATT          AFQGDSRVL+VKLQSLRRDFETLMMKNGESIA
Subjt:  MKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATT----------AFQGDSRVLVVKLQSLRRDFETLMMKNGESIA

Query:  NFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAFQVKDVVPKYNDSDR-----
        +FLSRATTI               I+EKVLR LTPKFDHVVA IEESKNL TFTFIELM SLEAHESRINRSMERNEEKAFQVKD VPKYNDSDR     
Subjt:  NFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAFQVKDVVPKYNDSDR-----

Query:  -----YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDC----------------------------------------------C--
             YRGR  GT KGCN+NE QRQF VQSSNKANIQCY CKKFGHVK DC                                              C  
Subjt:  -----YRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDC----------------------------------------------C--

Query:  ----LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVE-NQSG
            LKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRI TNVQYVPDIGYNLLSVGQLME+GYSILFDD       ++     AKV+  QS 
Subjt:  ----LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFKHFKAKVE-NQSG

Query:  MFIKSLPVIEVENFCPTTLTIFARNTTSIES
        MF   L V  VE+F    LT  A NTT   S
Subjt:  MFIKSLPVIEVENFCPTTLTIFARNTTSIES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein7.3e-1235.29Show/hide
Query:  ISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGK--------LRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQG
        + +     Y+ WS+RMK +L + D+W++VE+G+ +P++EG         LR++RK+D KAL +I Q + +  F ++  AT+A  G
Subjt:  ISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGK--------LRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQG

AT3G21000.1 Gag-Pol-related retrotransposon family protein1.8e-1024.66Show/hide
Query:  YEFWSIRMKTLLKSQDLWDLVEQGY-------------ADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTA-------FQGDSRVLV-----
        YE W+   K+ L  Q LWD+V  G                P++  K R+   KD+KAL I+Q ++ DS+F +   A++A        +G+ +  +     
Subjt:  YEFWSIRMKTLLKSQDLWDLVEQGY-------------ADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTA-------FQGDSRVLV-----

Query:  VKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESL--EAHESRINRSMERN
        V ++ L +  E L M + ES +++L +A  I               I + V   L+  FD + + +EE  ++   T   L+E      HES         
Subjt:  VKLQSLRRDFETLMMKNGESIANFLSRATTI---------------IMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESL--EAHESRINRSMERN

Query:  EEKAFQVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPV-----------------------------F
        EE  F +   +   + S+++ G         N N+E  +FR+ +  +        +K   + VD  L+ V                             F
Subjt:  EEKAFQVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQFRVQSSNKANIQCYPCKKFGHVKVDCCLKPV-----------------------------F

Query:  KELNEGEKLKVELGNGKELQVEGKGTVGIETHHG-NRIFTNVQYVPDIGYNLLSVGQLMENGYSI
          L+   K  V   +G  L VEGKG V I    G  +   NV +VP +  N+LS G+++   YSI
Subjt:  KELNEGEKLKVELGNGKELQVEGKGTVGIETHHG-NRIFTNVQYVPDIGYNLLSVGQLMENGYSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGCAATGGCAATGCTATGAGTACAGCACAACCACTCATTTCAATATTCAAAGGAGAAGGCTACGAGTTTTGGAGTATTCGTATGAAGACTCTTCTAAAATCTCA
AGACTTATGGGACTTAGTAGAACAAGGCTATGCAGATCCTGACGACGAAGGCAAGTTACGGGAGAACAGGAAGAAAGACTCGAAGGCGTTGGTGATTATTCAACAAGCAG
TCCATGACAGTATTTTTTCGCGGATTGCCGTAGCAACAACCGCATTTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGCAATCCCTTAGACGAGACTTTGAGACCTTG
ATGATGAAAAATGGAGAATCAATTGCTAATTTTTTGTCACGGGCAACGACAATTATTATGGAGAAAGTATTGAGAATTTTGACTCCAAAGTTTGATCATGTTGTGGCTAC
GATAGAAGAATCAAAGAATCTGTCCACTTTCACATTTATTGAATTAATGGAATCTCTTGAAGCACATGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAG
CGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATGACAGTGATCGATATCGTGGTCGAGATCGTGGTACCGGAAAAGGATGTAACCAAAATGAAGAACAAAGGCAGTTC
AGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCCTTGCAAGAAGTTTGGTCATGTAAAGGTCGACTGTTGTTTAAAGCCTGTATTCAAGGAGCTTAACGAAGG
AGAAAAGTTGAAGGTAGAGCTCGGAAATGGCAAGGAGCTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTTTCACAAATGTTCAGT
ATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAACTAATGGAGAATGGGTATTCTATCTTGTTTGATGATGAAACCAAATCAGAAACATTTGAGAAGTTCAAG
CATTTCAAGGCAAAGGTAGAAAACCAGAGTGGCATGTTTATCAAATCTCTTCCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAACAC
GACATCCATAGAGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCAGCAATGGCAATGCTATGAGTACAGCACAACCACTCATTTCAATATTCAAAGGAGAAGGCTACGAGTTTTGGAGTATTCGTATGAAGACTCTTCTAAAATCTCA
AGACTTATGGGACTTAGTAGAACAAGGCTATGCAGATCCTGACGACGAAGGCAAGTTACGGGAGAACAGGAAGAAAGACTCGAAGGCGTTGGTGATTATTCAACAAGCAG
TCCATGACAGTATTTTTTCGCGGATTGCCGTAGCAACAACCGCATTTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGCAATCCCTTAGACGAGACTTTGAGACCTTG
ATGATGAAAAATGGAGAATCAATTGCTAATTTTTTGTCACGGGCAACGACAATTATTATGGAGAAAGTATTGAGAATTTTGACTCCAAAGTTTGATCATGTTGTGGCTAC
GATAGAAGAATCAAAGAATCTGTCCACTTTCACATTTATTGAATTAATGGAATCTCTTGAAGCACATGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAG
CGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATGACAGTGATCGATATCGTGGTCGAGATCGTGGTACCGGAAAAGGATGTAACCAAAATGAAGAACAAAGGCAGTTC
AGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCCTTGCAAGAAGTTTGGTCATGTAAAGGTCGACTGTTGTTTAAAGCCTGTATTCAAGGAGCTTAACGAAGG
AGAAAAGTTGAAGGTAGAGCTCGGAAATGGCAAGGAGCTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTTTCACAAATGTTCAGT
ATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAACTAATGGAGAATGGGTATTCTATCTTGTTTGATGATGAAACCAAATCAGAAACATTTGAGAAGTTCAAG
CATTTCAAGGCAAAGGTAGAAAACCAGAGTGGCATGTTTATCAAATCTCTTCCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAACAC
GACATCCATAGAGAGTTGA
Protein sequenceShow/hide protein sequence
MGSNGNAMSTAQPLISIFKGEGYEFWSIRMKTLLKSQDLWDLVEQGYADPDDEGKLRENRKKDSKALVIIQQAVHDSIFSRIAVATTAFQGDSRVLVVKLQSLRRDFETL
MMKNGESIANFLSRATTIIMEKVLRILTPKFDHVVATIEESKNLSTFTFIELMESLEAHESRINRSMERNEEKAFQVKDVVPKYNDSDRYRGRDRGTGKGCNQNEEQRQF
RVQSSNKANIQCYPCKKFGHVKVDCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRIFTNVQYVPDIGYNLLSVGQLMENGYSILFDDETKSETFEKFK
HFKAKVENQSGMFIKSLPVIEVENFCPTTLTIFARNTTSIES