; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G17650 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G17650
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationClcChr05:26492798..26494155
RNA-Seq ExpressionClc05G17650
SyntenyClc05G17650
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649926.1 hypothetical protein Csa_011922 [Cucumis sativus]2.9e-12278.76Show/hide
Query:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR
        MNSTDQL NF A AQ S  +PD EPKKQ+RRRR S RRLYKE+PLDMAEARREIVTALKLHRA STKE AREQQQKQDQ+ KQS PLF Q   CFEAEGR
Subjt:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR

Query:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMA
        RKSRRNPRIYP CSY CSFYL NGSG VAPPP  +NLNTEIPIQ+FDD FKT+DTCSSF SLS W PPSSYICPT+SCPDTHQE+PKS SL EE G LMA
Subjt:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMA

Query:  SDLFWSNNDPTGESEKDMQRWAVEEEKAM-AMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIEDV
        SD+FW NNDPTG SEKDMQ+  V EE+AM AMA+I+SMSMDVKALE DG HSSDNAMEFPDWL INDDFL Q+ NY+CVE DYLQ PDLSCFD  KIED+
Subjt:  SDLFWSNNDPTGESEKDMQRWAVEEEKAM-AMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIEDV

Query:  DGDWLA
        D +WLA
Subjt:  DGDWLA

KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia]8.1e-9664.2Show/hide
Query:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR
        MNSTDQLCNF AT     QPQP GE KKQ+RRRRQSRRLYK+MPL+MAEARREIVTALKLHRASTKEA+EQQQKQDQQIK S+P++  Q  PCFE E R 
Subjt:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR

Query:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCS--------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKSTSLSE
        KSRRNPRIYP     CSFY  NGS F+APPP+AQ+L+ +IPIQ+        DT S        SFYSLS  PPSSYICPT      THQEVPKS SLSE
Subjt:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCS--------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKSTSLSE

Query:  EAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNCVEGD
        E G+LMASDLFWSNN PTGESEK++       EEE+   +AEIR  SM+ K LE DG             S+ AMEFPDWL INDDFL    NY     D
Subjt:  EAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNCVEGD

Query:  YLQYPDLSCFDFGKIEDVDGDWLA
        YLQ PDLSC D G+IEDVDGDWLA
Subjt:  YLQYPDLSCFDFGKIEDVDGDWLA

XP_016901295.1 PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo]2.0e-11876.62Show/hide
Query:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR
        MNS DQL NF A AQ S  +PD EPKKQ+RRRR S RRLYKE+PLDMAEARREIVTALKLHRA STKE AREQQQKQDQ+ KQS PLF +L  CFEAEGR
Subjt:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR

Query:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM
        RKS+RNPRIYP CSY CSFYL NGSGFVAPPP  +NLNTEIPIQ+FDD FKT+DTCSSF SLS W PPSSYICPTVSCPDT HQE PKS SL EE G LM
Subjt:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM

Query:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE
        ASD+FW NNDPTG +EKDMQ+ AV EE+AMAMA  +++SMSMDVKALE D HHSSDNAM FPDW+ INDD L Q+ NY+CVE D LQ PDLSCFD GKIE
Subjt:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE

Query:  DVDGDWLA
        D+  +WLA
Subjt:  DVDGDWLA

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata]6.2e-9664.02Show/hide
Query:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR
        MNSTDQLCNF AT     QPQP GE KKQ+RRRRQSRRLYK+MPL+MAEARREIVTALKLHRASTKEA+EQQQKQDQQIK S+P++  Q  PCFE E R 
Subjt:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR

Query:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQ------SFDDGFKTVDTCS------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKST
        KSRRNPRIYP     CSFY  NGS F+APPP+AQ+L+ +IPIQ      +F+D    V  CS      SFYSLS  PPSSYICPT      THQEVPKS 
Subjt:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQ------SFDDGFKTVDTCS------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKST

Query:  SLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNC
        SLSEE G+LMASDLFWSNN PTGESEK++       EEE+   +AEIR  S+D K LE DG             S+ AMEFPDWL INDDFL    NY  
Subjt:  SLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNC

Query:  VEGDYLQYPDLSCFDFGKIEDVDGDWLA
           DYLQ PDLSC D G+IEDVDGDWLA
Subjt:  VEGDYLQYPDLSCFDFGKIEDVDGDWLA

XP_038897806.1 uncharacterized protein LOC120085720 [Benincasa hispida]3.4e-11074.48Show/hide
Query:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRK
        MNS DQLCNF A AQ SQP+PDGE KKQ+RRRR S RRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQI QS+P+F QL PCFE +GRRK
Subjt:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRK

Query:  SRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLWPPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMASDL
        SRRN R YP     CSFYL NGSGFVAPP +AQNL TEIP QSFDD FKT    SS+  LS WPPSSYI PTVSC  THQEVPKS SLSEE G LMASD+
Subjt:  SRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLWPPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMASDL

Query:  FWSNNDPTGESEKDMQRWAVEEEKAMAMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDF
        FW NND     +KDMQ  AVEE +A AMAE+R M+MDVKALE+DGHHS +N MEF DW  INDDFL QH NY+CVE DYLQ PDLS + F
Subjt:  FWSNNDPTGESEKDMQRWAVEEEKAMAMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDF

TrEMBL top hitse value%identityAlignment
A0A0A0L091 Uncharacterized protein2.0e-10079.44Show/hide
Query:  MAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSF
        MAEARREIVTALKLHRA STKE AREQQQKQDQ+ KQS PLF Q   CFEAEGRRKSRRNPRIYP CSY CSFYL NGSG VAPPP  +NLNTEIPIQ+F
Subjt:  MAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSF

Query:  DDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWAVEEEKAM-AMAEIRSMSMDVKALE
        DD FKT+DTCSSF SLS W PPSSYICPT+SCPDTHQE+PKS SL EE G LMASD+FW NNDPTG SEKDMQ+  V EE+AM AMA+I+SMSMDVKALE
Subjt:  DDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWAVEEEKAM-AMAEIRSMSMDVKALE

Query:  TDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDF
         DG HSSDNAMEFPDWL INDDFL Q+ NY+CVE DYLQ PDLS + F
Subjt:  TDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDF

A0A1S4DZY0 uncharacterized protein LOC1034937179.6e-11976.62Show/hide
Query:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR
        MNS DQL NF A AQ S  +PD EPKKQ+RRRR S RRLYKE+PLDMAEARREIVTALKLHRA STKE AREQQQKQDQ+ KQS PLF +L  CFEAEGR
Subjt:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR

Query:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM
        RKS+RNPRIYP CSY CSFYL NGSGFVAPPP  +NLNTEIPIQ+FDD FKT+DTCSSF SLS W PPSSYICPTVSCPDT HQE PKS SL EE G LM
Subjt:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM

Query:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE
        ASD+FW NNDPTG +EKDMQ+ AV EE+AMAMA  +++SMSMDVKALE D HHSSDNAM FPDW+ INDD L Q+ NY+CVE D LQ PDLSCFD GKIE
Subjt:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE

Query:  DVDGDWLA
        D+  +WLA
Subjt:  DVDGDWLA

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X29.6e-11976.62Show/hide
Query:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR
        MNS DQL NF A AQ S  +PD EPKKQ+RRRR S RRLYKE+PLDMAEARREIVTALKLHRA STKE AREQQQKQDQ+ KQS PLF +L  CFEAEGR
Subjt:  MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQS-RRLYKEMPLDMAEARREIVTALKLHRA-STKE-AREQQQKQDQQIKQSVPLFSQLCPCFEAEGR

Query:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM
        RKS+RNPRIYP CSY CSFYL NGSGFVAPPP  +NLNTEIPIQ+FDD FKT+DTCSSF SLS W PPSSYICPTVSCPDT HQE PKS SL EE G LM
Subjt:  RKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLW-PPSSYICPTVSCPDT-HQEVPKSTSLSEEAGKLM

Query:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE
        ASD+FW NNDPTG +EKDMQ+ AV EE+AMAMA  +++SMSMDVKALE D HHSSDNAM FPDW+ INDD L Q+ NY+CVE D LQ PDLSCFD GKIE
Subjt:  ASDLFWSNNDPTGESEKDMQRWAVEEEKAMAMA--EIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIE

Query:  DVDGDWLA
        D+  +WLA
Subjt:  DVDGDWLA

A0A6J1FRD8 uncharacterized protein LOC1114462253.0e-9664.02Show/hide
Query:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR
        MNSTDQLCNF AT     QPQP GE KKQ+RRRRQSRRLYK+MPL+MAEARREIVTALKLHRASTKEA+EQQQKQDQQIK S+P++  Q  PCFE E R 
Subjt:  MNSTDQLCNFVAT-AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEAEGRR

Query:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQ------SFDDGFKTVDTCS------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKST
        KSRRNPRIYP     CSFY  NGS F+APPP+AQ+L+ +IPIQ      +F+D    V  CS      SFYSLS  PPSSYICPT      THQEVPKS 
Subjt:  KSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQ------SFDDGFKTVDTCS------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKST

Query:  SLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNC
        SLSEE G+LMASDLFWSNN PTGESEK++       EEE+   +AEIR  S+D K LE DG             S+ AMEFPDWL INDDFL    NY  
Subjt:  SLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYNC

Query:  VEGDYLQYPDLSCFDFGKIEDVDGDWLA
           DYLQ PDLSC D G+IEDVDGDWLA
Subjt:  VEGDYLQYPDLSCFDFGKIEDVDGDWLA

A0A6J1IXC1 uncharacterized protein LOC1114807864.8e-9462.31Show/hide
Query:  MNSTDQLCNFVAT-----AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEA
        MNSTDQLCNF AT         QPQP GE KKQ+RRRR++RRLYK+MPL+MAEARREIVTALKLHRASTKEA+EQQQKQDQQIK S+P++  Q  PCFE 
Subjt:  MNSTDQLCNFVAT-----AQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLF-SQLCPCFEA

Query:  EGRRKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCS---------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKS
        E R KSRRNPRIYP     CSFY  NGS F+APPP+AQ+L+ +IPIQ+        DT S         SFYSLS   PSSYICPT      TH+EVPKS
Subjt:  EGRRKSRRNPRIYPGCSYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCS---------SFYSLSLWPPSSYICPTVS-CPDTHQEVPKS

Query:  TSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYN
         SLSEE G+LMASDLFWSNN PTGESEK++       EEE+   +AEIR  SMD K LE DG             S+ AMEFPDWL INDDFL    NY 
Subjt:  TSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWA--VEEEKAMAMAEIRSMSMDVKALETDGH----------HSSDNAMEFPDWLGINDDFLHQHWNYN

Query:  CVEGDYLQYPDLSCFDFGKIEDVDGDWLA
            DYLQ PDLSC D G+IEDVDGDWLA
Subjt:  CVEGDYLQYPDLSCFDFGKIEDVDGDWLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein5.0e-1933.33Show/hide
Query:  KKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRKSRRNPRIYPGCSYHCSFYLGNGSGFV
        KKQ+RRR  + R Y+E  L+MAEARREIVTALK HRAS ++A      Q     Q + LFS   P             P  +   +   +F L N     
Subjt:  KKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRKSRRNPRIYPGCSYHCSFYLGNGSGFV

Query:  APPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLWPPSSYICPTVSCPDTHQEVPK--STSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWAVEEEK
           PL  NLN     Q F+D  +T  T SS  S S    SS I PT     +    P   +T+ S+ A +L +S          GE+      W  E   
Subjt:  APPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLWPPSSYICPTVSCPDTHQEVPK--STSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWAVEEEK

Query:  AMAMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIEDVDG-DWLA
             EI+  + +V  +E D      + MEFP WL   ++ L   +N           P LSC + G+IE +DG DWLA
Subjt:  AMAMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIEDVDG-DWLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCTACAGACCAACTCTGCAACTTTGTAGCTACTGCACAATTCTCACAGCCACAGCCAGATGGAGAACCAAAGAAACAGATTAGAAGGAGGCGCCAAAGCCGGCG
GCTTTACAAAGAAATGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACCAAAGAAGCAAGAGAGCAGCAACAAAAAC
AGGATCAACAAATTAAACAATCAGTTCCTCTGTTTTCTCAATTATGTCCATGTTTTGAAGCTGAAGGAAGAAGAAAATCCAGGAGAAATCCCAGGATATACCCAGGTTGT
TCATATCATTGCTCATTTTATTTGGGAAATGGGTCTGGTTTTGTTGCTCCTCCACCTCTTGCACAGAATCTCAATACAGAGATCCCTATACAAAGCTTTGATGATGGTTT
CAAAACTGTGGATACTTGTTCTTCATTTTATTCACTTTCACTCTGGCCCCCATCTTCATATATTTGTCCCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAAT
CAACTTCATTATCTGAGGAAGCAGGGAAGCTAATGGCTTCTGATTTGTTTTGGTCAAATAATGATCCAACTGGAGAGAGTGAAAAAGATATGCAGCGGTGGGCGGTGGAG
GAGGAGAAGGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGACTGATGGCCACCATAGTTCTGATAATGCTATGGAATTTCCAGATTG
GTTGGGCATTAATGATGATTTTCTGCATCAGCATTGGAATTATAATTGCGTAGAGGGGGATTATCTTCAATATCCTGACCTATCTTGCTTCGACTTTGGGAAGATTGAAG
ATGTGGATGGAGATTGGCTGGCATGA
mRNA sequenceShow/hide mRNA sequence
GCATAGGATACCTTATTGTATAGCCGGCGCCGCCTTATAGACTTTGTCTGGGAACAAAGTCACGTCATTGCACAATCGGTTCAGTATTTAAAGCCCTCAATTCTCCCAAC
CGATCATCAATCTTCATCTCTCCACTCTTTTTCCACCTGAGAGAAAAGCAAATAGATGAACATTAAATAGTTAGCTCAAGATTTGAACTAATCTTCTTTACCTCACTAAT
TCAATGAACTCTACAGACCAACTCTGCAACTTTGTAGCTACTGCACAATTCTCACAGCCACAGCCAGATGGAGAACCAAAGAAACAGATTAGAAGGAGGCGCCAAAGCCG
GCGGCTTTACAAAGAAATGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACCAAAGAAGCAAGAGAGCAGCAACAAA
AACAGGATCAACAAATTAAACAATCAGTTCCTCTGTTTTCTCAATTATGTCCATGTTTTGAAGCTGAAGGAAGAAGAAAATCCAGGAGAAATCCCAGGATATACCCAGGT
TGTTCATATCATTGCTCATTTTATTTGGGAAATGGGTCTGGTTTTGTTGCTCCTCCACCTCTTGCACAGAATCTCAATACAGAGATCCCTATACAAAGCTTTGATGATGG
TTTCAAAACTGTGGATACTTGTTCTTCATTTTATTCACTTTCACTCTGGCCCCCATCTTCATATATTTGTCCCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCA
AATCAACTTCATTATCTGAGGAAGCAGGGAAGCTAATGGCTTCTGATTTGTTTTGGTCAAATAATGATCCAACTGGAGAGAGTGAAAAAGATATGCAGCGGTGGGCGGTG
GAGGAGGAGAAGGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGACTGATGGCCACCATAGTTCTGATAATGCTATGGAATTTCCAGA
TTGGTTGGGCATTAATGATGATTTTCTGCATCAGCATTGGAATTATAATTGCGTAGAGGGGGATTATCTTCAATATCCTGACCTATCTTGCTTCGACTTTGGGAAGATTG
AAGATGTGGATGGAGATTGGCTGGCATGATCGTGTGGTTTTATCTCATACCATGCCTTTGAGTAATTAGAAAGAAATGAAGAATCTTCTTTCCAAAACATGATCCTCTCC
CCCTATTTAAATCAATGTTTAAAAA
Protein sequenceShow/hide protein sequence
MNSTDQLCNFVATAQFSQPQPDGEPKKQIRRRRQSRRLYKEMPLDMAEARREIVTALKLHRASTKEAREQQQKQDQQIKQSVPLFSQLCPCFEAEGRRKSRRNPRIYPGC
SYHCSFYLGNGSGFVAPPPLAQNLNTEIPIQSFDDGFKTVDTCSSFYSLSLWPPSSYICPTVSCPDTHQEVPKSTSLSEEAGKLMASDLFWSNNDPTGESEKDMQRWAVE
EEKAMAMAEIRSMSMDVKALETDGHHSSDNAMEFPDWLGINDDFLHQHWNYNCVEGDYLQYPDLSCFDFGKIEDVDGDWLA