CuGenDBv2

ClCG02G014005 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG02G014005
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: DUF4283 domain-containing protein
Genome location: CG_Chr02:28205436..28206185
RNA-Seq Expression: ClCG02G014005
Synteny: ClCG02G014005
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like
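The genome location above can be split into its sequence ID and coordinates to get the span of the gene model. The following is a minimal sketch, not part of the database itself; the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("CG_Chr02:28205436..28206185")
length = span_length(start, end)
print(seqid, start, end, length)
```

Under that assumption the span is 750 bp, which is a multiple of three.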


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034063.1 hypothetical protein E6C27_scaffold65G00490 [Cucumis melo var. makuwa] (e value: 1.0e-80; identity: 64.08%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STTRS VCKFS SQT+ + ++F H LIAWVVGK IRPL+LA  L RHL LTE P VF+LGLG+FVLKF E DF ALE NPW IPNLCI+ FPW PNFKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  S+ID WIRL +L IEYY E+ILR+I +T+GE LVKI PITKDRKKCKYARICVRIN+ +P PSSI++GKI QEIEYEGFD+LC  C CV  LKHDC
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  ------------------SNSMMPLISSELSSA--------GTES
                          SNS  PL+ SE S A        GTES
Subjt:  ------------------SNSMMPLISSELSSA--------GTES

KAG6600114.1 hypothetical protein SDJN03_05347, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-77; identity: 67.13%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STT + VC  SPSQT R+TQ+F H LIAWV G++IRP QLA +LRRHLHLT++ +VF+LGLG+FVLKFSE D+ ALE  PWSIPNLCI+ F W P+FKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  SS+D WIRLH+LSIEYYDEEILR+IA TIG  LVK  P+TK+R+KCK+ARIC+RINLCDP PS IKLG+I+Q+IEYEG DLLC  C  V  LK +C
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  SNSMMPLISSELSSAG
         NS  P  SS L + G
Subjt:  SNSMMPLISSELSSAG

KAG7030785.1 hypothetical protein SDJN02_04822, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 9.1e-77; identity: 66.67%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STT + VC  SPSQT R+TQ+F H LIAWV G++IRP QLA +LRRHLHLT++ +VF+LGLG+FVLKFSE D+ ALE  PWSIPNLCI+ F W P+FKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  SS+D WIRL +LSIEYYDEEILR+IA TIG  LVK  P+TK+R+KCK+ARIC+RINLCDP PS IKLG+I+Q+IEYEG DLLC  C  V  LK +C
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  SNSMMPLISSELSSAG
         NS  P  SS L + G
Subjt:  SNSMMPLISSELSSAG

KAG7031864.1 hypothetical protein SDJN02_05905, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.3e-72; identity: 63.05%)
Query:  RSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPSEAK
        ++ VC+ S SQT R+TQ+F H  IAW+ GK++RP ++A  LRRHL LT   +VF+LGLG+FVLKF E DF AL+  PWS+PNLCIHV PW P+FKPSE  
Subjt:  RSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPSEAK

Query:  ISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDCSNS
        +SS+D W+RLH+LSIEYYD+E+L++IA  IG  LVKI P+TK+R KCK+ARICVR+NLCDP PS I+LGKIRQEIEYEGF+LLC  C  V  L+H+C N 
Subjt:  ISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDCSNS

Query:  MMP
         +P
Subjt:  MMP

KGN50454.1 hypothetical protein Csa_000484 [Cucumis sativus] (e value: 2.1e-81; identity: 64.49%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STTRS VCKFS SQT+ + ++F H LIAWVVGK IRPL+LA  L RHL LT+ P VF+LGLG+FVLKF E DF A+E NPW IPNLCI+ FPW PNFKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  S+ID WIRL +L IEYY E+ILR+I +T+GEGLVKI PITKDRKKCKYARICVRIN+ +P PSSI++GKI QEIEYEGFDLLC  C CV  LKHDC
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  ------------------SNSMMPLISSELSSA--------GTES
                          SNS  PL+SSE S A        GTES
Subjt:  ------------------SNSMMPLISSELSSA--------GTES

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLB0 DUF4283 domain-containing protein (e value: 4.9e-68; identity: 60.85%)
Query:  STTRSAV--CKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFK
        STT + V  C  +PS+T R+TQ+F H LIA VVGK+ RP QLA +LR HL LT++ +VF LGLG+FVLKFSE D+ ALE  PWSIPNLCIH FPW P+FK
Subjt:  STTRSAV--CKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFK

Query:  PSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKH
        PSEA  SS++ WIRL +LSIEYYD  IL+ IA+ IG+ LVKI P+T+DR KCK+AR C+ +NLCDP PS I+LG++RQ IEYEGF+ LC  C  V  L+H
Subjt:  PSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKH

Query:  DCSNSMMPLISS
        DCS+   P +++
Subjt:  DCSNSMMPLISS

A0A0A0KNJ5 DUF4283 domain-containing protein (e value: 1.0e-81; identity: 64.49%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STTRS VCKFS SQT+ + ++F H LIAWVVGK IRPL+LA  L RHL LT+ P VF+LGLG+FVLKF E DF A+E NPW IPNLCI+ FPW PNFKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  S+ID WIRL +L IEYY E+ILR+I +T+GEGLVKI PITKDRKKCKYARICVRIN+ +P PSSI++GKI QEIEYEGFDLLC  C CV  LKHDC
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  ------------------SNSMMPLISSELSSA--------GTES
                          SNS  PL+SSE S A        GTES
Subjt:  ------------------SNSMMPLISSELSSA--------GTES

A0A5A7SSJ3 DUF4283 domain-containing protein (e value: 9.8e-69; identity: 60.91%)
Query:  STTRSAV--CKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFK
        STT + V  C  +PS+T R+TQ+F H LIA VVGK+ RP QLA +LR HL LT++ +VF+LGLG+FVLKFSE D+ ALE  PWSIPNLCIH FPW P+FK
Subjt:  STTRSAV--CKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFK

Query:  PSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKH
        PSEA  SS++ WIRL +LSIEYYD EIL+ IA+ IG  LVKI P+T+DR KCK+AR C+ +NLCDP PS I+LG+IRQ IEYEGF+ LC  C  V  L+H
Subjt:  PSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKH

Query:  DCSNSMMPLISSELSSAGTE
        DCS+   P  S   +  G E
Subjt:  DCSNSMMPLISSELSSAGTE

A0A5A7SUD3 DUF4283 domain-containing protein (e value: 5.0e-81; identity: 64.08%)
Query:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS
        STTRS VCKFS SQT+ + ++F H LIAWVVGK IRPL+LA  L RHL LTE P VF+LGLG+FVLKF E DF ALE NPW IPNLCI+ FPW PNFKPS
Subjt:  STTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPS

Query:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC
        EA  S+ID WIRL +L IEYY E+ILR+I +T+GE LVKI PITKDRKKCKYARICVRIN+ +P PSSI++GKI QEIEYEGFD+LC  C CV  LKHDC
Subjt:  EAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDC

Query:  ------------------SNSMMPLISSELSSA--------GTES
                          SNS  PL+ SE S A        GTES
Subjt:  ------------------SNSMMPLISSELSSA--------GTES

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X2 (e value: 2.0e-66; identity: 60.27%)
Query:  SSTTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSE-ID-FQALEANPWSIPNLCIHVFPWIPNF
        +ST  + VC  +PSQT R+ Q+F   LI WVVGK I P QLAV+LRR+LHL  +  VF+LGLGFFVLKFS  +D ++ALE  PWSIP+LCI+VFPWIPNF
Subjt:  SSTTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSE-ID-FQALEANPWSIPNLCIHVFPWIPNF

Query:  KPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLK
        KPSEA I  +D WIRL +LSIEYYD+E+L +IAETIG  LVKI P+T  R+KC YARIC+R+NL  P   S + GK  Q+I YEG DLLC +CGCV  LK
Subjt:  KPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLK

Query:  HDCSNSMMPLISSELSSAG
        HDC       +S+  SS+G
Subjt:  HDCSNSMMPLISSELSSAG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G01050.1 zinc ion binding; nucleic acid binding (e value: 1.9e-19; identity: 29.8%)
Query:  CLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKF--SEIDFQALEANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYD
        C+I  V+G  I    L  +LR     +    V DL   FF+++F   E    AL   PW +    + V  W   F P    I +   W+RL  +   YY 
Subjt:  CLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKF--SEIDFQALEANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYD

Query:  EEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDCSNSMMPLISSELSSAGTESV
          +L EIA  +G  L K+   T +  K ++AR+C+ +NL  P   ++ +   R  + YEG   +C  CG    L H C  +++  +     SAG E+V
Subjt:  EEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDCSNSMMPLISSELSSAGTESV

AT2G02103.1 unknown protein (e value: 2.0e-05; identity: 23.03%)
Query:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS
        +EID   ++   PW   N  +    W     P+   +++ID W+++  + + Y  EE + EIA  +GE ++ +        +  + R+ VR  + D    
Subjt:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS

Query:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD---CSNSMMPLISSELSSAGTESVE
          +L   ++ I   G        ++ L RLC   FR  H+   C     PL  +   +   +SV+
Subjt:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD---CSNSMMPLISSELSSAGTESVE

AT2G41590.1 unknown protein (e value: 8.9e-06; identity: 24.82%)
Query:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS
        +EID   ++   PW   N  +    W     P+   +++ID W+++  + + Y  EE + EIA+ +GE L+     T    +  Y R+ VR  + D    
Subjt:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS

Query:  SIKLGKIRQEIEYEG---------FDLLCRLCGCVFRLKHD
          +L +  Q I ++          ++ L R+C   FR  H+
Subjt:  SIKLGKIRQEIEYEG---------FDLLCRLCGCVFRLKHD

AT5G18636.1 unknown protein (e value: 1.5e-05; identity: 23.57%)
Query:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS
        +EID   ++   PW   N  +    W     P+   +++ID W+++  + + Y  EE + EIA+ +GE ++ +        +  + R+ VR  + D    
Subjt:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS

Query:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD
          +L   ++ I   G        ++ L RLC   FR  H+
Subjt:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD

AT5G25200.1 unknown protein (e value: 1.2e-05; identity: 23.03%)
Query:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS
        +EID   ++   PW   N  +    W     P+   +++ID W+++  + + Y  EE + EIA+ +GE ++ +        +  + R+ VR  + D    
Subjt:  SEIDFQALE-ANPWSIPNLCIHVFPWIPNFKPSEAKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPS

Query:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD---CSNSMMPLISSELSSAGTESVE
          +L   ++ I   G        ++ L RLC   FR  H+   C     PL  +   +   +SV+
Subjt:  SIKLGKIRQEIEYEG--------FDLLCRLCGCVFRLKHD---CSNSMMPLISSELSSAGTESVE
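
The alignment blocks above follow the usual BLAST-style layout, in which the unlabeled middle line repeats a residue where query and subject are identical and shows "+" for a conservative substitution. As a rough illustration, the sketch below counts identities and positives for one short segment copied from this page (the final block of the KAG6600114.1 alignment). The function name `midline_stats` is hypothetical. The reported % identity values cover the whole alignment, so a single segment is not expected to reproduce them.

```python
def midline_stats(query, midline):
    """Count identities and positives from a BLAST-style midline.

    A letter in the midline that equals the query residue marks an identity;
    a '+' marks a conservative substitution. Positives include identities.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and m not in " -"
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Segment copied from the page; spacing in the midline is significant.
query = "SNSMMPLISSELSSAG"
midline = " NS  P  SS L + G"

identities, positives, aligned = midline_stats(query, midline)
print(f"identities {identities}/{aligned} = {100 * identities / aligned:.2f}%")
print(f"positives  {positives}/{aligned} = {100 * positives / aligned:.2f}%")
```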


Sequences
CDS sequence
ATGGCAACTCAGCCCCGAATTTTCTCCTCCACCACCCGCTCCGCCGTCTGTAAGTTCTCTCCCTCTCAAACCAATCGTATGACTCAAAAATTCACTCACTGTCTCATAGC
CTGGGTCGTCGGAAAGAACATCCGTCCATTGCAACTCGCCGTTCAACTTCGCCGTCATCTCCATCTCACCGAAAATCCGCAGGTCTTCGACCTAGGTCTTGGTTTTTTCG
TACTCAAATTCTCCGAGATCGATTTTCAAGCCCTAGAAGCCAATCCATGGTCAATCCCTAATCTCTGCATCCACGTCTTTCCATGGATTCCCAATTTCAAACCCTCGGAA
GCCAAGATTTCTTCTATTGATGCTTGGATTCGGCTCCATCAGCTCTCCATAGAGTATTACGACGAAGAAATTTTGCGAGAAATTGCAGAGACCATCGGTGAAGGTCTCGT
CAAAATCGTTCCGATTACAAAAGATCGGAAGAAATGTAAGTACGCTCGTATTTGCGTTAGAATAAATTTATGTGATCCACCTCCATCTTCGATCAAACTGGGTAAAATTC
GACAGGAAATTGAGTATGAGGGTTTTGATCTGTTGTGCCGTCTCTGTGGATGTGTTTTTCGTCTGAAGCATGATTGTTCGAATTCAATGATGCCATTGATTTCTTCTGAA
TTATCTTCAGCCGGAACAGAATCTGTTGAACTGGCCGCATATTTATTGTTAAGAAATCCAAGATTAATGAAATATGTTTTTGAAGAATAA
mRNA sequence
ATGGCAACTCAGCCCCGAATTTTCTCCTCCACCACCCGCTCCGCCGTCTGTAAGTTCTCTCCCTCTCAAACCAATCGTATGACTCAAAAATTCACTCACTGTCTCATAGC
CTGGGTCGTCGGAAAGAACATCCGTCCATTGCAACTCGCCGTTCAACTTCGCCGTCATCTCCATCTCACCGAAAATCCGCAGGTCTTCGACCTAGGTCTTGGTTTTTTCG
TACTCAAATTCTCCGAGATCGATTTTCAAGCCCTAGAAGCCAATCCATGGTCAATCCCTAATCTCTGCATCCACGTCTTTCCATGGATTCCCAATTTCAAACCCTCGGAA
GCCAAGATTTCTTCTATTGATGCTTGGATTCGGCTCCATCAGCTCTCCATAGAGTATTACGACGAAGAAATTTTGCGAGAAATTGCAGAGACCATCGGTGAAGGTCTCGT
CAAAATCGTTCCGATTACAAAAGATCGGAAGAAATGTAAGTACGCTCGTATTTGCGTTAGAATAAATTTATGTGATCCACCTCCATCTTCGATCAAACTGGGTAAAATTC
GACAGGAAATTGAGTATGAGGGTTTTGATCTGTTGTGCCGTCTCTGTGGATGTGTTTTTCGTCTGAAGCATGATTGTTCGAATTCAATGATGCCATTGATTTCTTCTGAA
TTATCTTCAGCCGGAACAGAATCTGTTGAACTGGCCGCATATTTATTGTTAAGAAATCCAAGATTAATGAAATATGTTTTTGAAGAATAA
Protein sequence
MATQPRIFSSTTRSAVCKFSPSQTNRMTQKFTHCLIAWVVGKNIRPLQLAVQLRRHLHLTENPQVFDLGLGFFVLKFSEIDFQALEANPWSIPNLCIHVFPWIPNFKPSE
AKISSIDAWIRLHQLSIEYYDEEILREIAETIGEGLVKIVPITKDRKKCKYARICVRINLCDPPPSSIKLGKIRQEIEYEGFDLLCRLCGCVFRLKHDCSNSMMPLISSE
LSSAGTESVELAAYLLLRNPRLMKYVFEE
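
The CDS and protein sequences above can be cross-checked by translating the nucleotides with the standard genetic code. The sketch below makes some assumptions: the standard code (NCBI translation table 1) applies, the names `CODON_TABLE` and `translate` are illustrative only, and, to keep it short, it translates just the first 24 nucleotides and the last line of the CDS as listed on this page. The last line is assumed to start in frame.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 nucleotides of the CDS and its final line, copied from the page.
cds_start = "ATGGCAACTCAGCCCCGAATTTTC"
cds_end = (
    "TTATCTTCAGCCGGAACAGAATCTGTTGAACTGGCCGCATATTTATTGTT"
    "AAGAAATCCAAGATTAATGAAATATGTTTTTGAAGAATAA"
)

print(translate(cds_start))  # MATQPRIF
print(translate(cds_end))    # LSSAGTESVELAAYLLLRNPRLMKYVFEE
```

Both fragments agree with the start and the last line of the protein sequence listed above.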