; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G26320 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G26320
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionglycine-rich RNA-binding protein 4, mitochondrial
Genome locationClcChr01:36391780..36398283
RNA-Seq ExpressionClc01G26320
SyntenyClc01G26320
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0009058 - biosynthetic process (biological process)
GO:0016607 - nuclear speck (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439484.1 PREDICTED: glycine-rich RNA-binding protein 4, mitochondrial [Cucumis melo]2.8e-6677.38Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWA TVS PSISNLRGP+  NN L  S R SLNLRASFFDYPLASRLM              V +LPYSTNESRLQEEFSNFGEIAEVL+AKDR TKRPK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY
        GYAFIQYTCQDDAMLALENMDCK FDGR IYVE+AKPGS SFGGYPRTSGPP+ERP VE E++ DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY

XP_022977992.1 small RNA-binding protein 11, chloroplastic [Cucurbita maxima]2.2e-6679.41Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWAATVSFPS+SNLRGPI LNNRLD SKR SLNLRASF DYPLASRLM              V +LPYSTNE+RLQEEFSNFGEIAEVL+AKDRLTK PK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY
        GYAFIQYT QDDAMLALENMDCKKFDGR IYVELAKPGS SFGGYP TSGPP+ER  PK+  EEV DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY

XP_038883014.1 UBP1-associated proteins 1B isoform X1 [Benincasa hispida]6.1e-6968.57Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRL----------MSFGISLCRDFKKP--CV-------------------------
        MWAATVSFPSISNLRGPIL NNR DGSKR SLN RASF DYPLASRL          +SF +    +   P  CV                         
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRL----------MSFGISLCRDFKKP--CV-------------------------

Query:  -----GDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKV
              DLPYST ESRLQEEFSNFGEIAEVL+AKD+LTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS SFGGYPRTSGPP+ERPKV
Subjt:  -----GDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKV

Query:  ELEEVEDCWY
        EL+EV DCWY
Subjt:  ELEEVEDCWY

XP_038883015.1 glycine-rich RNA-binding protein GRP2A isoform X2 [Benincasa hispida]2.9e-7175.79Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRL----------MSFGISLCRDFKKP--CVG----------DLPYSTNESRLQEE
        MWAATVSFPSISNLRGPIL NNR DGSKR SLN RASF DYPLASRL          +SF +    +   P  CV           DLPYST ESRLQEE
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRL----------MSFGISLCRDFKKP--CVG----------DLPYSTNESRLQEE

Query:  FSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY
        FSNFGEIAEVL+AKD+LTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS SFGGYPRTSGPP+ERPKVEL+EV DCWY
Subjt:  FSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY

XP_038883016.1 organelle RRM domain-containing protein 6, chloroplastic isoform X3 [Benincasa hispida]2.3e-7178.45Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLM-------------SFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAE
        MWAATVSFPSISNLRGPIL NNR DGSKR SLN RASF DYPLASRL+             S  + L R+       DLPYST ESRLQEEFSNFGEIAE
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLM-------------SFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAE

Query:  VLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY
        VL+AKD+LTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS SFGGYPRTSGPP+ERPKVEL+EV DCWY
Subjt:  VLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY

TrEMBL top hitse value%identityAlignment
A0A0A0KL21 RRM domain-containing protein2.0e-6575.58Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWA TVSFPS+SNLRGPI  NNRL  S R+SLNLRASFFDYPLASRLM              V +LPYSTNESRLQEEFSNFGEIAEVL+AKDR TKRPK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERP----KVELEEVEDCWY
        GYAFIQYTCQDDAMLALE MDCK FDGRMIYVE+A PGS SFGGYPR SGPP+ERP     V+ E+V DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERP----KVELEEVEDCWY

A0A1S3AYG0 glycine-rich RNA-binding protein 4, mitochondrial1.4e-6677.38Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWA TVS PSISNLRGP+  NN L  S R SLNLRASFFDYPLASRLM              V +LPYSTNESRLQEEFSNFGEIAEVL+AKDR TKRPK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY
        GYAFIQYTCQDDAMLALENMDCK FDGR IYVE+AKPGS SFGGYPRTSGPP+ERP VE E++ DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY

A0A6J1CXC0 glycine-rich RNA-binding protein 4, mitochondrial4.9e-6473.81Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWA+TVSFP + N R PILL+N   G+ R SLNLRASFFDYPLASR+M              V +LPYST+E RLQEEFS+FGE+AEVL+AKDR+TKRP+
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY
        GYAFIQYTCQDDAMLALENMD KKFDGR IYVE+A+PGSDSFGGYPRTSGPP+ERPK+ELEEV DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY

A0A6J1EGU7 small RNA-binding protein 11, chloroplastic3.4e-6578.24Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWAATVSFPS+SNLRGPI L+NRLD SKR SLNLRASF DYPLASRLM              V +LPYSTNE+RLQEEFSNFGEIAEVL+AKDRLTK PK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY
        GYAFIQYT QDDAMLALENMDCKKFDGR IYVELAKPGS SF GYP TSGPP+ER  PK+  EEV DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY

A0A6J1IJY1 small RNA-binding protein 11, chloroplastic1.1e-6679.41Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK
        MWAATVSFPS+SNLRGPI LNNRLD SKR SLNLRASF DYPLASRLM              V +LPYSTNE+RLQEEFSNFGEIAEVL+AKDRLTK PK
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPK

Query:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY
        GYAFIQYT QDDAMLALENMDCKKFDGR IYVELAKPGS SFGGYP TSGPP+ER  PK+  EEV DCWY
Subjt:  GYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEER--PKVELEEVEDCWY

SwissProt top hitse value%identityAlignment
P60826 Cold-inducible RNA-binding protein1.9e-1242.05Show/hide
Query:  LCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDS
        +  D  K  VG L + TNE  L++ FS +G+I+EV+V KDR T+R +G+ F+ +   DDA  A+  M+ K  DGR I V+ A   SD+
Subjt:  LCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDS

Q14011 Cold-inducible RNA-binding protein1.9e-1242.05Show/hide
Query:  LCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDS
        +  D  K  VG L + TNE  L++ FS +G+I+EV+V KDR T+R +G+ F+ +   DDA  A+  M+ K  DGR I V+ A   SD+
Subjt:  LCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDS

Q9DED4 Cold-inducible RNA-binding protein B2.9e-1344.44Show/hide
Query:  DFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELA-KPGSDSFGGY
        D  K  +G L + TNE  L++ FS +G+I+EV+V KDR TKR +G+ F+ +   DDA  A+  M+ K  DGR I V+ A K   D  GGY
Subjt:  DFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELA-KPGSDSFGGY

Q9FX45 Organelle RRM domain-containing protein 6, chloroplastic2.9e-1337.5Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGP-PEERPK
        V  L + T E  L++ F  FG +  + +  D++  RPKG+AF++Y  +++AM A++ M  K  DGR+I+VE AK  SD     PR   P P+ +P+
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGP-PEERPK

Q9LIS2 Glycine-rich RNA-binding protein 4, mitochondrial8.5e-1341.86Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG
        VG L + T++S L++ F++FGE+ E  V  DR T R +G+ F+ ++C+D A  A++ MD K+ +GR I V LA   S     SFGG
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG

Arabidopsis top hitse value%identityAlignment
AT1G73530.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.1e-1437.5Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGP-PEERPK
        V  L + T E  L++ F  FG +  + +  D++  RPKG+AF++Y  +++AM A++ M  K  DGR+I+VE AK  SD     PR   P P+ +P+
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGP-PEERPK

AT3G23830.1 glycine-rich RNA-binding protein 46.0e-1441.86Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG
        VG L + T++S L++ F++FGE+ E  V  DR T R +G+ F+ ++C+D A  A++ MD K+ +GR I V LA   S     SFGG
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG

AT3G23830.2 glycine-rich RNA-binding protein 46.0e-1441.86Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG
        VG L + T++S L++ F++FGE+ E  V  DR T R +G+ F+ ++C+D A  A++ MD K+ +GR I V LA   S     SFGG
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGS----DSFGG

AT4G20030.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.5e-3848.26Show/hide
Query:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSL-NLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRP
        MW+AT+SFPS       +  ++ L   + L    ++AS F+YPLAS++M              V +LP+ST+E  L+ EFS FGEIAEV + KD   KR 
Subjt:  MWAATVSFPSISNLRGPILLNNRLDGSKRLSL-NLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRP

Query:  KGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVE---LEEVEDCWY
        KGYAFIQ+T QDDA LA+E MD + ++GRMIY+++AKPG   F G PRTSGPPE+  + E    +EV DCWY
Subjt:  KGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVE---LEEVEDCWY

AT5G06210.1 RNA binding (RRM/RBD/RNP motifs) family protein8.7e-1335.48Show/hide
Query:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSD--SFGGYPRTSGPPE
        +G L + T E  L E FS  G++ E  +  DR++ R KG+ F+ +   D+A  AL   + ++ +GR I+V+ AK        GGYP   GPP+
Subjt:  VGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQDDAMLALENMDCKKFDGRMIYVELAKPGSD--SFGGYPRTSGPPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGCGGCCACCGTCTCCTTCCCTTCAATTTCAAATCTTCGGGGACCCATTCTTCTGAATAATCGATTGGACGGTTCAAAGCGCCTATCTCTGAATCTTCGAGCTTC
TTTCTTTGATTATCCTCTTGCCAGCAGACTCATGTCTTTTGGAATCTCATTATGTCGAGATTTTAAGAAGCCTTGTGTAGGAGATTTACCATACTCCACCAATGAGAGTC
GGCTGCAAGAAGAATTTTCAAATTTTGGTGAGATTGCTGAAGTGTTAGTAGCTAAAGACCGATTAACAAAAAGACCCAAAGGTTATGCGTTTATTCAATACACTTGCCAA
GACGATGCAATGCTTGCTCTGGAAAATATGGATTGCAAGAAATTTGATGGAAGGATGATCTATGTGGAACTCGCAAAACCTGGTAGTGATTCCTTTGGAGGATATCCAAG
AACCTCCGGACCCCCAGAAGAGCGGCCTAAAGTGGAGCTAGAAGAGGTGGAAGATTGTTGGTATTGA
mRNA sequenceShow/hide mRNA sequence
CTAAACTGCATAATTGCGATAATTTTATAATAATTTTAGCTTTAATGCATATAAACTATAGACAACCATTTAAAATATATGAAATTACATTACTTAAATTGTGTGTAAAC
CAAAACAAACAACAATTATTTATGTATATTAATTTGTATAAATTATAATTAACGTAGGTACTTTATTTTTCCCAAATAATTTACATAGATGGTTTTAAAGGATGGCAAGT
AATTCTATCTATTAATCTTGATTTAGAATCTTATAGCTTATAGGTTTAAAAATGTTATTTAATATGAAATAAATTGGTATGTTGATGATATTTTGAGGGACGGTGGTGAG
AATGAGCTAGTTTGGTCCTAGAGAAGTAACCCTTATATTAGCCATTAGCTACCCTAAGATTTGGACAAATGAAATCATTTAATGATGAATTTGACCCAACCACGATCCTA
ACCCACTACCACTTGTTTTTCCTCTTGTCTGGATCTTTTTATTTAATGAGTATTACTTAACCATGATTTACATGTGTTGGTCTTTCTTTTTTGTCAGAAGTTCAAATCTT
TATACTTTTTTTTTTTTTTCTTTTTTGACATACGTAGGGATGGGGAATTGAACTCTAACTTCGAGGTTGATAATCAAGCACAATGCTAGTTGAGCTAACCATAAACGTTG
AGGAGTTTGTGGTATTTACTTTTGGTTAGAAATTCAAATCTCTATACTTTTTTTTTCTTTTTTTACATATGTGGGAGAGAGAGAATTGAATATTTGACTTCGAGTTGGTA
ATCAAATACTATGTCAGTTGAACTATGCTCATTTAGGTCATTCTCCATAAGATTATGTGTGTGAGGTTGGAAGATTTGAACTCAAAATATATCAGGACAACTCGATTTAG
CAAAAAATGGAGTTCACCAACAAGATTATGTGTGTGAGATTGGAGGAAAAACAAAAAGATTGAGCACCTTTCTTCTTTTATAGTGGGCCTTGCTAGTTGCTGTTATTTAC
TTCAGTCTTTTATTTAGATTATATCCTTTATCTTTTATGATATCATTCATTGTTGTTAAATTATGTAATGTAATAAATGATGTGTAATATACTTCTCAATTTGTAAACCA
TAATTAAAGGCTAAAAAATAATGTTTTTGAACAAGACATTTTATATAATTTAACATTGAAATTTTGTGAACGAAAACTGTATAACGTATTTTTGAAGCATGAAAATAGAA
TAAAGTTTTATATTATTGTTAGGAAGATATTTTATGTAACTTAACCTTTAAATTTTGTGAACGAAAATAGAATAACGTTTTTACATTATTGTTAGGAAGAAAGTTTGGAT
GAAAAATCAAAATTGTTAGGGGCTATGAAAATAAGAAACCGCGAATTTGGGGTTTTGAGGAACGAACGAACGAACGAAGCTCCACGGCGTCGACTGGGCAGATTTGAGCT
CCAACATGTGGGCGGCCACCGTCTCCTTCCCTTCAATTTCAAATCTTCGGGGACCCATTCTTCTGAATAATCGATTGGACGGTTCAAAGCGCCTATCTCTGAATCTTCGA
GCTTCTTTCTTTGATTATCCTCTTGCCAGCAGACTCATGTCTTTTGGAATCTCATTATGTCGAGATTTTAAGAAGCCTTGTGTAGGAGATTTACCATACTCCACCAATGA
GAGTCGGCTGCAAGAAGAATTTTCAAATTTTGGTGAGATTGCTGAAGTGTTAGTAGCTAAAGACCGATTAACAAAAAGACCCAAAGGTTATGCGTTTATTCAATACACTT
GCCAAGACGATGCAATGCTTGCTCTGGAAAATATGGATTGCAAGAAATTTGATGGAAGGATGATCTATGTGGAACTCGCAAAACCTGGTAGTGATTCCTTTGGAGGATAT
CCAAGAACCTCCGGACCCCCAGAAGAGCGGCCTAAAGTGGAGCTAGAAGAGGTGGAAGATTGTTGGTATTGATCTTCTTCTTACTAAGATACACAATATATAGGACACCG
ATGGCTTTAACAGTTAATGATAGTTGTGCCCAATGAGATTCAGATCAATCCCAAAGAATCTGTGAGTGCTGAAGTGTTTTCTAAGATGGATTAATTACATTTATTTTCAT
AGTTTCAAGCTTTAGATGCTGATGCCATTCTGCAACTCTTCTCCTGTATTTTGTTACTGGAACAGATATCTATTTAATCCTTGGACTTGAGCCGATGCCTGCAAGTGGGG
CAAGAAGAGTGAGACAGCAGCCATTTATCAATGCAAGTGACATGGTACCTGTGAGCACAGTTTGGCAACACCCGTATTTTATCCCCATTTGAGAAATCGACGAGGCAGAT
AGCACAGGCTGAAGTTGATGAAGGCGACGTCGGTGAGCCAGAATTCGTGTAAGTTGAAGTCGGCAGGGCAACCATGTCTTGCTTCTTCATGCCTGAGTTCAGTCCCCTTA
AAGCCACCCAGTGGACTGTTTGTCTAAGGGTGTGATTGGCGCATTGAAGTATACATATCAGCATGGTATTTAGACCAAGTGTGCAAACAACGGCGCAAACGACTGCAACC
ACTATGACAATGAGGTTGAAATTGCCACCTTGTTCTTGCTCGCGAGCATCTGGGATAGATACACTTGGTGATGGATTAAGGGAAGGAGAGGAGGAAGGTTCCATGAACGA
AGAACGGAGCATTTTTGGCGAGCTGAAGCGGCTGAGAAGAAAGCAAGATAAACACACATGTGAAACAAGTTGAGGTGAAGAAACATAAGATCTGGAGGAGGGCATATGAT
CAAATTAGATATTTGTGGACAGAGAGCTAGGTGGGTTCAATTCAACATGGAGGGTGAGAATGTGATGGGAAGTAGTGGGGTTATTTGAAAAATGAATCAACCTCTTTTTT
TCTTTTGTTTGTGCGCTACAAACAGAAAAGTCAGTCTCATTGTCCCCTCACTTTCTTTCATCCTTAGCACCAAAGAAAATCAATCAATTTCCCCATGTGGCTATTTTCAC
TCGGGACAACAGAACCTGATATAGAGTTATAACACATTTCAAGACCAGCTTTCATGTATTTTTGTACTCTAAGAAGTAAGTAGAGTGAGATCAAATTATGCAAACTCAAA
GTGAAACTTCTCACATCAACGCTACGACTAGCATTTGAAGTTGTAAGCAAGAGACGGGTTATCAGTTCAAAGACGTCTGCAAAGGGGAGTGCAATGTATATCATTTTTGG
GTTCATTGGTGAGGGCAAAGGCATATGAATGGGTACAAAGTTCAATCTCCAAATTATGTGGTTCTAGTTTACTCAACTACTGACCTCTTCATCGTTATTGACTATAAAAG
GATTGTATTCACTTCATCACTATTTTAATCCATTCTTTATTCTGTATGCATACCAAAACTATTCCTTTCAACTGCTGTGTATGATGATTGTCTACAAATCTAAGATAAAT
TTGATAACTTG
Protein sequenceShow/hide protein sequence
MWAATVSFPSISNLRGPILLNNRLDGSKRLSLNLRASFFDYPLASRLMSFGISLCRDFKKPCVGDLPYSTNESRLQEEFSNFGEIAEVLVAKDRLTKRPKGYAFIQYTCQ
DDAMLALENMDCKKFDGRMIYVELAKPGSDSFGGYPRTSGPPEERPKVELEEVEDCWY