; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G09230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G09230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA repair protein RAD4 isoform X2
Genome locationClcChr01:10147538..10150178
RNA-Seq ExpressionClc01G09230
SyntenyClc01G09230
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006298 - mismatch repair (biological process)
GO:0000111 - nucleotide-excision repair factor 2 complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0071942 - XPC complex (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR004583 - DNA repair protein Rad4
IPR036985 - Transglutaminase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011655233.1 DNA repair protein RAD4 isoform X2 [Cucumis sativus]3.1e-9985.33Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        +ED GEAIPD GGSCSQTSTDR TLA+VSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLE ERCNEN+I SCS DVDV 
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDLDDSDWEDGCVR LDGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTA+SLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

XP_031741822.1 DNA repair protein RAD4 isoform X1 [Cucumis sativus]3.1e-9985.02Show/hide
Query:  ICLEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVD
        I +ED GEAIPD GGSCSQTSTDR TLA+VSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLE ERCNEN+I SCS DVD
Subjt:  ICLEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVD

Query:  VHEVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALL
        V EVNLQNS+SEVLEDLDDSDWEDGCVR LDGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALL
Subjt:  VHEVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALL

Query:  SLLPAHLLKISPTKQLTASSLKPLVTW
        SLLPAHLLKISP KQLTA+SLKPLV W
Subjt:  SLLPAHLLKISPTKQLTASSLKPLVTW

XP_031741823.1 DNA repair protein RAD4 isoform X3 [Cucumis sativus]3.1e-9985.33Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        +ED GEAIPD GGSCSQTSTDR TLA+VSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLE ERCNEN+I SCS DVDV 
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDLDDSDWEDGCVR LDGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTA+SLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

XP_038874851.1 DNA repair protein RAD4 isoform X1 [Benincasa hispida]1.4e-9980.32Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        +EDAG+AIPDSGGSCSQTSTDRGTLANVSR+AVGKLLSRASGR LSG RKHALHPCDL   PKSTVG+D N AMD KV LEAE C EN+IVSCS D DV 
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDP---------
        EVNLQN +SEVLEDLDDSDWEDGCV TLDGTES PLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDP         
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDP---------

Query:  ---------------LIQSALLSLLPAHLLKISPTKQLTASSLKPLVTW
                       L+QSALLSLLPAHLLKISP KQLTASSLKPLVTW
Subjt:  ---------------LIQSALLSLLPAHLLKISPTKQLTASSLKPLVTW

XP_038874852.1 DNA repair protein RAD4 isoform X2 [Benincasa hispida]1.2e-10388.89Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        +EDAG+AIPDSGGSCSQTSTDRGTLANVSR+AVGKLLSRASGR LSG RKHALHPCDL   PKSTVG+D N AMD KV LEAE C EN+IVSCS D DV 
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQN +SEVLEDLDDSDWEDGCV TLDGTES PLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDP+IQSALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTASSLKPLVTW
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

TrEMBL top hitse value%identityAlignment
A0A0A0KQC2 Uncharacterized protein1.5e-9985.33Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        +ED GEAIPD GGSCSQTSTDR TLA+VSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLE ERCNEN+I SCS DVDV 
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDLDDSDWEDGCVR LDGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTA+SLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

A0A1S3CCP3 DNA repair protein RAD4 isoform X47.3e-9984.89Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        ++DAGEAIPD GGSCSQTS DR TLANVSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLEAERCNEN+  SCS DVDVH
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDL DSDWEDGCV+T DGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEI EFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTASSLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

A0A1S3CDX3 DNA repair protein RAD4 isoform X51.5e-9684Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        ++DAGEAIPD GGSCSQTS DR   ANVSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLEAERCNEN+  SCS DVDVH
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDL DSDWEDGCV+T DGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEI EFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTASSLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

A0A5A7V3W6 DNA repair protein RAD4 isoform X36.4e-9575Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRG-------------------------------TLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGED
        ++DAGEAIPD GGSCSQTS DRG                               TLANVSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+D
Subjt:  LEDAGEAIPDSGGSCSQTSTDRG-------------------------------TLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGED

Query:  VNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFV
        VN AMD KVTLEAERCNEN+  SCS DVDVHEVNLQNS+SEVLEDL DSDWEDGCV+T DGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEI EFV
Subjt:  VNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFV

Query:  HKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKISPTKQLTASSLKPLVTW
        HKVHLLCLLGRGRLIDRACNDPLIQ+ALLSLLPAHLLKISP KQLTASSLKPLV W
Subjt:  HKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKISPTKQLTASSLKPLVTW

A0A5D3DT68 DNA repair protein RAD4 isoform X27.3e-9984.89Show/hide
Query:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH
        ++DAGEAIPD GGSCSQTS DR TLANVSRVAV KLLSRASGR LSG RKHAL PCDL    KST+G+DVN AMD KVTLEAERCNEN+  SCS DVDVH
Subjt:  LEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVH

Query:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL
        EVNLQNS+SEVLEDL DSDWEDGCV+T DGTESQPLTIE SE+Q+ PDST+RKPIRRASAADKEI EFVHKVHLLCLLGRGRLIDRACNDPLIQ+ALLSL
Subjt:  EVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSL

Query:  LPAHLLKISPTKQLTASSLKPLVTW
        LPAHLLKISP KQLTASSLKPLV W
Subjt:  LPAHLLKISPTKQLTASSLKPLVTW

SwissProt top hitse value%identityAlignment
Q8W489 DNA repair protein RAD42.0e-2939.63Show/hide
Query:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL
        S++ +    LA  SRVAV K+L ++S R   G +K     CD  ++ K   G+        K  L+A   +  +     G+VD  E+N            
Subjt:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL

Query:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI
         DSDWED  + +LD T       +++ LTIEF +    PD+ ++K   RA+A DK  AE VHKVHLLCLL RGR++D ACNDPLIQ+ALLSLLP++L K+
Subjt:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI

Query:  SPTKQLTASSLKPLVTW
        S  +++T   + PL+ W
Subjt:  SPTKQLTASSLKPLVTW

Arabidopsis top hitse value%identityAlignment
AT5G16630.1 DNA repair protein Rad4 family1.4e-3039.63Show/hide
Query:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL
        S++ +    LA  SRVAV K+L ++S R   G +K     CD  ++ K   G+        K  L+A   +  +     G+VD  E+N            
Subjt:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL

Query:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI
         DSDWED  + +LD T       +++ LTIEF +    PD+ ++K   RA+A DK  AE VHKVHLLCLL RGR++D ACNDPLIQ+ALLSLLP++L K+
Subjt:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI

Query:  SPTKQLTASSLKPLVTW
        S  +++T   + PL+ W
Subjt:  SPTKQLTASSLKPLVTW

AT5G16630.2 DNA repair protein Rad4 family1.4e-3039.63Show/hide
Query:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL
        S++ +    LA  SRVAV K+L ++S R   G +K     CD  ++ K   G+        K  L+A   +  +     G+VD  E+N            
Subjt:  SQTSTDRGTLANVSRVAVGKLLSRASGRSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDL

Query:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI
         DSDWED  + +LD T       +++ LTIEF +    PD+ ++K   RA+A DK  AE VHKVHLLCLL RGR++D ACNDPLIQ+ALLSLLP++L K+
Subjt:  DDSDWEDGCVRTLDGT-------ESQPLTIEFSEMQQTPDSTRRKPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKI

Query:  SPTKQLTASSLKPLVTW
        S  +++T   + PL+ W
Subjt:  SPTKQLTASSLKPLVTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGCGCAAGGCTGTTGCAAAAGGCAGAAAAATCATGCTTCAATATCTGCAGTGCTAGCGTTGAGATGCTGTATATAGCATCTCCATGCTACCACTTGGACACCTC
AGCTGAATGCAGGACCTATGCTGAACACTACTTTTGGCAGTATTGCGATACTGCAATGATTTTGATAAATATGTTGTTACTTTTGAGGATTTGTTTGGAAGATGCTGGTG
AGGCCATACCAGACTCAGGAGGAAGCTGTTCACAGACCAGTACTGACAGAGGAACTTTAGCCAATGTTTCAAGGGTGGCCGTGGGCAAACTTCTAAGTCGTGCATCTGGA
CGTTCCTTGTCAGGAACAAGGAAACATGCTCTGCATCCATGTGATTTGGTCAGAAAGCCAAAATCTACAGTTGGAGAAGATGTAAATCCTGCTATGGACAATAAGGTGAC
ATTAGAGGCTGAGAGGTGCAATGAAAATATAATAGTTAGCTGTTCTGGGGACGTTGATGTTCATGAAGTAAATTTGCAGAATTCTATATCAGAAGTCTTAGAAGATTTGG
ATGATTCTGATTGGGAAGATGGTTGTGTTCGCACTTTGGATGGGACAGAGTCTCAACCATTGACTATTGAGTTTAGTGAGATGCAGCAGACCCCTGACTCTACCAGGAGG
AAACCTATTCGTCGAGCTTCTGCTGCTGATAAGGAAATTGCTGAGTTTGTGCATAAAGTTCATCTGCTTTGTTTACTTGGACGGGGCAGATTAATTGACCGAGCTTGCAA
TGACCCTCTTATTCAGTCTGCTTTGCTTTCTCTTCTTCCAGCACACTTGCTGAAGATCTCACCTACCAAGCAACTGACAGCCAGCTCTCTGAAACCCCTGGTTACTTGGC
CCGAGCTTGAAGCACCGTTTGCACAAATGGAAGTCAAAAGTTGCTCATTGAAAGACATTCACTTCTGTATTTGGCGTGAATTAGAAGAATTCAACATTGAAGACATGGAA
GCCAGAAGGAAGATAGTCCTCTCATTATCACAAGCCGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGCGCAAGGCTGTTGCAAAAGGCAGAAAAATCATGCTTCAATATCTGCAGTGCTAGCGTTGAGATGCTGTATATAGCATCTCCATGCTACCACTTGGACACCTC
AGCTGAATGCAGGACCTATGCTGAACACTACTTTTGGCAGTATTGCGATACTGCAATGATTTTGATAAATATGTTGTTACTTTTGAGGATTTGTTTGGAAGATGCTGGTG
AGGCCATACCAGACTCAGGAGGAAGCTGTTCACAGACCAGTACTGACAGAGGAACTTTAGCCAATGTTTCAAGGGTGGCCGTGGGCAAACTTCTAAGTCGTGCATCTGGA
CGTTCCTTGTCAGGAACAAGGAAACATGCTCTGCATCCATGTGATTTGGTCAGAAAGCCAAAATCTACAGTTGGAGAAGATGTAAATCCTGCTATGGACAATAAGGTGAC
ATTAGAGGCTGAGAGGTGCAATGAAAATATAATAGTTAGCTGTTCTGGGGACGTTGATGTTCATGAAGTAAATTTGCAGAATTCTATATCAGAAGTCTTAGAAGATTTGG
ATGATTCTGATTGGGAAGATGGTTGTGTTCGCACTTTGGATGGGACAGAGTCTCAACCATTGACTATTGAGTTTAGTGAGATGCAGCAGACCCCTGACTCTACCAGGAGG
AAACCTATTCGTCGAGCTTCTGCTGCTGATAAGGAAATTGCTGAGTTTGTGCATAAAGTTCATCTGCTTTGTTTACTTGGACGGGGCAGATTAATTGACCGAGCTTGCAA
TGACCCTCTTATTCAGTCTGCTTTGCTTTCTCTTCTTCCAGCACACTTGCTGAAGATCTCACCTACCAAGCAACTGACAGCCAGCTCTCTGAAACCCCTGGTTACTTGGC
CCGAGCTTGAAGCACCGTTTGCACAAATGGAAGTCAAAAGTTGCTCATTGAAAGACATTCACTTCTGTATTTGGCGTGAATTAGAAGAATTCAACATTGAAGACATGGAA
GCCAGAAGGAAGATAGTCCTCTCATTATCACAAGCCGTCTAG
Protein sequenceShow/hide protein sequence
MESARLLQKAEKSCFNICSASVEMLYIASPCYHLDTSAECRTYAEHYFWQYCDTAMILINMLLLLRICLEDAGEAIPDSGGSCSQTSTDRGTLANVSRVAVGKLLSRASG
RSLSGTRKHALHPCDLVRKPKSTVGEDVNPAMDNKVTLEAERCNENIIVSCSGDVDVHEVNLQNSISEVLEDLDDSDWEDGCVRTLDGTESQPLTIEFSEMQQTPDSTRR
KPIRRASAADKEIAEFVHKVHLLCLLGRGRLIDRACNDPLIQSALLSLLPAHLLKISPTKQLTASSLKPLVTWPELEAPFAQMEVKSCSLKDIHFCIWRELEEFNIEDME
ARRKIVLSLSQAV