; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G032190 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G032190
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationCicolChr02:27865619..27872446
RNA-Seq ExpressionCcUC02G032190
SyntenyCcUC02G032190
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048059.1 universal stress protein A-like protein isoform X2 [Cucumis melo var. makuwa]6.8e-5571.69Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE-----------------------------------QVKVE
        MG+VSERERKI+VAVDE EESLYALSWCLKNVI +NSKDTLILL+ARPPRPIYTA+DGT+                                    VKVE
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE-----------------------------------QVKVE

Query:  TRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSST
        TRVESGDARDVICQMVEKLGAD+LV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSS T
Subjt:  TRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSST

XP_004144759.1 universal stress protein A-like protein isoform X2 [Cucumis sativus]9.8e-5470.48Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MG+VSERERKI+VAVDE EESLYALSWCLKNV+ QNSKDTLILL+ARPPRPIYTA+DGT                                    VKVET
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
        RVESGDARDVICQ+VEKLGA +LV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSS  S
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

XP_022946419.1 universal stress protein PHOS32-like [Cucurbita moschata]6.3e-5370.12Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MGDVSERERKILVAVDE EESLYALSWCL+NVISQNSKDTLILL+ARPPRPIYTA+DGT                                    VKVET
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS
        RVESGDARDVIC+MVEKLGADVLV G HGYGPIKRA IGSVSNHCAK VKCP+LIVKKPK+S++
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS

XP_038890239.1 universal stress protein A-like protein isoform X1 [Benincasa hispida]8.8e-5568.33Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE----------------------------------------
        MGDVSERERKILVAVDE EESLYALSWCLKNVISQNSKDTLILL+ARPPRPIYTALDGT+                                        
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE----------------------------------------

Query:  --------QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
                 VKVE+RVESGDARDVICQMVEKLGADVLV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNS+  S
Subjt:  --------QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

XP_038890240.1 universal stress protein A-like protein isoform X2 [Benincasa hispida]3.6e-5674.1Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MGDVSERERKILVAVDE EESLYALSWCLKNVISQNSKDTLILL+ARPPRPIYTALDGT                                    VKVE+
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
        RVESGDARDVICQMVEKLGADVLV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNS+  S
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

TrEMBL top hitse value%identityAlignment
A0A0A0LIZ3 Usp domain-containing protein1.2e-5265Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE----------------------------------------
        MG+VSERERKI+VAVDE EESLYALSWCLKNV+ QNSKDTLILL+ARPPRPIYTA+DGT+                                        
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE----------------------------------------

Query:  --------QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
                 VKVETRVESGDARDVICQ+VEKLGA +LV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSS  S
Subjt:  --------QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

A0A5A7U3N2 Universal stress protein A-like protein isoform X23.3e-5571.69Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE-----------------------------------QVKVE
        MG+VSERERKI+VAVDE EESLYALSWCLKNVI +NSKDTLILL+ARPPRPIYTA+DGT+                                    VKVE
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTE-----------------------------------QVKVE

Query:  TRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSST
        TRVESGDARDVICQMVEKLGAD+LV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSS T
Subjt:  TRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSST

A0A6J1DUV9 universal stress protein A-like protein9.8e-5268.94Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MGDVSERER+ILVAVDE EESLYALSWCLKNVISQNS+D L+LL+ARPPRP+YTALDGT                                  + VKVET
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKN
        +V SGDAR+VICQMV+KL ADVLV GSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK+
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKN

A0A6J1G3Q3 universal stress protein PHOS32-like3.1e-5370.12Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MGDVSERERKILVAVDE EESLYALSWCL+NVISQNSKDTLILL+ARPPRPIYTA+DGT                                    VKVET
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS
        RVESGDARDVIC+MVEKLGADVLV G HGYGPIKRA IGSVSNHCAK VKCP+LIVKKPK+S++
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS

A0A6J1KHF4 universal stress protein PHOS32-like3.1e-5370.12Show/hide
Query:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET
        MGDVSERERKILVAVDE EESLYALSWCL+NVISQNSKDTLILL+ARPPRPIYTA+DGT                                    VKVET
Subjt:  MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT----------------------------------EQVKVET

Query:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS
        RVESGDARDVIC+MVEKLGADVLV G HGYGPIKRA IGSVSNHCAK VKCP+LIVKKPK+S++
Subjt:  RVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSS

SwissProt top hitse value%identityAlignment
Q57951 Universal stress protein MJ05312.1e-0640Show/hide
Query:  VKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK
        VK+ T +  G   + I +  EK  AD++V G+ G   ++R  +GSV+    K+  CPVL+VKKPK
Subjt:  VKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK

Q8L4N1 Universal stress protein PHOS341.1e-0727.44Show/hide
Query:  RKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARP-----------------------------PRPIYTALDGTEQVKVETR------------
        RKI VAVD SEES +A+ W + + I     D +++LH  P                             P+P     D     KV               
Subjt:  RKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARP-----------------------------PRPIYTALDGTEQVKVETR------------

Query:  ---VESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAF---IGSVSNHCAKSVKCPVLIVKKP
           V+  D R+ +C   E+L    ++ GS G+G  KR     +GSVS++C     CPV++V+ P
Subjt:  ---VESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAF---IGSVSNHCAKSVKCPVLIVKKP

Q8LGG8 Universal stress protein A-like protein2.0e-0942.86Show/hide
Query:  VKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKK
        V  E  +++GD +DVICQ V+++  D LV GS G G  ++ F+G+VS  C K  +CPV+ +K+
Subjt:  VKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKK

Q8VYN9 Universal stress protein PHOS325.8e-0928.75Show/hide
Query:  RKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARP------------------------PRPIYTALDGTEQVKVETR---------------VE
        RKI VAVD SEES +A+ W + + I     D ++LLH  P                        P+P     D     KV                  V+
Subjt:  RKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARP------------------------PRPIYTALDGTEQVKVETR---------------VE

Query:  SGDARDVICQMVEKLGADVLVTGSHGYGPIKR----AFIGSVSNHCAKSVKCPVLIVKKP
          D R+ +C  +E+LG   ++ GS G+G  K+      +GSVS++C     CPV++V+ P
Subjt:  SGDARDVICQMVEKLGADVLVTGSHGYGPIKR----AFIGSVSNHCAKSVKCPVLIVKKP

Arabidopsis top hitse value%identityAlignment
AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.9e-1831.9Show/hide
Query:  RKILVAVDESEESLYALSW--------CLKNVISQNSKDTLILLHARPPRPIYTALDG-------------------------------------TEQVK
        ++++VA+DES+ S YAL W         L    ++     L ++H + P   + A                                         +Q++
Subjt:  RKILVAVDESEESLYALSW--------CLKNVISQNSKDTLILLHARPPRPIYTALDG-------------------------------------TEQVK

Query:  VETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK
         ET V  G+A+++IC+ VEK+  D+LV GS G G IKRAF+GSVS++CA    CP+LIVK PK
Subjt:  VETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.4e-1831.71Show/hide
Query:  RKILVAVDESEESLYALSW--------CLKNVISQNSKDTLILLHARPPRPIYTALDG--------------------------------------TEQV
        ++++VA+DES+ S YAL W         L    ++     L ++H + P   + A                                          +Q+
Subjt:  RKILVAVDESEESLYALSW--------CLKNVISQNSKDTLILLHARPPRPIYTALDG--------------------------------------TEQV

Query:  KVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK
        + ET V  G+A+++IC+ VEK+  D+LV GS G G IKRAF+GSVS++CA    CP+LIVK PK
Subjt:  KVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPK

AT3G58450.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.2e-1934.3Show/hide
Query:  SERERKILVAVDESEESLYALSWC---LKNVISQNSK-----DTLILLHARP----------------------PRPIYTALDGT--------------E
        ++++ K++VA+DES+ S  AL W    L+ VIS   +       L LLH  P                      P P+  A + +              +
Subjt:  SERERKILVAVDESEESLYALSWC---LKNVISQNSK-----DTLILLHARP----------------------PRPIYTALDGT--------------E

Query:  QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
         VK ET +  GD +++ICQ VE+   D+LV GS G G IKRAF+GSVS++CA+  KCP+LIV+ P+ +S+++
Subjt:  QVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

AT3G58450.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.4e-2035.76Show/hide
Query:  SERERKILVAVDESEESLYALSWC---LKNVISQNSK-----DTLILLHARP---------------PRPIYTALDGT--------------EQVKVETR
        ++++ K++VA+DES+ S  AL W    L+ VIS   +       L LLH  P               P P+  A + +              + VK ET 
Subjt:  SERERKILVAVDESEESLYALSWC---LKNVISQNSK-----DTLILLHARP---------------PRPIYTALDGT--------------EQVKVETR

Query:  VESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS
        +  GD +++ICQ VE+   D+LV GS G G IKRAF+GSVS++CA+  KCP+LIV+ P+ +S+++
Subjt:  VESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKKPKNSSSTS

AT3G62550.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.7e-3045.1Show/hide
Query:  RERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT-----------------------------------EQVKVETRVESG
        +ERKI+VAVDESEES+ ALSW L N+    S +TLILL+ +PP P+Y++LD                                       + +E RV  G
Subjt:  RERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGT-----------------------------------EQVKVETRVESG

Query:  DARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKK
        DA++VIC  V+KL  D+LV G+H YG  KRA +GSVS +CAK VKCPV+IVKK
Subjt:  DARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNHCAKSVKCPVLIVKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACGTGTCAGAACGTGAGAGGAAGATCCTTGTGGCCGTCGATGAAAGCGAGGAGAGCTTGTATGCGCTATCTTGGTGTTTGAAGAATGTAATTTCTCAAAATTC
CAAAGATACCCTCATCCTCCTCCACGCCCGGCCACCCCGCCCCATTTACACCGCCTTGGATGGCACAGAACAGGTGAAGGTGGAGACGAGAGTGGAAAGTGGAGATGCAA
GAGATGTGATATGTCAGATGGTGGAGAAATTGGGTGCTGACGTGTTGGTAACGGGTAGCCATGGCTATGGTCCAATTAAAAGGGCATTTATTGGGAGTGTGAGCAACCAT
TGTGCAAAGAGTGTAAAGTGCCCTGTTTTAATTGTAAAGAAGCCCAAAAACTCATCATCAACTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACGTGTCAGAACGTGAGAGGAAGATCCTTGTGGCCGTCGATGAAAGCGAGGAGAGCTTGTATGCGCTATCTTGGTGTTTGAAGAATGTAATTTCTCAAAATTC
CAAAGATACCCTCATCCTCCTCCACGCCCGGCCACCCCGCCCCATTTACACCGCCTTGGATGGCACAGAACAGGTGAAGGTGGAGACGAGAGTGGAAAGTGGAGATGCAA
GAGATGTGATATGTCAGATGGTGGAGAAATTGGGTGCTGACGTGTTGGTAACGGGTAGCCATGGCTATGGTCCAATTAAAAGGGCATTTATTGGGAGTGTGAGCAACCAT
TGTGCAAAGAGTGTAAAGTGCCCTGTTTTAATTGTAAAGAAGCCCAAAAACTCATCATCAACTTCTTAAGTTTGTTTCCATTTCTCTCTTTAATTTTCTTTCCTTACCCC
TATCATTTTGTACAAAACCCTTTTCTTTTTAAATTATATATGTGAAGTTTGTGTAACTATAAATTATCTTAGCAACGAAATTTTACTTTGCTTTATATATAATTATTTCA
ATAGAAAAGTAAGTGATGAGAATTAATGTCTAACGTAAATATTGTTCAACTGATTTTGTAGTAATGAATTCTCGATCAAAAGGTTAGAGGTTTGAATTCACCTCTCTACA
TATTGTTGAACTAAAAAAATGATCGATTTCTATAGGTAGAGTTTTGTTACATTTTTGTTTATTTATTTATTTTAATAAAGTTAGTCTCTATGACAATTTTCTTTATAAAT
TGAAAAAAAAAAATATTGTAGAGAAGTTCATTTATTGCATGCTAATTATTGTTATTATTTAATTAATTTTAATGAAAATTAATTTTAAAGTACTAAATTTAGAATTTATT
GAAAATACATAGGAAAAATAAAAGTATAGGGATCACAAAACTCAAATTATAGGAATCAAAATAGCATTTTAACATTTTAAATATTATCAAGGAAGTGATAGATAGATCAA
AGTGAGTTTAACTCAACAGTAATTGACATGTAATCTATTCGTTCAAGTTGGATGTTTGATCTCCGATCTTAACAATTGTTGTACTAAAAAGTATAAAAAATAATAGGAAC
TATCCTCATCAAGTCATTTGCATTTCTTAATATAATTAGTTAAAATGAAAAAATAATTAATATCAATTATAAACGTAAGTAGATATTGTGTTCCATAATATAGAGTTATG
GAACATGATTAGACTAAAAAGACTAGAGGTTATAAAGATGATTTAAAATTAATATTATGAATCTACCTTCATTCTCCATTGTTGGTAGCATTGTTTTTCATTTCACTTTT
TTAGATCCTACTTTTAAATTTCATAATGAGATAAGAATAAGATACTTGCAATCATTGATCTTTTAACACATAACTATAATAATTAGTATGGACAAATAATAATAATAAAG
AAAACTATAACGATTGGTAGGAGTGACAACTATTGACTTGCTTTCTCTAACATCTTTTCTCATACTAATTTGCCACAACTTTTTAGTTTAAGAGTTCTTTTTTTTTTTTT
CTTTCTTTCTTTCTTTCTTTTTTCCCTTTTCTTTCTTTCTTTCTTTCATGGATCAACACAAACATGGAAGCAACCTAACTTTATGCAATTCTATAAAATATATATATCTT
CTAAGTAGGCTTTTTACTAAAGCAAAAGGGCCAAGAAGAGTTCATAGTTTAGAGATATGATGGTACTTTATTGAAAAAGTTTGAGATACTTATAAAGTCTAAATAAATAG
GAATGTTTTTAAATATAGCAAAATCAGTCAAATTATTTATAAATATAGAAAAATTTTACTGTCTATCAACAATAGACATCTATTGCTTAAATGATAGACTACTATAGACT
TCTATCACTTTTATCACTCAAGCGATGGAAATCTATTGTGGTATATCGCTGATAGACAGTGAAAATTTTCTATATTTGTAAATAGTTTGATGTTTTTTTTTCTATTTATA
ATAATTTTTCAAATAAATAAGTTTGTTTTTTACATGAGTAGCCTAATAGACTATTATTCTAAGTTGGTCATCGATGGTGGTGGTCAAATGTGGCCAGAGTGACCAACGGG
GCAACGATAGTGATGAGAGTTGAATGCACCGTTGGTGGACATGAGTATAAAATATTAGTGCATACAAAATTGACCCCTTGGACATTTAGTCGAGTTCATATCTCAACTCA
TAGCAACTCAATCCTCACCCCAAGCACCCCTTTTTAGTCATCTAAAATCATTTAGGAATAAATTAGAGTTTATTAGTAGGGAATCTAAAAATAGAAATTTGAACACGAAA
TTCAATGAAAATAGAATTGTATTTCATGTCTTCAGATATATGTTTGGTAGTAGATTCAAAAATTAAATTTTGATTTAAACAATTGTTCAAAATGTGTTTCATAGTGTATT
TATGAACCATAAAACTAATGGTAGTTATTTGCTAAATATTAAGTTAAATAATATAAGCTATTAGTAGTTAATTTATGAACCTGTTTTATATTACAAAATTTTGTATAGTT
ATTATTTTAGAATATCATATAACTTATGGTATATTATATAATACATATTTTAATTTAACAAAAACAATTGAGCTTATAATATGAAATTATAATTTTGCATTTTAAATTTT
GAGCAAATTGCAAAAACTACCCCTAAAGTATGATGGTAATTAGAATTATACCCTTAAACTTTCAATTTTAAAACATTGAGCTTGCAAACTTAAATGGGTATTAAAATTAG
GTCAAAGTAAACTTTCAAACTTACATAATTGTATAAACTGTAACATTTTTATAAGTTTGGTGGCTCAAATTTATCATTTGTAGCTCCAATTTGTAAAATTATGCTAAGCT
CGAGGGTTAATTTCAACACTTAAGTTTAGGAGACCAATTTTTACAATTGAAAGTTTTGAGAGTATAATTGTAACTATCATCGTACTTTAAGGAGTGGTTTTTGCAATTTG
CATTTATATTTTCAAATTTATTTAAAACATAATAGTGTTATTTTCAAAATTTAATCACATAATATTGAAAACATTCCCTAAAAATAGAATCTAGACTGCCTACCAAATAC
ATATTTCCCAACCTCAATAATCTGATAACATAAAACGGAATCCAAAAAATATATATATATATATATTTTTTCCAAAATTCATCTCAAACACACCATTACTTTCAACCCAC
TCTAATACACTCACACATTATAACTATTGTTTGCAATTTGCTTGTCTAAGGATGGCATGATAGATTCTTTCAGATTGAAAAGCAGGGTGATTGATGGTGATTCTTACCGG
CATTCAATCTGAACTACGGAAAACGAAAAAAACAAAGAAGAAGGAGAATCGAATTAGCATTCATTGGAAAAAAGATCGAATCGATAAGGAAAGGAAAAGAGGAAAAAACG
TATAAATAAAATATAAAAATTCCCTTGCACCCTCCGCACCAAACTTGGGAAATTGTCAGAAATAGCATGTTGAAGTTTTCATCTTACAAATAGCATGGTGTAGTTTTCAT
CCTACAAAACTAGCACCTCTTTTAGAACTCGTTCAAACAGATTTTCTTAGTGCATCTCAAGTGACATCGACCATTTAGTTAAAAAATTAACTCATCGATCTTATACCAAA
TCTTATATATTTTCAAAATTTCTCCAAATTCAGTTATCCTTATCAAATATTATATATAATTTTTTAAATCTTTAGCCTAATTTCTAATTCGAAATCAATTTATCAATTAT
CCGAAG
Protein sequenceShow/hide protein sequence
MGDVSERERKILVAVDESEESLYALSWCLKNVISQNSKDTLILLHARPPRPIYTALDGTEQVKVETRVESGDARDVICQMVEKLGADVLVTGSHGYGPIKRAFIGSVSNH
CAKSVKCPVLIVKKPKNSSSTS