; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G00770 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G00770
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUBA domain-containing protein
Genome locationClcChr01:522258..525076
RNA-Seq ExpressionClc01G00770
SyntenyClc01G00770
Gene Ontology termsGO:0043162 - ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0000813 - ESCRT I complex (cellular component)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR009060 - UBA-like superfamily
IPR038870 - Ubiquitin-associated protein 1
IPR042575 - Ubiquitin-associated protein 1, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035731.1 hypothetical protein SDJN02_02529 [Cucurbita argyrosperma subsp. argyrosperma]7.6e-10173.29Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI
        MAYDYRNK G+Y+AHPPMYGP ASSSPSPS HPMYTQSMYPRIGQQ  SP  PVAR+SSHHHSSS +PSP+  SSSGLGIRVTIKPEYRITPPPQLSPQ+
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI

Query:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY
        GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEH PS+PVES+SSMGS GDPVVSKYVASGLSREAVS AVANYGDNPTK                  
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY

Query:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                         VQEFVKGYTLLREMGF S KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

XP_004145917.1 uncharacterized protein LOC101219735 [Cucumis sativus]4.3e-10474.91Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD
        MAYD+RN SGHYD+H PMY  TASSSPSPSPHPMY+ SMYPRIGQQAPS T PVARLSSHH+SSS SPSP+SSSGLGIRVTIKPEYRITPPPQLSPQ+GD
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD

Query:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL
        IPRSNFQFDFEFEKKVLAEAEKE PNWNRFGLEH P KPVESTSSMGSIGDPVVSKYVASGL+REAVSFAVANYGDNPTK                    
Subjt:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL

Query:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                       VQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

XP_008437562.1 PREDICTED: uncharacterized protein LOC103482940 isoform X1 [Cucumis melo]5.3e-10274.18Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD
        MAYD+RN SGHYD+H PMY  TASS+PSPSPHPMY+QSMYPRIGQQAPS T PVARLSSHHHSSS S SP+SSSGLGIRVTIKPEYRITPPPQLSPQ+GD
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD

Query:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL
        IPRSNF FDFEFEKKVLAEAEKE PNWNRFGLE LP KPVESTSSMGSIGDP VSKYVASGL+REAVSFAVANYGDNPTK                    
Subjt:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL

Query:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                       VQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

XP_022958203.1 uncharacterized protein LOC111459499 [Cucurbita moschata]7.6e-10173.29Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI
        MAYDYRNK G+Y+AHPPMYGP ASSSPSPS HPMYTQSMYPRIGQQ  SP  PVAR+SSHHHSSS +PSP+  SSSGLGIRVTIKPEYRITPPPQLSPQ+
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI

Query:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY
        GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEH PS+PVES+SSMGS GDPVVSKYVASGLSREAVS AVANYGDNPTK                  
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY

Query:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                         VQEFVKGYTLLREMGF S KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

XP_038874870.1 uncharacterized protein LOC120067367 [Benincasa hispida]6.4e-10877.7Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSS--SQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQI
        MAYDYRNK GHYDAHPPMYGPTASSSPSPSPHPMY+QSMYPRIGQQAPS T PVARLSSHHHSS  S SPSP+SSSGLGIRVTIKPEYRITPPPQLSPQI
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSS--SQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQI

Query:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVES-TSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVG
        GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVES TSSM SIGDPVVSKYVASGLSREAVSFAVANYGDNPTK                 
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVES-TSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVG

Query:  YGLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                          VQEFVKGYTLLREMGFSS+KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  YGLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

TrEMBL top hitse value%identityAlignment
A0A0A0KNB8 Uncharacterized protein2.1e-10474.91Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD
        MAYD+RN SGHYD+H PMY  TASSSPSPSPHPMY+ SMYPRIGQQAPS T PVARLSSHH+SSS SPSP+SSSGLGIRVTIKPEYRITPPPQLSPQ+GD
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD

Query:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL
        IPRSNFQFDFEFEKKVLAEAEKE PNWNRFGLEH P KPVESTSSMGSIGDPVVSKYVASGL+REAVSFAVANYGDNPTK                    
Subjt:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL

Query:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                       VQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

A0A1S3AUC1 uncharacterized protein LOC103482940 isoform X21.4e-10073.9Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD
        MAYD+RN SGHYD+H PMY  TASS+PSPSPHPMY+QSMYPRIGQQAPS T PVARLSSHHHSSS S SP+SSSGLGIRVTIKPEYRITPPPQLSPQ+GD
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD

Query:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL
        IPRSNF FDFEFEKKVLAEAEKE PNWNRFGLE LP KPVESTSSMGSIGDP VSKYVASGL+REAVSFAVANYGDNPTK                    
Subjt:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL

Query:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLG
                                       VQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLG
Subjt:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLG

A0A1S3AUG0 uncharacterized protein LOC103482940 isoform X12.6e-10274.18Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD
        MAYD+RN SGHYD+H PMY  TASS+PSPSPHPMY+QSMYPRIGQQAPS T PVARLSSHHHSSS S SP+SSSGLGIRVTIKPEYRITPPPQLSPQ+GD
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGD

Query:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL
        IPRSNF FDFEFEKKVLAEAEKE PNWNRFGLE LP KPVESTSSMGSIGDP VSKYVASGL+REAVSFAVANYGDNPTK                    
Subjt:  IPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGL

Query:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                       VQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  SDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

A0A6J1H2H7 uncharacterized protein LOC1114594993.7e-10173.29Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI
        MAYDYRNK G+Y+AHPPMYGP ASSSPSPS HPMYTQSMYPRIGQQ  SP  PVAR+SSHHHSSS +PSP+  SSSGLGIRVTIKPEYRITPPPQLSPQ+
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI

Query:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY
        GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEH PS+PVES+SSMGS GDPVVSKYVASGLSREAVS AVANYGDNPTK                  
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY

Query:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                         VQEFVKGYTLLREMGF S KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

A0A6J1K3U0 velvet complex subunit B3.1e-10072.56Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI
        MAYDYRNK G+Y+AHPPMYGP ASSSPSPS HPMYTQSMYPRIGQQ  SP  PV R+SSHHHSSS +PSP+  SSSGLGIRVTIKPEYRITPPPQLSPQ+
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPT--SSSGLGIRVTIKPEYRITPPPQLSPQI

Query:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY
        GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEH PS+PVES+SSMGS GDPVV+KYVASGLSREAVS AVANYGDNPTK                  
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGY

Query:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                         VQEFVKGYTLLREMGF S KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  GLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53330.1 Ubiquitin-associated/translation elongation factor EF1B protein6.1e-5649.29Show/hide
Query:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSP-HPMYTQSMYPRIGQQ-APSPTL--PVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSP
        M YDYRNKSG      PMYGP  S+SPSPS  HPMY    YP+IGQQ  P P    P  R SS  H++S       SSG+GIRV +KPEYRITPPPQL P
Subjt:  MAYDYRNKSGHYDAHPPMYGPTASSSPSPSP-HPMYTQSMYPRIGQQ-APSPTL--PVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSP

Query:  QIGDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVE-STSSMGSIG--DPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGW
        ++GDI RS+FQFDF  E+KVLAEAEK+ P+W++FG E+ P+K  E S SS+G +   D VV KY ASGL+REAV+ AVANYGDNPTK             
Subjt:  QIGDIPRSNFQFDFEFEKKVLAEAEKETPNWNRFGLEHLPSKPVE-STSSMGSIG--DPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGW

Query:  VVVGYGLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
                                              VQEF  G+T +REMGF +  V +AL M++NDTDKA+AH L G+S
Subjt:  VVVGYGLSDKLEEALETSAIMARTEICILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTACGATTACAGAAACAAGTCCGGCCACTACGATGCTCATCCGCCGATGTACGGCCCAACCGCTTCTTCTTCCCCATCTCCCTCTCCCCATCCCATGTATACACA
GTCAATGTACCCCAGGATCGGTCAACAAGCTCCTTCACCGACCCTTCCGGTGGCCCGTCTCTCTTCCCACCATCATTCTTCTTCTCAATCTCCATCACCTACTTCCTCTT
CAGGATTGGGCATCAGGGTTACCATTAAACCGGAATATCGAATTACTCCTCCGCCTCAATTATCTCCACAAATTGGAGATATTCCTCGGAGCAATTTCCAATTTGATTTT
GAGTTTGAGAAAAAGGTTTTAGCTGAAGCAGAGAAAGAAACTCCGAATTGGAATCGGTTTGGGCTGGAACATCTTCCTTCTAAGCCAGTGGAATCCACATCCTCAATGGG
TTCAATTGGAGATCCAGTTGTGAGCAAGTATGTTGCATCTGGCTTGAGTCGGGAAGCTGTTTCATTTGCTGTTGCTAATTATGGCGACAATCCTACCAAGGTAATTATGA
ACGATTGGTACACAGTGGTGCATGGTTGGGTTGTTGTCGGCTACGGGCTCAGCGATAAGTTAGAGGAAGCGCTGGAGACAAGTGCTATAATGGCTAGAACAGAGATATGT
ATACTTGATATGATTTATCACATATTAGATAACGTTCAAGAATTTGTAAAAGGCTACACACTTCTACGAGAAATGGGATTTTCTTCCATCAAGGTGGTCGAGGCATTGCT
CATGTATGACAATGACACTGATAAGGCTGTAGCTCACTTTCTTGGTGGTACGTCTTAA
mRNA sequenceShow/hide mRNA sequence
CTTGAAAAACAAATCTCAAAGAGAAATTTGTTGAAAATGAAATCCAAAATTTGAAAACATAAATTGGGAACGAAACATTATAACTTCAACACCTTGATTTGTCAATTGTC
CACCGTCTGTCCCAGAGCTTTTTCCTCTGTTCTCAATGGCGTACGATTACAGAAACAAGTCCGGCCACTACGATGCTCATCCGCCGATGTACGGCCCAACCGCTTCTTCT
TCCCCATCTCCCTCTCCCCATCCCATGTATACACAGTCAATGTACCCCAGGATCGGTCAACAAGCTCCTTCACCGACCCTTCCGGTGGCCCGTCTCTCTTCCCACCATCA
TTCTTCTTCTCAATCTCCATCACCTACTTCCTCTTCAGGATTGGGCATCAGGGTTACCATTAAACCGGAATATCGAATTACTCCTCCGCCTCAATTATCTCCACAAATTG
GAGATATTCCTCGGAGCAATTTCCAATTTGATTTTGAGTTTGAGAAAAAGGTTTTAGCTGAAGCAGAGAAAGAAACTCCGAATTGGAATCGGTTTGGGCTGGAACATCTT
CCTTCTAAGCCAGTGGAATCCACATCCTCAATGGGTTCAATTGGAGATCCAGTTGTGAGCAAGTATGTTGCATCTGGCTTGAGTCGGGAAGCTGTTTCATTTGCTGTTGC
TAATTATGGCGACAATCCTACCAAGGTAATTATGAACGATTGGTACACAGTGGTGCATGGTTGGGTTGTTGTCGGCTACGGGCTCAGCGATAAGTTAGAGGAAGCGCTGG
AGACAAGTGCTATAATGGCTAGAACAGAGATATGTATACTTGATATGATTTATCACATATTAGATAACGTTCAAGAATTTGTAAAAGGCTACACACTTCTACGAGAAATG
GGATTTTCTTCCATCAAGGTGGTCGAGGCATTGCTCATGTATGACAATGACACTGATAAGGCTGTAGCTCACTTTCTTGGTGGTACGTCTTAATTAAGAATTATGGAAAT
GGTTGTGTCAAGAATTCTGTTGTACTTCAACTATGACATCTTTGTATATGTAATATGACAATAGAGGCCTCGTGGGATTGTAAATGTGATAAATATAGGCATTTATAGTG
TATTGTATTCATTTACCTGATGGTGTTTCACCAATGCTTGCAGACTCTCAATGGCAGGTAATGATGATTACGTCTTATATTTTTCATGTCTTTGGATTGAATACTGTTGC
AGAATTAACTGATAGGTTGATTGCAAAGTTCAATGCTTGATAACTATTT
Protein sequenceShow/hide protein sequence
MAYDYRNKSGHYDAHPPMYGPTASSSPSPSPHPMYTQSMYPRIGQQAPSPTLPVARLSSHHHSSSQSPSPTSSSGLGIRVTIKPEYRITPPPQLSPQIGDIPRSNFQFDF
EFEKKVLAEAEKETPNWNRFGLEHLPSKPVESTSSMGSIGDPVVSKYVASGLSREAVSFAVANYGDNPTKVIMNDWYTVVHGWVVVGYGLSDKLEEALETSAIMARTEIC
ILDMIYHILDNVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFLGGTS