CuGenDBv2

Tan0014880 (gene) of Snake gourd v1 genome

Gene ID: Tan0014880
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glutamic acid-rich protein isoform X1
Genome location: LG11:16066601..16071973
RNA-Seq Expression: Tan0014880
Synteny: Tan0014880
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR019351 - Protein of unknown function DUF2039


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602535.1 Eukaryotic translation initiation factor 3 subunit M, partial [Cucurbita argyrosperma subsp. sororia]; e value: 5.6e-83; % identity: 82.79
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MSSK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E  DDNE TDDTDED      EDE
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE

Query:  DECENEEKDKVEDEK
        DECENEEKDK E+E+
Subjt:  DECENEEKDKVEDEK

KAG7033212.1 hypothetical protein SDJN02_07266 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 5.6e-83; % identity: 82.79
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MSSK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E  DDNE TDDTDED      EDE
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE

Query:  DECENEEKDKVEDEK
        DECENEEKDK E+E+
Subjt:  DECENEEKDKVEDEK

XP_008459394.1 PREDICTED: uncharacterized protein LOC103498541 [Cucumis melo]; e value: 4.1e-86; % identity: 84.13
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MS+KQGPPKHQN+YAWKP AG+KINETEVGGRFRP S+ITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLC GCAKEQGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE
        VDQTVGRD SEVEAEQKMLQEAIKNARERDRRTLLRAM+KGK+K+SN+NKSAV E+TKV +SI S TE+  E+ R EDDNEITDDTD+D+Y  E+EDE E
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE

Query:  CENEEKDK
        CENEE DK
Subjt:  CENEEKDK

XP_022954477.1 glutamic acid-rich protein isoform X1 [Cucurbita moschata]; e value: 7.3e-83; % identity: 82.41
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MSSK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E   DDNE TDDTDED      ED
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED

Query:  EDECENEEKDKVEDEK
        EDECENEEKDK E+E+
Subjt:  EDECENEEKDKVEDEK

XP_023538167.1 glutamic acid-rich protein [Cucurbita pepo subsp. pepo]; e value: 7.3e-83; % identity: 82.41
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MSSK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E   DDNE TDDTDED      ED
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED

Query:  EDECENEEKDKVEDEK
        EDECENEEKDK E+E+
Subjt:  EDECENEEKDKVEDEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSJ8 Uncharacterized protein; e value: 1.7e-82; % identity: 81.78
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MS+KQGPPKHQNKYAWKP AG+KINETEVGGRFRP S+ITGVCLRCK+QIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLC GCAKEQGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE
        VDQTVGRD SEVEAEQKMLQEAIKNARERDRRTLLRAM+KGK+K+SN+NKSAV+E+TK  +SI S TE   E+ R EDDNE TDDTD D+Y  E+EDE E
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE

Query:  CENEEKDKVEDEKE
        CENE   K ED KE
Subjt:  CENEEKDKVEDEKE

A0A1S3CAL0 uncharacterized protein LOC103498541; e value: 2.0e-86; % identity: 84.13
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MS+KQGPPKHQN+YAWKP AG+KINETEVGGRFRP S+ITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLC GCAKEQGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE
        VDQTVGRD SEVEAEQKMLQEAIKNARERDRRTLLRAM+KGK+K+SN+NKSAV E+TKV +SI S TE+  E+ R EDDNEITDDTD+D+Y  E+EDE E
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE

Query:  CENEEKDK
        CENEE DK
Subjt:  CENEEKDK

A0A6J1BX54 uncharacterized protein LOC111006305; e value: 2.0e-78; % identity: 81.31
Query:  SKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCRVD
        +K GPPKHQN+YAWKP AG KINETEVGGRFRP S+ITGVCLRCK+QIDWKRRYGKYKPL+EPAKCQLCSKRAVRQAYHNLC GCAKEQGVCAKCRCRVD
Subjt:  SKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCRVD

Query:  QTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE
         TVGRD+SEVEAEQKMLQEAI+NARERD+RTLLRAM+KGKSKTS+++KSAVKE+TKV +  P S EE  +L RKEDDN+ITD ++ED  E+EDEDE+E
Subjt:  QTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDE

A0A6J1GR32 glutamic acid-rich protein isoform X1; e value: 3.5e-83; % identity: 82.41
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        MSSK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E   DDNE TDDTDED      ED
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE---DDNEITDDTDEDDYEDEDED

Query:  EDECENEEKDKVEDEK
        EDECENEEKDK E+E+
Subjt:  EDECENEEKDKVEDEK

A0A6J1JP90 ribosome biogenesis protein BOP1 homolog isoform X1; e value: 4.6e-83; % identity: 82.33
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        M+SK G PKHQNKYAWKP AG+KINETEVGGRFRP SEITGVCLRCK+QIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLC GCAK+QGVCAKCRCR
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE
        VDQT+GRD SEVEAEQKMLQEAIKNARERD+RTLLRAM+KGKSKTSN+NKSA  E++K  +SIPSSTEE   L R E  DDNE TD TDED Y    EDE
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKE--DDNEITDDTDEDDYEDEDEDE

Query:  DECENEEKDKVEDEK
        DECENEEKDK E+E+
Subjt:  DECENEEKDKVEDEK

SwissProt top hits (e value, % identity, alignment)
Q68FU5 Uncharacterized protein C9orf85 homolog; e value: 6.3e-13; % identity: 31.52
Query:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV
        MSS++G      P KHQN + +K  K  + +   ++  +        GVC RCKE ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA +  V
Subjt:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV

Query:  CAKCRCRVDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTK
        CAKC  + ++ V   + E E  +    E         RR+  R  D  +   +  +     EDT+
Subjt:  CAKCRCRVDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTK

Q96MD7 Uncharacterized protein C9orf85; e value: 2.3e-15; % identity: 39.45
Query:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV
        MSS++G      P KHQN +++K  K  + +   ++  +        GVC RCKE ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV

Query:  CAKCRCRVD
        CAKC  + D
Subjt:  CAKCRCRVD

Q9CQ90 Uncharacterized protein C9orf85 homolog; e value: 3.0e-15; % identity: 40.38
Query:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV
        MSS++G      P KHQN + +K  K  + +   ++  +        GVC RCKE ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSSKQG------PPKHQNKYAWK-PKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Arabidopsis top hits (e value, % identity, alignment)
AT3G02220.1 unknown protein; e value: 1.2e-51; % identity: 57.08
Query:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR
        M S+QGPPKHQNK+AW PKAG KINETEVGGRFRP SEITGVC RC+EQI WKR+YGKYK L+E  KCQ C+KR VRQAYH LC GCAKEQ VCAKC   
Subjt:  MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCR

Query:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPS-STEELTELDRKED----DNEITDDTDEDDYEDED
        VDQ +GRD  EVEAEQK+L E IKNARERDRRTLLRAM+K      +  +++  + +KV +  PS S EE      +         + D   +D    E 
Subjt:  VDQTVGRDSSEVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPS-STEELTELDRKED----DNEITDDTDEDDYEDED

Query:  EDEDECENEEKDKVEDEKE
        +++D   ++E D  ED  E
Subjt:  EDEDECENEEKDKVEDEKE


Sequences
CDS sequence
ATGAGCAGCAAGCAGGGCCCTCCGAAGCACCAAAACAAATACGCTTGGAAACCCAAGGCCGGCCAGAAAATCAACGAGACGGAGGTTGGAGGCAGGTTTCGGCCATATTC
TGAGATCACTGGTGTTTGTCTTCGCTGCAAGGAGCAAATTGATTGGAAACGCCGTTACGGCAAGTACAAACCCCTTTCTGAACCTGCCAAATGTCAACTGTGTTCGAAGC
GGGCTGTTCGTCAAGCGTATCATAATCTCTGTTCTGGTTGTGCCAAGGAGCAAGGTGTATGCGCAAAGTGTCGCTGTCGTGTAGACCAAACTGTTGGAAGGGATTCTTCT
GAAGTGGAGGCGGAGCAAAAAATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATAGAAGAACTCTATTACGTGCTATGGACAAAGGGAAAAGCAAGACTTCAAA
TAGAAATAAATCAGCCGTTAAAGAAGATACCAAGGTTGAGAATTCAATTCCTTCATCAACAGAAGAGCTGACTGAATTAGACAGAAAGGAGGATGACAATGAAATTACTG
ATGACACGGATGAGGATGATTACGAAGATGAAGACGAAGATGAAGATGAATGTGAAAATGAAGAGAAGGATAAAGTTGAAGATGAGAAAGAGTAG
mRNA sequence
ATAACTATTTATCGCTTTCCCCTTCTTTATTCTAAAGCCCTATTTCGCGCTACACTCCCTCCAATTACCGCACTGCACACCAGTTCGCTGCTCGCCTTCTCAGCAGCTGT
CGCCGGCGCCGCTCACTGCTCCTGCCCTACGCTGTCTCTGCAGCCGCCACCGCACGCTGCACTTGCTGCAGTTTCAGCCTCTCAACGAACTCGCCGCCGCTCCCTCTCAC
ACTCTCACCTCTCGGATCAATGAAAGTTATTCCTTTTTTCTCAGAAACAGAAACAACGGTTCGGGCTCTAATCCGTGCTTGTTGATGAGATCGAGCCCACCCTCCGCCGC
CCATTTTTCAGCACTAATCGGTGCTCTGAGTTCTTTGCAAGCAGTAAAACGAGAGGCTAGAGAAGTGGTTTTGAAGCTGTAGTGCAAAACTCGTGATACTCAGAATGAGC
AGCAAGCAGGGCCCTCCGAAGCACCAAAACAAATACGCTTGGAAACCCAAGGCCGGCCAGAAAATCAACGAGACGGAGGTTGGAGGCAGGTTTCGGCCATATTCTGAGAT
CACTGGTGTTTGTCTTCGCTGCAAGGAGCAAATTGATTGGAAACGCCGTTACGGCAAGTACAAACCCCTTTCTGAACCTGCCAAATGTCAACTGTGTTCGAAGCGGGCTG
TTCGTCAAGCGTATCATAATCTCTGTTCTGGTTGTGCCAAGGAGCAAGGTGTATGCGCAAAGTGTCGCTGTCGTGTAGACCAAACTGTTGGAAGGGATTCTTCTGAAGTG
GAGGCGGAGCAAAAAATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATAGAAGAACTCTATTACGTGCTATGGACAAAGGGAAAAGCAAGACTTCAAATAGAAA
TAAATCAGCCGTTAAAGAAGATACCAAGGTTGAGAATTCAATTCCTTCATCAACAGAAGAGCTGACTGAATTAGACAGAAAGGAGGATGACAATGAAATTACTGATGACA
CGGATGAGGATGATTACGAAGATGAAGACGAAGATGAAGATGAATGTGAAAATGAAGAGAAGGATAAAGTTGAAGATGAGAAAGAGTAGTAAATTAATCTTTTTATTTAC
AAGCTGAATTGATGACGATGAAGATGAAGTTAAGGATGAGGAATAGTAACTTACCAGAGAAGTTGTACTACCAATTTCTTTATGAAAGGCTAAATTTTGATTTCATTTGT
TTTTTAGGGTTGCGTTTATTTGTTTGTACTATCAACTGTCGTTTAATGAAAAGAAAAATTACAATACATGGTCTGAAGAATTTGAGCTGTTGAGATCTGCAAAGACAATA
TATCCATATCTCATATGGTTGAATAATATTAGTGTATTACATGTAGTGCCTAATGTTGTATTGAAAAAATATTTATCTTTTCAAAGAATATGGCAGGGG
Protein sequence
MSSKQGPPKHQNKYAWKPKAGQKINETEVGGRFRPYSEITGVCLRCKEQIDWKRRYGKYKPLSEPAKCQLCSKRAVRQAYHNLCSGCAKEQGVCAKCRCRVDQTVGRDSS
EVEAEQKMLQEAIKNARERDRRTLLRAMDKGKSKTSNRNKSAVKEDTKVENSIPSSTEELTELDRKEDDNEITDDTDEDDYEDEDEDEDECENEEKDKVEDEKE