; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000591 (gene) of Snake gourd v1 genome

Gene IDTan0000591
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein, putative
Genome locationLG10:5053603..5056936
RNA-Seq ExpressionTan0000591
SyntenyTan0000591
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7028797.1 hypothetical protein SDJN02_09978, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-8378.73Show/hide
Query:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DG
        GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+W+N ITTFLKRPNGG AN        +H+ AT   NAAS  +G
Subjt:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DG

Query:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        GG  +VDFL+EMKKVC VA P+LKVRTARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQK
Subjt:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

XP_004134033.1 uncharacterized protein LOC101222608 [Cucumis sativus]2.2e-8680.48Show/hide
Query:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEE
        TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSWRN I+TFLKRPN GGS N++++ + HA + ATA ++    GG  AEVDFLEE
Subjt:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEE

Query:  MKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT
        MKK CK AHPKL+V T RVELEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT
Subjt:  MKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT

Query:  KTHRNFWLLA
        KTHRNFWLLA
Subjt:  KTHRNFWLLA

XP_008438448.1 PREDICTED: uncharacterized protein LOC103483538 [Cucumis melo]1.7e-8680.86Show/hide
Query:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM
        ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN GGS N++++ + HA + ATA ++    GG  A+VDFLEEM
Subjt:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM

Query:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
        KK CKVAHPK+KV T RVELEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
Subjt:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK

Query:  THRNFWLLA
        THRNFWLLA
Subjt:  THRNFWLLA

XP_023538863.1 uncharacterized protein LOC111799663 [Cucurbita pepo subsp. pepo]2.2e-8379.19Show/hide
Query:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--G
        GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPNSW+N ITTFLKRPNGG AN            AT   NAASD  G
Subjt:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--G

Query:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        GG  +VDFL+EMKKVCKVA P+L VRTARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQK
Subjt:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

XP_038881793.1 homeobox protein 5 [Benincasa hispida]1.0e-8881.74Show/hide
Query:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSAN----------AHSHAHAHAHSHATAPANAASDGGG
        TST   +KVMVVVDPTRESAAALQYALSHAVIDND+VILLHVDNPNSW+N ITTFLKRPNGGSAN           H++A+A A + ATA ++    GG 
Subjt:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSAN----------AHSHAHAHAHSHATAPANAASDGGG

Query:  AAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG
         AEVDFLEEMKK CK AHPKLKV T RVELEGKDKA+MIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMK AKMLDTAEYLIENSKCTCVAVQKKG
Subjt:  AAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKG

Query:  QNAGYLLNTKTHRNFWLLA
        QNAGYLLNTKTHRNFWLLA
Subjt:  QNAGYLLNTKTHRNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0L6Y4 Uncharacterized protein1.0e-8680.48Show/hide
Query:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEE
        TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSWRN I+TFLKRPN GGS N++++ + HA + ATA ++    GG  AEVDFLEE
Subjt:  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEE

Query:  MKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT
        MKK CK AHPKL+V T RVELEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT
Subjt:  MKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNT

Query:  KTHRNFWLLA
        KTHRNFWLLA
Subjt:  KTHRNFWLLA

A0A1S3AWE0 uncharacterized protein LOC1034835388.0e-8780.86Show/hide
Query:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM
        ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN GGS N++++ + HA + ATA ++    GG  A+VDFLEEM
Subjt:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM

Query:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
        KK CKVAHPK+KV T RVELEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
Subjt:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK

Query:  THRNFWLLA
        THRNFWLLA
Subjt:  THRNFWLLA

A0A5A7U1M3 Uncharacterized protein8.0e-8780.86Show/hide
Query:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM
        ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN GGS N++++ + HA + ATA ++    GG  A+VDFLEEM
Subjt:  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN-GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEM

Query:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
        KK CKVAHPK+KV T RVELEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK
Subjt:  KKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTK

Query:  THRNFWLLA
        THRNFWLLA
Subjt:  THRNFWLLA

A0A6J1FDQ7 uncharacterized protein LOC1114448127.0e-8378.28Show/hide
Query:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DG
        GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+W+N ITTFLKRPNGG AN        +H+ AT   NAAS   G
Subjt:  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DG

Query:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK
        GG  +VDFL+EMKKVC VA P+LKVR ARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKMLDTAEYLIENS CTCVAVQK
Subjt:  GGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQK

Query:  KGQNAGYLLNTKTHRNFWLLA
        KGQNAGYLLNTKTHRNFWLLA
Subjt:  KGQNAGYLLNTKTHRNFWLLA

A0A6J1IE79 uncharacterized protein LOC111473211 isoform X12.4e-8379.73Show/hide
Query:  GGLLNTTNTTT-STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--
        GG   TTNTTT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPNSW+N ITTFLKRPNGG AN        +H+ AT   NAASD  
Subjt:  GGLLNTTNTTT-STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--

Query:  GGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQ
        GGG  +VDFL+EMKKVC VA P+LKVRTARVE+EGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G      KMLDTAEYLIENS CTCVAVQ
Subjt:  GGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQ

Query:  KKGQNAGYLLNTKTHRNFWLLA
        KKGQNAGYLLNTKTHRNFWLLA
Subjt:  KKGQNAGYLLNTKTHRNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.6e-1832.67Show/hide
Query:  MVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHP
        MVVVD T ++  ALQ+AL+H V D D + LLHV               R   G A   +    ++ +H                 + +  +K  C++  P
Subjt:  MVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHP

Query:  KLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWL
         +K     VE   ++K   I+ ++K  G  +LV+GQR+  S    I  +R  GG   G       EY I NS C  +AV+KK  N GYL+ TK H++FWL
Subjt:  KLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWL

Query:  LA
        LA
Subjt:  LA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.4e-1327.4Show/hide
Query:  TTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKK
        T +  +VMVVVD    S  AL++AL H +   D + LL+   P         F K   G   N  S                          + +  +KK
Subjt:  TTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKK

Query:  VCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKT
        +C+   P ++V   R++ + K+K   I+ + K   + LLV+G+ +     +    +  G  K      T +Y +E + C  +AV+ K +   GYL+ TK 
Subjt:  VCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKT

Query:  HRNFWLLA
        H+NFWLLA
Subjt:  HRNFWLLA

AT4G13450.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.2e-5651.64Show/hide
Query:  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKRPN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SW+N  ++FL+ P+    S++  S A     + + A ANA +   G  + +F
Subjt:  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKRPN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDF

Query:  LEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL
        LE+MK++C++A PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LG RR GG ++G+K +DTAEYLIENSKCTCV V KKGQN GY+
Subjt:  LEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYL

Query:  LNTKTHRNFWLLA
        LNTKTH+NFWLLA
Subjt:  LNTKTHRNFWLLA

AT4G13450.2 Adenine nucleotide alpha hydrolases-like superfamily protein5.2e-3043.75Show/hide
Query:  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKRPN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDF
        ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SW+N  ++FL+ P+    S++  S A     + + A ANA +   G  + +F
Subjt:  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKRPN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDF

Query:  LEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGY
        LE+MK++C++A PK++V T  + ++G  KA  I+     LG+D+++IGQRR++S+++LGY
Subjt:  LEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGY

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.8e-1429.06Show/hide
Query:  KVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVA
        +VMVVVD    S  AL++A++H +   D + LL+   P  +R +     KR N                         +D       + +  +KK+C+  
Subjt:  KVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVA

Query:  HPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFW
         P ++V   R+E + KDK   I+ ++K   + LLV+GQ +      L  R +    +G +     +Y +EN+ C  +AV+ K +   GYL+ TK H+NFW
Subjt:  HPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQN-AGYLLNTKTHRNFW

Query:  LLA
        LLA
Subjt:  LLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGTTGAGGTCCGAAAAATGGAATGCGGCGGCGGCTGCTGCTGTGGATGGTAGCATCGCCGGAGGTAGACGGCAGCCATGGGAGGAGGGCGGGCTATTAAATAC
TACTAATACTACTACGTCCACCACGTCACCAAAAAAGGTCATGGTCGTCGTAGATCCCACACGAGAGTCCGCCGCCGCGCTCCAGTACGCGCTTTCGCATGCTGTCATTG
ATAACGACGAGGTCATTCTTCTCCATGTTGATAACCCTAATTCTTGGAGGAACACCATTACTACATTCCTTAAGAGGCCCAATGGCGGATCCGCCAATGCTCATTCTCAT
GCTCATGCTCATGCTCATTCTCATGCCACGGCACCTGCAAATGCCGCCTCCGACGGCGGAGGAGCGGCGGAGGTCGATTTTCTTGAGGAGATGAAGAAGGTCTGCAAGGT
TGCTCATCCCAAACTGAAAGTGCGCACGGCGAGGGTTGAATTGGAAGGCAAAGACAAGGCCGCCATGATTATGGCTCAAACCAAGGCTCTCGGTATTGATCTGCTAGTCA
TAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACAGC
AAATGCACTTGTGTTGCTGTACAAAAGAAGGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTGTTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
CACAAACTTTGAGATAGGAATTGATTTAAAAAATAAAAAACAACAAAGGAGAAAAACTAGGGTTTTTTTGGTGGCGGAAGAAAGGGCGAGAAGATCGGAGAAGACGGTGA
CCGAATCAGAAGGAAGCAGAAAAATCGGAAGAAATGGAAGAGTTGAGGTCCGAAAAATGGAATGCGGCGGCGGCTGCTGCTGTGGATGGTAGCATCGCCGGAGGTAGACG
GCAGCCATGGGAGGAGGGCGGGCTATTAAATACTACTAATACTACTACGTCCACCACGTCACCAAAAAAGGTCATGGTCGTCGTAGATCCCACACGAGAGTCCGCCGCCG
CGCTCCAGTACGCGCTTTCGCATGCTGTCATTGATAACGACGAGGTCATTCTTCTCCATGTTGATAACCCTAATTCTTGGAGGAACACCATTACTACATTCCTTAAGAGG
CCCAATGGCGGATCCGCCAATGCTCATTCTCATGCTCATGCTCATGCTCATTCTCATGCCACGGCACCTGCAAATGCCGCCTCCGACGGCGGAGGAGCGGCGGAGGTCGA
TTTTCTTGAGGAGATGAAGAAGGTCTGCAAGGTTGCTCATCCCAAACTGAAAGTGCGCACGGCGAGGGTTGAATTGGAAGGCAAAGACAAGGCCGCCATGATTATGGCTC
AAACCAAGGCTCTCGGTATTGATCTGCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATG
TTGGACACAGCAGAGTATTTGATTGAGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAGGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTT
CTGGCTGTTGGCTTGATATCATATCATCAACCTCTCTTCACCATTTTCCATTCTCCACAATCCACCACTGTTTTCAATTCATCACACTCTCTTTCTCTCTCTCTCTCTCT
CTCTCTTTAAACTCAAAACAGATTGTCTTTCAATTTGGGTCACTTTCTTCTTCCCCCATTGTGAGATCCATTGCAAGCTCACCCAGTTGAATTTGTTGATTCATTCATCT
CTAGGCTTGGCTTGGCTTGGTTTGGTTTGGATTGGTCACCCTCCTTTTCTTTCTTTCCAGGCAGTGCATCAACATTTATTTGTACATACTTTTGGACCCACAGAATTAAC
CAACTGAAAATGAGTTAATCCACTCTCCTTTTTCTTTATTGCTTTCTTTTTTTTTTTCTTCCTTTTGCTCCAAAGGATTTGTTTGTTAATCATGTGCTTATTTGTGTCAG
TCTGTTTAATCTTGACATTGTAATTTTGTTCCTTTGAGCTTGAGTTTGTATAAGTGGTGAAATTAGTACTCTTCTTACAACCTAACCTAGAAAATTGCCCTGAAGCCAAC
TTAGTCAACTGTCGTGTCGAGTCGAGTAGTAGTTAAAAATGTCCAAACCGAACAAGAAAATATAACCTAACTACTGTAAAATCCCACATCATCTAGTAACAAAAAAAAAA
TGTCCACACCCAGTCTCAAATCTTCGAATGCCAGCAACTTTACTGTAAGAAGTTATTTATTTCACTGCTTATCCAAGTTCATGGGGTTCAATTTGATGAAAAGGTTAAAT
TGGACTCATTTAGACAGGTTCAAGAAGGAAAAATAGAAGTAAAAAGCCCTCCTTTTCTTTGTTTTGTTACTTTGTGTAAGATTGTTCTTATCCATATTGAAATAAAATGA
AGTCTCAATGTTTGTAGGGATGATCTTGTTGAGCGAGAAAGAAGATCAAACCATCCAATTTTTAGGATGAGCATTTTCCATCCAGATTAATGAGATTAATAAAA
Protein sequenceShow/hide protein sequence
MEELRSEKWNAAAAAAVDGSIAGGRRQPWEEGGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSH
AHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENS
KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA