; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020336 (gene) of Snake gourd v1 genome

Gene IDTan0020336
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionreactive Intermediate Deaminase A, chloroplastic-like
Genome locationLG01:18177713..18182439
RNA-Seq ExpressionTan0020336
SyntenyTan0020336
Gene Ontology termsGO:1901565 - organonitrogen compound catabolic process (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0019239 - deaminase activity (molecular function)
InterPro domainsIPR006056 - RidA family
IPR006175 - YjgF/YER057c/UK114 family
IPR019897 - RidA, conserved site
IPR035959 - RutC-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152932.1 reactive Intermediate Deaminase A, chloroplastic [Cucumis sativus]2.4e-9091.98Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AARTFHMPAFD+T LRSK+PLAVGVGC SVAGTTLWRSSSTSKRQ+PFASLGIST +S+KEAV+TDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDD+EDQTEQVLKNMGEILKAGG+SYSSVVKTTIMLADLKDFKKVNEIY KYFPSPAPARSTY+VA LPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

XP_008463412.1 PREDICTED: reactive Intermediate Deaminase A, chloroplastic-like [Cucumis melo]2.8e-9193.05Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW A R+FHMPAFD+T LRSKTPLAVGVGC SVAGTTLWRSSSTSKRQ+PFASLGIST +S+KEAV+TDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSD++EDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

XP_022942433.1 reactive Intermediate Deaminase A, chloroplastic [Cucurbita moschata]1.6e-8994.12Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AAR  HMPA DIT LRSKTPLAVGVGCASVAGTTL RSSSTSKRQ+PFASL ISTDAS+KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDDIE QTEQVLKNMGEILKAGGA YSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

XP_022976324.1 reactive Intermediate Deaminase A, chloroplastic [Cucurbita maxima]9.2e-9094.12Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AAR  HMPA DIT LRSKTPLAVGVGCASVAGTT  RSSSTSKRQ+PFASLGISTD S+KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDDIE QTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

XP_038903207.1 reactive Intermediate Deaminase A, chloroplastic-like [Benincasa hispida]6.8e-9394.65Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AARTFHMPAFD+T LRSKTPLAVGVGCASVAGTTLWRSSSTSKRQ+PFASLGISTD S+KEAV+TDKAPAALGPYSQAIKANNLLFVSG LGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDD+EDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSP+PARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

TrEMBL top hitse value%identityAlignment
A0A0A0L2Z6 Uncharacterized protein1.2e-9091.98Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AARTFHMPAFD+T LRSK+PLAVGVGC SVAGTTLWRSSSTSKRQ+PFASLGIST +S+KEAV+TDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDD+EDQTEQVLKNMGEILKAGG+SYSSVVKTTIMLADLKDFKKVNEIY KYFPSPAPARSTY+VA LPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

A0A1S3CJ54 reactive Intermediate Deaminase A, chloroplastic-like1.4e-9193.05Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW A R+FHMPAFD+T LRSKTPLAVGVGC SVAGTTLWRSSSTSKRQ+PFASLGIST +S+KEAV+TDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSD++EDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

A0A6J1FUT7 reactive Intermediate Deaminase A, chloroplastic7.6e-9094.12Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AAR  HMPA DIT LRSKTPLAVGVGCASVAGTTL RSSSTSKRQ+PFASL ISTDAS+KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDDIE QTEQVLKNMGEILKAGGA YSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

A0A6J1IN69 reactive Intermediate Deaminase A, chloroplastic4.4e-9094.12Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AAR  HMPA DIT LRSKTPLAVGVGCASVAGTT  RSSSTSKRQ+PFASLGISTD S+KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSDDIE QTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

A0A6P4APP4 reactive Intermediate Deaminase A, chloroplastic1.1e-8083.42Show/hide
Query:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE
        MAW AARTF++PA D+  LR++ PLAVGVG ASVAG+ +WRSSS+ KR  PFA LGISTD  +KEAVKTDKAPAALGPYSQAIKANN L+VSGVLGL PE
Subjt:  MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPE

Query:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
        TGKFVSD++EDQTEQVLKNMGEILKAGGA YSSVVKTTIMLADLKDFKKVNEIY KYFPSPAPARSTYQVAALPLDAK+EIECIA+L
Subjt:  TGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

SwissProt top hitse value%identityAlignment
P52758 2-iminobutanoate/2-iminopropanoate deaminase2.4e-3250.41Show/hide
Query:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA
        ++  + T KAP A+GPYSQA+  +  +++SG +G++P +G+ VS  + ++ +Q LKNMGEILKA G  +++VVKTT++LAD+ DF  VNEIY +YF S  
Subjt:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA

Query:  PARSTYQVAALPLDAKVEIECIA
        PAR+ YQVAALP  +++EIE +A
Subjt:  PARSTYQVAALPLDAKVEIECIA

P52759 2-iminobutanoate/2-iminopropanoate deaminase1.0e-3049.59Show/hide
Query:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA
        +++ + T KAPAA+G YSQA+  +  ++VSG +G++P +G+ V   + ++ +Q LKN+GEILKA G  +++VVKTT++LAD+ DF  VNEIY  YF    
Subjt:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA

Query:  PARSTYQVAALPLDAKVEIECIA
        PAR+ YQVAALP  +++EIE IA
Subjt:  PARSTYQVAALPLDAKVEIECIA

P52760 2-iminobutanoate/2-iminopropanoate deaminase5.3e-3251.22Show/hide
Query:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA
        +++ + T KAPAA+GPYSQA++ +  +++SG +GL+P +G+ V   + ++ +Q LKN+GEILKA G  +++VVKTT++LAD+ DF  VNEIY  YF    
Subjt:  MKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPA

Query:  PARSTYQVAALPLDAKVEIECIA
        PAR+ YQVAALP  ++VEIE IA
Subjt:  PARSTYQVAALPLDAKVEIECIA

Q10121 RutC family protein C23G10.22.4e-3253.28Show/hide
Query:  KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAP
        ++ + +  AP A+GPYSQA++A N +++SG LGL+P+TG  + + + +QT Q LKN+GE+LKA GA Y +VVKTT++L ++ DF  VNE+YG+YF SP P
Subjt:  KEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAP

Query:  ARSTYQVAALPLDAKVEIECIA
        AR+ YQVAALP    VEIE +A
Subjt:  ARSTYQVAALPLDAKVEIECIA

Q94JQ4 Reactive Intermediate Deaminase A, chloroplastic8.7e-6772.11Show/hide
Query:  MAWFAARTFHMPAFDITT-LRS-KTPL-AVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGL
        M W   R+ + P  D++T LRS +TPL A GVGCA+ AG +L+R SS   R  PFASL +S  +  KE V T+KAPAALGPYSQAIKANNL+F+SGVLGL
Subjt:  MAWFAARTFHMPAFDITT-LRS-KTPL-AVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGL

Query:  NPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
         PETGKFVS+ +EDQTEQVLKNMGEILKA GA YSSVVKTTIMLADL DFK VNEIY KYFP+P+PARSTYQVAALPL+AK+EIECIATL
Subjt:  NPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL

Arabidopsis top hitse value%identityAlignment
AT3G04480.1 endoribonucleases4.8e-0427Show/hide
Query:  APAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFK-KVNEIYGKYFPSPAPARSTYQV
        AP+ +GPYSQA    ++L ++G LGL+P T    ++    +  Q L N   I ++   S SS     ++    +  + + N+++ K+      A+S+ +V
Subjt:  APAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFK-KVNEIYGKYFPSPAPARSTYQV

AT3G20390.1 endoribonuclease L-PSP family protein6.2e-6872.11Show/hide
Query:  MAWFAARTFHMPAFDITT-LRS-KTPL-AVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGL
        M W   R+ + P  D++T LRS +TPL A GVGCA+ AG +L+R SS   R  PFASL +S  +  KE V T+KAPAALGPYSQAIKANNL+F+SGVLGL
Subjt:  MAWFAARTFHMPAFDITT-LRS-KTPL-AVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGL

Query:  NPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL
         PETGKFVS+ +EDQTEQVLKNMGEILKA GA YSSVVKTTIMLADL DFK VNEIY KYFP+P+PARSTYQVAALPL+AK+EIECIATL
Subjt:  NPETGKFVSDDIEDQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGGTTTGCTGCTCGGACCTTTCACATGCCGGCGTTCGACATCACTACATTGCGCTCCAAGACTCCCTTAGCCGTCGGCGTCGGTTGCGCTTCGGTCGCCGGAAC
CACCTTGTGGCGATCTTCCTCAACTTCCAAGCGCCAACTTCCCTTCGCATCCCTCGGCATTTCTACCGATGCTAGTATGAAGGAAGCTGTTAAGACTGACAAGGCTCCAG
CGGCATTAGGGCCATATTCTCAGGCTATCAAAGCCAACAACCTTCTCTTTGTGTCTGGTGTCCTAGGTCTAAATCCTGAGACAGGGAAATTCGTGTCAGATGATATTGAA
GACCAAACTGAGCAGGTCCTCAAAAATATGGGTGAAATATTGAAAGCTGGAGGTGCCAGCTATTCTTCCGTGGTTAAGACAACTATTATGTTGGCTGATCTGAAGGATTT
CAAGAAAGTAAATGAGATTTATGGTAAATACTTTCCATCTCCTGCTCCCGCTCGGTCAACATATCAGGTTGCAGCATTGCCTTTGGATGCTAAGGTTGAGATCGAGTGCA
TCGCTACACTTTGA
mRNA sequenceShow/hide mRNA sequence
GTAACCTTCTATAAATAAGACGTTTTTTATTTGTCTATTGTTATCACCAAACAAGATCAGGGCTGCTCAGTGCGAACAACGAGAAGGAAAGCTATATACTTTGACGATCA
GAGAGAGATGGCTTGGTTTGCTGCTCGGACCTTTCACATGCCGGCGTTCGACATCACTACATTGCGCTCCAAGACTCCCTTAGCCGTCGGCGTCGGTTGCGCTTCGGTCG
CCGGAACCACCTTGTGGCGATCTTCCTCAACTTCCAAGCGCCAACTTCCCTTCGCATCCCTCGGCATTTCTACCGATGCTAGTATGAAGGAAGCTGTTAAGACTGACAAG
GCTCCAGCGGCATTAGGGCCATATTCTCAGGCTATCAAAGCCAACAACCTTCTCTTTGTGTCTGGTGTCCTAGGTCTAAATCCTGAGACAGGGAAATTCGTGTCAGATGA
TATTGAAGACCAAACTGAGCAGGTCCTCAAAAATATGGGTGAAATATTGAAAGCTGGAGGTGCCAGCTATTCTTCCGTGGTTAAGACAACTATTATGTTGGCTGATCTGA
AGGATTTCAAGAAAGTAAATGAGATTTATGGTAAATACTTTCCATCTCCTGCTCCCGCTCGGTCAACATATCAGGTTGCAGCATTGCCTTTGGATGCTAAGGTTGAGATC
GAGTGCATCGCTACACTTTGACCTTTTACTTAATTTCAGGTACCCAGTTTTCAAGACATCATAAATTAATAAGATGGAGATAGCAACCACGGGAACACTCCAAATCCAAT
TTGGCTGTTATAATCCCTGAGGCTTTATTTTTACAGTAGTGTAACCACTTGTGACAGTCATACGTGCTTTTTACCAGTGAAAATAGAAGTCAGATGAAGCATATTGATAT
TGTTGCCTAGTGTTGCAAATTACATGAACGTCTTCTAACATAAAATGTTGTCTTAACAAACCAACTGTCCAGGTTCTTTGTTCTGTAATTAAGCAGCCAACAGGTCAATT
GATAGGAGATAAAACCTTGAGCGTGTTGTGGCCTTTGGAATCCACTTATCCGTTTAGCCCCCCTTTCAATGGATGTCTATCTACCTCTTCAAAACCAACCTTCTTTCCAA
TGAACGCTACTAAGCCTAAGAAGACCTCATCCCTAGTCCCTAAAATGTCTTTAAGATTATCAATATTAGGAAAATATTTGGCTTTTATGGATTTTAGCCAGAAGCAAATC
ATGGTCGTTGAAAATTCTTTATTAAAACAATTGTTATCCCGAAAATTCAAGCCTCCATGATCCATAGGCGTAGTTAGTATTTCTCATCTTACCCCAATGCATTTTCTTTC
TTTTTTATCTTACCCCACCAGAAATTTTCTAGGAGTTTGATTGATTTCCTTGCAATAACTCTCAAGAATTTTTTTGATCGAAATAGCAAATGTAAGTTGGAATTGATTGA
GCCATGACTTCTGTTAGAGTTTCCTTTCCAGCTAAGGAAAAAGGTTCCTTTTTAGGCTTCTACCTTGATAAAACTTTTCTCACTCCCTCGTTTAAAAGTTGAAAGCTTTG
CTTGTTCTGTTTGTTTTTTTCCCTACGAATAGCAGAAGGGAGGAAGATCAAGACGTACGTAGACATTTCCAGGGTCCACCAGCTGGACTATAAAAAATCAGCCAAAGTTA
ATATAAGCCCTTTGCTTGTATTTTTCTTCATGCTAAAGTTAATTTTGATAGATTAATAATTTTCCAGAAAGC
Protein sequenceShow/hide protein sequence
MAWFAARTFHMPAFDITTLRSKTPLAVGVGCASVAGTTLWRSSSTSKRQLPFASLGISTDASMKEAVKTDKAPAALGPYSQAIKANNLLFVSGVLGLNPETGKFVSDDIE
DQTEQVLKNMGEILKAGGASYSSVVKTTIMLADLKDFKKVNEIYGKYFPSPAPARSTYQVAALPLDAKVEIECIATL