; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021720 (gene) of Snake gourd v1 genome

Gene IDTan0021720
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationLG01:105710667..105714647
RNA-Seq ExpressionTan0021720
SyntenyTan0021720
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591856.1 hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sororia]3.2e-12084.5Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P  +  FGY+ DL+GGC R FPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022140712.1 uncharacterized protein LOC111011276 [Momordica charantia]5.5e-12084.13Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSPPLIT   RT+TTT R  PST+I+QAY Y+Q  P  +RFFGY TDL+GGCSRRFPACAS SSGPQVPAASAPLIQSD  AA RTSALEKL+TIEEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ ++ S +N EHRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK VIQ+PGDLLCLA S+FLSSGSLFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]1.3e-12185.24Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P  N  FGY+ DL+GGC RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]1.5e-12084.87Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP    N  FGY+ DL+GGC RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]5.0e-12184.87Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+ TTAR  PST+IIQAYQ++QP P  N  FGY+ DL+GGC RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

TrEMBL top hitse value%identityAlignment
A0A6J1CHU0 uncharacterized protein LOC1110112762.7e-12084.13Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSPPLIT   RT+TTT R  PST+I+QAY Y+Q  P  +RFFGY TDL+GGCSRRFPACAS SSGPQVPAASAPLIQSD  AA RTSALEKL+TIEEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ ++ S +N EHRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK VIQ+PGDLLCLA S+FLSSGSLFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434366.4e-12285.24Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P  N  FGY+ DL+GGC RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1FPY6 uncharacterized protein LOC1114459063.3e-11884.93Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEE
        MQPSPPLI+  +RT+TTT R  PSTMII AY QY Q YP  N F GYKT LI GC RRFPA A+ASSGP VPAASAP IQSD+G ASRTS LEK   IEE
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEE

Query:  GLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFG
         LEKAIYRCRFMAFLGV GSLVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ERTLSKRN+EHRSNLFG
Subjt:  GLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFG

Query:  LFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        LFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKKA IQ+PGDLLCLAAS+FLSSGSLFLLSKLTE
Subjt:  LFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1INA6 uncharacterized protein LOC1114770897.0e-12184.87Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP    N  FGY+ DL+GGC RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQ+PGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1J2R4 uncharacterized protein LOC1114807456.6e-11985.66Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEE
        MQPSPPLI+  +R++TTT R  PSTMIIQAY QY Q YP  N F GYKT LI GC RRFPA A+ASSGP VPAASAP IQSDIG ASRTSALEK   IEE
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEE

Query:  GLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFG
         LEKAIYRCRFMAFLGV GSLVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ERTLSKRN+EHRSNLFG
Subjt:  GLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFG

Query:  LFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE
        LFTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSKKA IQ+PGDLLCLAAS+FLSSGSLFLLSKLTE
Subjt:  LFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.0e-6351.82Show/hide
Query:  ITDLTRTITTT--ARPGPSTMIIQAYQYRQPYPN---INRFFGYKTDLIGGCSRRFPACASA-------SSGPQVPAASAPLIQSDIGAASRTSALEKLD
        +T   RTI     A P PS +I   ++   P      I+ F G K       SR      S+       S G    A+++    +   AA  +++  + +
Subjt:  ITDLTRTITTT--ARPGPSTMIIQAYQYRQPYPN---INRFFGYKTDLIGGCSRRFPACASA-------SSGPQVPAASAPLIQSDIGAASRTSALEKLD

Query:  TIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRS
         +EEG+EK IY CRFM FLG LGSL+GSVLCF+KGC++V  SF +Y VNRGKVI LLVEAID+YLLGTVMLVFG GLYELFIS L +  + +   V +RS
Subjt:  TIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRS

Query:  NLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKL
        +LFG+FTLKERP+W++V +V+ELKTKLGHVIVMLLLIG FDKSK+ VI +  DLLC++ SIF SS  LFLLS+L
Subjt:  NLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKL

AT5G13720.1 Uncharacterised protein family (UPF0114)2.3e-3942.66Show/hide
Query:  PACASASSGPQVPAASAPLIQSDIGAASRTSALEK-LDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVN------RGKVIML
        P  +++SS P     +   + S  G     S   +   + E  +E+ I+  RF+A L V GSL GS+LCF+ GCV++  ++  Y+ N       G++++ 
Subjt:  PACASASSGPQVPAASAPLIQSDIGAASRTSALEK-LDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVN------RGKVIML

Query:  LVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLC
        LVEAIDVYL GTVML+F  GLY LFIS    +           S+LFG+F +KERPKWM +++++ELKTK+GHVIVM+LL+  F++SK   I T  DLL 
Subjt:  LVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQTPGDLLC

Query:  LAASIFLSSGSLFLLSKL
         +  IFLSS SL++L  L
Subjt:  LAASIFLSSGSLFLLSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTA
TCCAAATATTAATAGGTTTTTTGGGTATAAAACCGACCTTATCGGTGGTTGTAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTT
CTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCCGGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTC
ATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAA
AGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAAC
GCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACCTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACA
AAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTCGACAAGAGTAAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATT
TCTTTCCTCTGGTAGCCTGTTTCTGCTGTCTAAACTAACCGAATAA
mRNA sequenceShow/hide mRNA sequence
ATAATCCCTTCCCTCAGCGCGGGCAATATCAACGTGCCGTGATGAAACAGAGAACGAACTGATCATGGAGTATCCCAACTTTAAAGGAGGCCCATAACTTCGGCCCCTAT
CCCTAGTCATGACTCAGTCATAACTATCTCAATTCCCATTTCCTCGGAGTTCAGTGGCCGACGCCACCGTACGGCAATCTTCTTTGATGTTAAAACCCTTTCTCTCCCCC
TCCATCTCCGGCTGCCACCATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCT
ACCAGTACCGGCAACCTTATCCAAATATTAATAGGTTTTTTGGGTATAAAACCGACCTTATCGGTGGTTGTAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGA
CCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCCGGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGC
CATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAAT
ATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATC
AGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACCTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGAC
CGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTCGACAAGAGTAAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTT
GCTTAGCTGCTTCAATATTTCTTTCCTCTGGTAGCCTGTTTCTGCTGTCTAAACTAACCGAATAACAGTAATAAGTTATGTACAAATATAAATATGTAATACACCTTTTT
TTTCACCTTTTTTGGCCCTCCTCTGAGAACGGTTGAAACCGAGAAATGTTAGTTCTTGTAAATGGATTTGTAAGAAGATGTTACTGGAGTGTAGAATGACTGCAAAGTAG
TAAATAAATAAGGCAATAAAGCTTTAGAAAGCT
Protein sequenceShow/hide protein sequence
MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNINRFFGYKTDLIGGCSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRF
MAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKT
KLGHVIVMLLLIGFFDKSKKAVIQTPGDLLCLAASIFLSSGSLFLLSKLTE