; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037614 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037614
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationchr2:7734810..7738469
RNA-Seq ExpressionLag0037614
SyntenyLag0037614
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591856.1 hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sororia]2.5e-12586.99Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R LTTTARPST+IIQAYQ++QPNPKF+  FGYR DLVGGCGR FPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R+V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022140712.1 uncharacterized protein LOC111011276 [Momordica charantia]6.3e-12485.5Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSPPLITGP R LTTT RPST+I+QAY Y+Q NPKF+RFFGY TDLVGGC RRFPACAS SSGPQVPA SAPLIQSD   A RTSALEK++TIEEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLG+A+S S +N EHRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK VIQSPGDLLCLA S+FLSSGSLFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]1.0e-12687.73Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R LTTTARPST+IIQAYQ++QPNPKFN  FGYR DLVGGCGRRFPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R+V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]1.1e-12587.36Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R LTTTARPST+IIQAYQ++QPN KFN  FGYR DLVGGCGRRFPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]3.9e-12687.36Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R L TTARPST+IIQAYQ++QPNPKFN  FGYR DLVGGCGRRFPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R+V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

TrEMBL top hitse value%identityAlignment
A0A1S3BA03 uncharacterized protein LOC103487632 isoform X13.1e-12185.13Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R  TTT RPST+IIQAYQY+QPNPKFN  FGYRTDLVG C R FPACAS S GPQVPA SAPLIQ+ +G ASRTS LEK++TIEE LE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GSILCF+KGCVHVA SFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG+ARSLSK NVEH+SNLFGLF 
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMNV TVNELKTKLGHVIVMLLLIGFFDKSKK VIQSP DLLCLA SIFLSSG+LFLL+KLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1CHU0 uncharacterized protein LOC1110112763.0e-12485.5Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSPPLITGP R LTTT RPST+I+QAY Y+Q NPKF+RFFGY TDLVGGC RRFPACAS SSGPQVPA SAPLIQSD   A RTSALEK++TIEEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLG+A+S S +N EHRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKW+ + TVNELKTKLGHVIVMLLLIGFF+K+KK VIQSPGDLLCLA S+FLSSGSLFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434365.0e-12787.73Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R LTTTARPST+IIQAYQ++QPNPKFN  FGYR DLVGGCGRRFPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R+V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1INA6 uncharacterized protein LOC1114770895.5e-12687.36Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE
        MQPSP LITGP R LTTTARPST+IIQAYQ++QPN KFN  FGYR DLVGGCGRRFPACAS SSGPQVPA SAP +QSD+G ASRTSALEK+DT+EEGLE
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLE

Query:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT
        KAIYRCRFMA LGV+GSL+GS+LCFVKGCVHVA S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+R V HRSNLFGLFT
Subjt:  KAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFT

Query:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        LKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKK VIQSPGDLLCLA SIFLSS +LFLLSKLTE
Subjt:  LKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

A0A6J1J2R4 uncharacterized protein LOC1114807451.4e-12187.04Show/hide
Query:  MQPSPPLITGPTRILTTTARPSTMIIQAY-QYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGL
        MQPSPPLI+GP+R LTTT RPSTMIIQAY QY Q  PKFN F GY+T L+ GCGRRFPA A+ASSGP VPA SAP IQSDIG ASRTSALEK   IEE L
Subjt:  MQPSPPLITGPTRILTTTARPSTMIIQAY-QYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGL

Query:  EKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLF
        EKAIYRCRFMA LGV GSLVGSILCF+KGCVHVA SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLG+ R+LSKRN+EHRSNLFGLF
Subjt:  EKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLF

Query:  TLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE
        TLKERPKWMN+TTVNELKTKLGHVIVMLLLIGFFDKSKKA IQSPGDLLCLAAS+FLSSGSLFLLSKLTE
Subjt:  TLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)6.6e-6360Show/hide
Query:  SSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLEKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++    +    A  +++  + + +EEG+EK IY CRFM  LG +GSL+GS+LCF+KGC++V +SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLEKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFTLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLF
        MLVFG GLYELFIS+L ++ S +   V +RS+LFG+FTLKERP+W+ V +V+ELKTKLGHVIVMLLLIG FDKSK+ VI S  DLLC++ SIF SS  LF
Subjt:  MLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFTLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLF

Query:  LLSKL
        LLS+L
Subjt:  LLSKL

AT5G13720.1 Uncharacterised protein family (UPF0114)5.4e-4143.58Show/hide
Query:  PACASASSGPQVPATSAPLIQSDIGTASRTSALEK-VDTIEEGLEKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVN------RGKVIML
        P  +++SS P     +   + S  GT    S   +   + E  +E+ I+  RF+A+L V GSL GS+LCF+ GCV++ E++  Y+ N       G++++ 
Subjt:  PACASASSGPQVPATSAPLIQSDIGTASRTSALEK-VDTIEEGLEKAIYRCRFMAVLGVVGSLVGSILCFVKGCVHVAESFSEYFVN------RGKVIML

Query:  LVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFTLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLC
        LVEAIDVYL GTVML+F  GLY LFISH               S+LFG+F +KERPKWM +++++ELKTK+GHVIVM+LL+  F++SK   I +  DLL 
Subjt:  LVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFTLKERPKWMNVTTVNELKTKLGHVIVMLLLIGFFDKSKKAVIQSPGDLLC

Query:  LAASIFLSSGSLFLLSKL
         +  IFLSS SL++L  L
Subjt:  LAASIFLSSGSLFLLSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCGTCTCCACCGTTGATTACTGGCCCTACCAGAATTCTAACGACGACCGCTCGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAGCCTAATCCAAA
ATTCAATAGGTTTTTTGGGTATAGAACCGACCTTGTCGGTGGTTGTGGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGGCCTCAAGTTCCAGCTACTTCTGCTC
CTTTAATCCAGTCCGATATTGGCACTGCGTCCCGGACGTCGGCACTGGAAAAGGTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCA
GTTTTGGGCGTCGTAGGATCTTTGGTTGGGTCTATACTCTGTTTCGTCAAGGGGTGCGTTCATGTTGCAGAATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGAT
AATGTTGCTAGTTGAAGCCATAGATGTGTATCTTTTAGGAACTGTGATGCTAGTCTTTGGCACGGGTCTCTATGAGCTGTTTATCAGCCATCTTGGAAGTGCACGGTCGT
TATCAAAGAGAAATGTTGAGCATAGATCCAACTTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGAACGTAACGACCGTTAACGAGCTGAAAACGAAGCTC
GGGCATGTCATAGTGATGCTGCTTCTGATTGGGTTCTTTGACAAGAGTAAAAAGGCGGTTATACAATCTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATTCCTTTC
CTCTGGTAGTCTGTTTTTGCTGTCTAAACTAACCGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCGTCTCCACCGTTGATTACTGGCCCTACCAGAATTCTAACGACGACCGCTCGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAGCCTAATCCAAA
ATTCAATAGGTTTTTTGGGTATAGAACCGACCTTGTCGGTGGTTGTGGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGGCCTCAAGTTCCAGCTACTTCTGCTC
CTTTAATCCAGTCCGATATTGGCACTGCGTCCCGGACGTCGGCACTGGAAAAGGTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCA
GTTTTGGGCGTCGTAGGATCTTTGGTTGGGTCTATACTCTGTTTCGTCAAGGGGTGCGTTCATGTTGCAGAATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGAT
AATGTTGCTAGTTGAAGCCATAGATGTGTATCTTTTAGGAACTGTGATGCTAGTCTTTGGCACGGGTCTCTATGAGCTGTTTATCAGCCATCTTGGAAGTGCACGGTCGT
TATCAAAGAGAAATGTTGAGCATAGATCCAACTTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGAACGTAACGACCGTTAACGAGCTGAAAACGAAGCTC
GGGCATGTCATAGTGATGCTGCTTCTGATTGGGTTCTTTGACAAGAGTAAAAAGGCGGTTATACAATCTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAATATTCCTTTC
CTCTGGTAGTCTGTTTTTGCTGTCTAAACTAACCGAATAA
Protein sequenceShow/hide protein sequence
MQPSPPLITGPTRILTTTARPSTMIIQAYQYRQPNPKFNRFFGYRTDLVGGCGRRFPACASASSGPQVPATSAPLIQSDIGTASRTSALEKVDTIEEGLEKAIYRCRFMA
VLGVVGSLVGSILCFVKGCVHVAESFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSLSKRNVEHRSNLFGLFTLKERPKWMNVTTVNELKTKL
GHVIVMLLLIGFFDKSKKAVIQSPGDLLCLAASIFLSSGSLFLLSKLTE