CuGenDBv2

Lag0002787 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002787
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr4:45652703..45653551
RNA-Seq Expression: Lag0002787
Synteny: Lag0002787
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
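The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `chromosome:start..end` layout seen on this page and assuming 1-based inclusive coordinates (the page does not state the coordinate convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:45652703..45653551")
print(chrom, start, end, span_length(start, end))  # chr4 45652703 45653551 849
```

Under the inclusive-coordinate assumption the span is 849 bp, which is the same length as the CDS listed in the Sequences section of this page.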


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575681.1 Zinc finger A20 and AN1 domain-containing stress-associated protein 8, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.5e-91; % identity: 71.07)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSKFLRP+PSS+FL      E F++ PS HS+P F QSPRRT   PT+DL  S T +    L++ +SYRRS NFN         S++PN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW
        +R+R P +     I RLRNPSF  SD +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGAW
Subjt:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTG
        ISYATR SP TEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTG
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTG

XP_022145979.1 uncharacterized protein LOC111015297 [Momordica charantia] (e value: 4.4e-96; % identity: 73.24)
Query:  MAAIVTRRLSSKFLRPLPSSSFLEEPFSQFP---SPH-SNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFN------LINPDLR--IQ
        MAAIVTRRLSSKFLRPLPSSS    P SQ P    PH S+P F    RRT TNP++ LFNS TAS +PSL++ +S  RST+FN      +I P+    IQ
Subjt:  MAAIVTRRLSSKFLRPLPSSSFLEEPFSQFP---SPH-SNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFN------LINPDLR--IQ

Query:  SNSPNFDYRVR-----DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLV
        S SPNFD+RVR     D CSI   RNP+ LN    QK GFSSTSENE +QKP +L HQ++EGPTVERDLSALAGETRGV+E MMKNVY LSKAMAVLGLV
Subjt:  SNSPNFDYRVR-----DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLV

Query:  QLGIGAWISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        QLGIG WISYATRGSPITEVSIQSFVAFGFPFS+AFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLNVLFVRVR+VS LC+TGLSVG+LFAL+SR
Subjt:  QLGIGAWISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

XP_022954073.1 uncharacterized protein LOC111456447 [Cucurbita moschata] (e value: 3.5e-101; % identity: 73.63)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSKFLRP+PSS+FL      E F++ PS HS+PSF QSPRRT   PT+DL NSHT  +D  L++ +SYRRS NFNL N         PN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW
        +R+R P +     I RLRNPSF  SD +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGAW
Subjt:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        ISYATR SP TEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTGLSVG+LFALLSR
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

XP_022991920.1 uncharacterized protein LOC111488415 [Cucurbita maxima] (e value: 2.1e-98; % identity: 72.35)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSK+LRP PSS+ L      E F++ PS HS+PSF QSPRRT   P +DL NSHT  +D  L++ +SYRRS NFN     L   S++PN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVR------DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGA
        +R+R       P  I RLRNPSF   D +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGA
Subjt:  YRVR------DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGA

Query:  WISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        WISYATR SPITEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTGLSVG+LFALLSR
Subjt:  WISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

XP_023549384.1 uncharacterized protein LOC111807746 [Cucurbita pepo subsp. pepo] (e value: 2.5e-99; % identity: 72.26)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSKFLRP+PSS+FL      E F++ PS HS+PSF QSPR+T   PT+DL NSHT  +D  L++ + +RRS NFN         ++SPN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW
        +R+R P +     I RLRNPSF  SD +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGAW
Subjt:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        ISYATR SP TEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTGLSVG+LFALLSR
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K8D3 Uncharacterized protein (e value: 2.3e-90; % identity: 69.9)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSS   RP   S+FL      EP ++F SPHS+PSFL S RRT  N T+DLFNS + S                   I  DL +Q+ + NF+
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRV-RDPCSIRRLRNPSFLN-SDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWISY
          +     SI +LRNPSF++ SDF++KS FS+TSE E  QKP D KHQ++EGPTVERDLSALA ETR VIE MMKNVYRLSKAMAVLGLVQLGIGAWISY
Subjt:  YRV-RDPCSIRRLRNPSFLN-SDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWISY

Query:  ATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
         TRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTL+LQI KNLN LFVRVRTVSFLCVTGLSVG+LFALLSR
Subjt:  ATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

A0A5A7UUD1 Uncharacterized protein (e value: 3.3e-89; % identity: 68.28)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTN-PTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNF
        MAAIVTRRLSS   RP   S+FL      +PF++ PSPHS+PSFL+SPRRT  N  TLDLFNS + S                   I PDL+ Q+ SP F
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTN-PTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNF

Query:  DYRVRD-PCSIRRLRNPSFL-NSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWIS
        ++ +     SI +L NP+F+  SDF++KS FS+T E E  QK  D KHQ++EGPTVERDLSALA ETR V+E MMKNVYRLSKAMAVLGLVQLG+GAWIS
Subjt:  DYRVRD-PCSIRRLRNPSFL-NSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWIS

Query:  YATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        Y TRGSPITEVSIQSFVAFGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTL+LQI KNLN LFVRVRTVS LCVTGLSVG+LFALLSR
Subjt:  YATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

A0A6J1CWU5 uncharacterized protein LOC111015297 (e value: 2.1e-96; % identity: 73.24)
Query:  MAAIVTRRLSSKFLRPLPSSSFLEEPFSQFP---SPH-SNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFN------LINPDLR--IQ
        MAAIVTRRLSSKFLRPLPSSS    P SQ P    PH S+P F    RRT TNP++ LFNS TAS +PSL++ +S  RST+FN      +I P+    IQ
Subjt:  MAAIVTRRLSSKFLRPLPSSSFLEEPFSQFP---SPH-SNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFN------LINPDLR--IQ

Query:  SNSPNFDYRVR-----DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLV
        S SPNFD+RVR     D CSI   RNP+ LN    QK GFSSTSENE +QKP +L HQ++EGPTVERDLSALAGETRGV+E MMKNVY LSKAMAVLGLV
Subjt:  SNSPNFDYRVR-----DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLV

Query:  QLGIGAWISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        QLGIG WISYATRGSPITEVSIQSFVAFGFPFS+AFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLNVLFVRVR+VS LC+TGLSVG+LFAL+SR
Subjt:  QLGIGAWISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

A0A6J1GQ34 uncharacterized protein LOC111456447 (e value: 1.7e-101; % identity: 73.63)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSKFLRP+PSS+FL      E F++ PS HS+PSF QSPRRT   PT+DL NSHT  +D  L++ +SYRRS NFNL N         PN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW
        +R+R P +     I RLRNPSF  SD +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGAW
Subjt:  YRVRDPCS-----IRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        ISYATR SP TEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTGLSVG+LFALLSR
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

A0A6J1JU98 uncharacterized protein LOC111488415 (e value: 1.0e-98; % identity: 72.35)
Query:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD
        MAAIVTRRLSSK+LRP PSS+ L      E F++ PS HS+PSF QSPRRT   P +DL NSHT  +D  L++ +SYRRS NFN     L   S++PN D
Subjt:  MAAIVTRRLSSKFLRPLPSSSFL-----EEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFD

Query:  YRVR------DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGA
        +R+R       P  I RLRNPSF   D +QK GFSST ENE  +KP D KHQ++EGPTVERDLSALAGETR V+E MMKNVY LSKAMA+LGLVQLGIGA
Subjt:  YRVR------DPCSIRRLRNPSFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGA

Query:  WISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
        WISYATR SPITEVSIQSFV+FGFPFS+AFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VSFLCVTGLSVG+LFALLSR
Subjt:  WISYATRGSPITEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G12650.1 unknown protein (e value: 1.8e-47; % identity: 60.22)
Query:  SIRRLRNPSFLNSDFNQKSGFSSTS--ENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWISYATRGSPI
        +I +LR+ ++    F+  SG +     E  K ++   +KHQE+EGPTVERDLSAL  ETR V+E MMKN+Y LS AM  LGL QL +GA I YATR  P+
Subjt:  SIRRLRNPSFLNSDFNQKSGFSSTS--ENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWISYATRGSPI

Query:  TEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLS
         E++IQS +AFGFPF+MA ++R+SLKPM FFKKMEE GRLQILTLTLQ+AKNLN+LFVR R VS LCV  L  G LF LLS
Subjt:  TEVSIQSFVAFGFPFSMAFILRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLS
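The unlabeled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, percent identity for one block can be estimated from that line alone. `midline_identity` is a hypothetical helper; the values reported on this page are presumably computed over the full alignment, so a single block will generally not reproduce them.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, estimated from its midline.

    query   -- the aligned query residues (gaps included), label removed
    midline -- the match line beneath it; trailing spaces may be missing
    """
    width = len(query)
    if width == 0:
        raise ValueError("empty alignment block")
    # Restore trailing spaces that are often stripped from scraped text.
    padded = midline.ljust(width)[:width]
    # Letters in the midline mark identical positions; '+' and ' ' do not.
    identical = sum(ch.isalpha() for ch in padded)
    return 100.0 * identical / width


# Toy block, not taken from this page: 6 identical columns out of 9.
print(round(midline_identity("MAAIV-TRR", "MA+IV TR"), 2))  # 66.67
```

Summing identical columns and block widths over every block of a hit, then dividing once, gives a whole-alignment figure rather than a per-block one.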


Sequences
CDS sequence
ATGGCCGCCATTGTTACGCGCAGGTTAAGCTCCAAATTTCTCAGACCACTTCCTTCTTCCTCCTTCCTCGAAGAACCCTTTTCCCAATTCCCTTCTCCACATTCTAATCC
CTCGTTTTTGCAATCTCCCCGACGAACAGTTACAAACCCTACACTTGACCTCTTCAATTCCCACACAGCTTCTTCAGATCCCTCCCTCGACTCTCCCAAATCTTACCGAA
GATCCACGAATTTCAATCTAATCAATCCCGATTTACGGATTCAGAGTAATAGCCCCAATTTCGATTATCGGGTTCGGGATCCCTGTTCGATTCGGAGATTGAGAAACCCT
AGCTTCTTGAACTCGGATTTCAACCAGAAATCTGGATTCTCTTCAACCTCGGAGAACGAGAAGTCGCAGAAACCCGGCGATTTGAAGCACCAAGAAATGGAAGGGCCGAC
CGTGGAGCGAGACCTGTCGGCGTTGGCCGGTGAAACCAGAGGGGTGATTGAAGTGATGATGAAGAACGTGTACAGGTTAAGCAAAGCTATGGCGGTTCTGGGTCTGGTTC
AACTGGGCATCGGGGCTTGGATTTCGTACGCAACTCGAGGTTCACCAATTACAGAAGTTTCTATCCAGAGCTTCGTGGCGTTCGGGTTTCCATTCTCGATGGCCTTCATT
CTGCGGCAGTCTCTGAAGCCGATGATGTTCTTCAAGAAGATGGAGGAACAAGGTAGGTTGCAGATTCTAACTCTGACTCTTCAGATTGCTAAGAATTTGAATGTTCTGTT
TGTTAGAGTGCGAACTGTTTCTTTCTTGTGTGTAACTGGATTGTCTGTTGGACTTCTGTTTGCTTTGCTTTCAAGATGA
mRNA sequence
ATGGCCGCCATTGTTACGCGCAGGTTAAGCTCCAAATTTCTCAGACCACTTCCTTCTTCCTCCTTCCTCGAAGAACCCTTTTCCCAATTCCCTTCTCCACATTCTAATCC
CTCGTTTTTGCAATCTCCCCGACGAACAGTTACAAACCCTACACTTGACCTCTTCAATTCCCACACAGCTTCTTCAGATCCCTCCCTCGACTCTCCCAAATCTTACCGAA
GATCCACGAATTTCAATCTAATCAATCCCGATTTACGGATTCAGAGTAATAGCCCCAATTTCGATTATCGGGTTCGGGATCCCTGTTCGATTCGGAGATTGAGAAACCCT
AGCTTCTTGAACTCGGATTTCAACCAGAAATCTGGATTCTCTTCAACCTCGGAGAACGAGAAGTCGCAGAAACCCGGCGATTTGAAGCACCAAGAAATGGAAGGGCCGAC
CGTGGAGCGAGACCTGTCGGCGTTGGCCGGTGAAACCAGAGGGGTGATTGAAGTGATGATGAAGAACGTGTACAGGTTAAGCAAAGCTATGGCGGTTCTGGGTCTGGTTC
AACTGGGCATCGGGGCTTGGATTTCGTACGCAACTCGAGGTTCACCAATTACAGAAGTTTCTATCCAGAGCTTCGTGGCGTTCGGGTTTCCATTCTCGATGGCCTTCATT
CTGCGGCAGTCTCTGAAGCCGATGATGTTCTTCAAGAAGATGGAGGAACAAGGTAGGTTGCAGATTCTAACTCTGACTCTTCAGATTGCTAAGAATTTGAATGTTCTGTT
TGTTAGAGTGCGAACTGTTTCTTTCTTGTGTGTAACTGGATTGTCTGTTGGACTTCTGTTTGCTTTGCTTTCAAGATGA
Protein sequence
MAAIVTRRLSSKFLRPLPSSSFLEEPFSQFPSPHSNPSFLQSPRRTVTNPTLDLFNSHTASSDPSLDSPKSYRRSTNFNLINPDLRIQSNSPNFDYRVRDPCSIRRLRNP
SFLNSDFNQKSGFSSTSENEKSQKPGDLKHQEMEGPTVERDLSALAGETRGVIEVMMKNVYRLSKAMAVLGLVQLGIGAWISYATRGSPITEVSIQSFVAFGFPFSMAFI
LRQSLKPMMFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRTVSFLCVTGLSVGLLFALLSR
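The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the Python standard library; `translate` is a hypothetical helper written for illustration, and the example inputs are the first 24 and last 18 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCGCCATTGTTACGCGCAGG"))  # MAAIVTRR
print(translate("GCTTTGCTTTCAAGATGA"))        # ALLSR
```

Both fragments agree with the start (`MAAIVTRR`) and end (`ALLSR`) of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole record.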