CuGenDBv2

HG10022199 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022199
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SnoaL-like domain-containing protein
Genome location: Chr05:21861282..21866566
RNA-Seq Expression: HG10022199
Synteny: HG10022199
Gene Ontology terms: NA
InterPro domains: IPR032710 - NTF2-like domain superfamily
                  IPR037401 - SnoaL-like domain
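
The genome location above uses a `chromosome:start..end` layout. The following is a minimal Python sketch for splitting such a string and computing the span it covers. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'Chr05:21861282..21866566' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr05:21861282..21866566")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the span is the genomic extent of the gene, which may include introns and untranslated regions and so can be longer than the CDS.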


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037630.1 putative protein transporter [Cucumis melo var. makuwa] (e value: 1.8e-95; % identity: 88.61)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMALHS
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL S
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMALHS

Query:  KM
         +
Subjt:  KM

KAG7033068.1 hypothetical protein SDJN02_07121, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.5e-152; % identity: 74.62)
Query:  MNSIVASPAPF-SPI-FNHKKSS-PPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIA
        M+S+ + P+PF SP+ FNHKKS+ PPLLIS SFT    PR+    PLRVSSSSS           ++ P+DTL+SAS VVR FY G+NRHDLASVE LIA
Subjt:  MNSIVASPAPF-SPI-FNHKKSS-PPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIA

Query:  ENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKK-QIIYARDSVEPAFKPGEM
        ENCVYEDLIFSRPFVGRKDIL+FFKKFNDSISKDLQFVIDDIST+DSSA+GVLWHLEWKGKEFPFSKGCSFYRLV  D KK QIIYARDSVEPA KPGEM
Subjt:  ENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKK-QIIYARDSVEPAFKPGEM

Query:  ALHSKMKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCS
        AL SKMKREGRQHGMVRTYRIIPSPWNPRP+TRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKA+DK KGS+KFK+HHLL DCS
Subjt:  ALHSKMKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCS

Query:  L-DAFDGCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDI-HHVDDEDEMNYFGVEFVVDETMEGDDGWCLIEERQTI
        L   FD CPSFIDDD +D                     DD+ QKVTDFES+VEDI  HVDD+DEM+Y  VE VVDE MEG D WCL+EER  I
Subjt:  L-DAFDGCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDI-HHVDDEDEMNYFGVEFVVDETMEGDDGWCLIEERQTI

XP_008458855.1 PREDICTED: uncharacterized protein LOC103498136 isoform X1 [Cucumis melo] (e value: 5.2e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

XP_008458857.1 PREDICTED: uncharacterized protein LOC103498136 isoform X2 [Cucumis melo] (e value: 5.2e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

XP_038889553.1 uncharacterized protein LOC120079446 isoform X1 [Benincasa hispida] (e value: 8.8e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKSS P+LIS+SFTSPPIPRS  YSPLRVSSSSSDNPA+TVPSPP +A LDTL+SASHVVRDFYHGINRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDLIFSRPFVGRK+ILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDV GKKQIIYARDSVEPAFKPGE+AL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KQZ4 SnoaL-like domain-containing protein (e value: 7.3e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS  P  IS+SFTSP IP+  PYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSAS+VVRDFY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

A0A1S3C8U3 uncharacterized protein LOC103498136 isoform X1 (e value: 2.5e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

A0A1S3C9E5 uncharacterized protein LOC103498136 isoform X2 (e value: 2.5e-95; % identity: 89.9)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

A0A5A7T2M8 SnoaL-like domain-containing protein (e value: 8.6e-96; % identity: 88.61)
Query:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC
        MNSI++S +P  PIFNHKKS PP  IS+SFTSP IPR   YSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVR+FY G+NRHDLASVE LIAENC
Subjt:  MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENC

Query:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMALHS
        VYEDL+FSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL DVDGKKQIIYARDSVEPAFKPGEMAL S
Subjt:  VYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMALHS

Query:  KM
         +
Subjt:  KM

A0A5A7T7S1 Putative Calcium-binding site (e value: 1.4e-85; % identity: 82.98)
Query:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD
        MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKC KPRCSGCRLHPVQKA+DKTKGS KFK H LLID S+D F+
Subjt:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD

Query:  GCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDI-HHVDDEDEMNYFGVEFVVDETMEGDDGWCLIEERQTI
         CPS IDDDG+DQSLEEID++LRVG+IHRH D D ++QKVTDFES+VE I HHVDDEDE +Y  VEFVVDE +EGDD WCL+EERQ I
Subjt:  GCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDI-HHVDDEDEMNYFGVEFVVDETMEGDDGWCLIEERQTI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein (e value: 1.2e-49; % identity: 62.8)
Query:  PRSIPYS-PLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQF
        P S+ +S P R+S+S    PA    +     P     SAS VV  FY  +N HDL+SV  LIA++CVYEDL+FS PFVGRK IL FF KF +S S DLQF
Subjt:  PRSIPYS-PLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQF

Query:  VIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL
        VIDDISTEDSSA+GV WHLEWKGK FPFSKGCSFYRL  +DGK+QI+Y RD VEPA KPGE  L
Subjt:  VIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMAL

AT2G41730.1 unknown protein (e value: 6.2e-14; % identity: 43.27)
Query:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD
        MK EGR  G+VRT  I P  +NPRP +   N LD PP  G FTKV SK TNHS  TG+  + + S C + P  K+  K+KG  K +      +  LD   
Subjt:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD

Query:  GCPS
        G  S
Subjt:  GCPS

AT5G24640.1 unknown protein (e value: 5.8e-12; % identity: 39.42)
Query:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD
        MK EGR  G+VRT  I P  +NP    R  N LD PP  G FTK+  K TNHS   G+  + + S C + P  K+  K+KG  K +      +  LD   
Subjt:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFD

Query:  GCPS
        G  S
Subjt:  GCPS

AT5G40690.1 CONTAINS InterPro DOMAIN/s: EF-Hand 1, calcium-binding site (InterPro:IPR018247) (e value: 6.4e-35; % identity: 39.53)
Query:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHL----------
        MKREG+QHGMVRTYRI+P   NPRPE++ VN L   PTAGLFTKV+SKPTNHSKFTGKCG+ RC  C +HPV K++ KTKGS K +++ +          
Subjt:  MKREGRQHGMVRTYRIIPSPWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHL----------

Query:  -----------------LIDCSLDAFDGCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDIHHVDDEDEMNY--FGVEFVVDET
                         ++D   D +     +  D+ N++      +   V NI    DG +     T+ E   +D    DD+  M++   G+  ++D  
Subjt:  -----------------LIDCSLDAFDGCPSFIDDDGNDQSLEEIDIELRVGNIHRHYDGDDKMQKVTDFESKVEDIHHVDDEDEMNY--FGVEFVVDET

Query:  MEGDDGWCLIEERQT
         E D+GW L+EE  T
Subjt:  MEGDDGWCLIEERQT

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein (e value: 1.8e-21; % identity: 33.86)
Query:  SASHVVRDFYHGINRHDLASVEALIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL
        S    V  FY  IN  +   + + I+ +C  +D  F +PF G+++ + FF++   S+ ++++F ++++   D  +  V WHLEWKG++ PF++GCSFY  
Subjt:  SASHVVRDFYHGINRHDLASVEALIAENCVYEDLIFSRPFVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRL

Query:  VDVDGKKQIIYARDSVEPAFKPGEMAL
        +D  G+  I  AR  +E   KPG + L
Subjt:  VDVDGKKQIIYARDSVEPAFKPGEMAL


Sequences
CDS sequence
ATGAACTCCATCGTAGCATCTCCAGCCCCATTCTCTCCCATTTTCAACCACAAAAAGTCTTCTCCTCCACTTTTAATTTCCTATTCTTTCACTTCTCCTCCGATTCCCAG
AAGCATACCCTATTCCCCTCTTCGCGTCTCATCTTCTTCTTCAGACAACCCGGCCGTCACCGTTCCATCTCCACCCACCGATGCTCCTCTTGACACCCTCCGATCCGCTT
CTCATGTCGTAAGGGACTTTTACCATGGAATCAACCGCCATGACCTCGCCTCCGTTGAAGCCCTCATTGCTGAGAATTGCGTTTACGAGGACCTTATCTTTTCTCGCCCT
TTCGTCGGTCGCAAGGACATTCTTCTTTTCTTCAAAAAGTTTAACGATTCGATCAGTAAAGATCTTCAGTTTGTTATTGACGATATATCCACCGAAGACTCGTCCGCTCT
GGGTGTACTTTGGCATTTAGAATGGAAAGGGAAAGAGTTTCCTTTTAGTAAGGGATGCAGCTTTTATCGCTTGGTTGATGTTGATGGCAAGAAACAAATAATCTATGCAA
GAGACAGCGTTGAGCCTGCATTCAAGCCTGGGGAGATGGCTTTGCACTCCAAAATGAAGAGGGAAGGTCGCCAACATGGTATGGTAAGAACCTACCGGATCATCCCTTCC
CCATGGAACCCAAGACCCGAAACCCGATTCGTTAACGAGCTCGATTTCCCTCCTACTGCTGGCCTTTTCACAAAGGTCTCATCCAAGCCTACCAACCACTCCAAATTCAC
CGGAAAATGCGGCAAGCCCCGATGCTCTGGCTGCAGGCTACACCCGGTTCAAAAGGCCAGGGATAAAACCAAGGGATCTCATAAGTTCAAGTCACACCATCTCCTCATTG
ACTGCTCACTGGATGCCTTCGATGGTTGCCCCAGTTTCATCGATGATGATGGGAACGACCAGTCATTGGAAGAAATAGATATTGAATTGAGGGTTGGAAACATTCATCGC
CACTATGACGGTGACGACAAGATGCAGAAAGTAACAGATTTTGAATCGAAGGTCGAAGATATTCATCATGTGGATGATGAAGATGAAATGAATTACTTTGGAGTGGAGTT
CGTCGTGGACGAAACCATGGAGGGAGATGATGGTTGGTGTTTGATAGAAGAAAGACAAACAATCTAA
mRNA sequence
ATGAACTCCATCGTAGCATCTCCAGCCCCATTCTCTCCCATTTTCAACCACAAAAAGTCTTCTCCTCCACTTTTAATTTCCTATTCTTTCACTTCTCCTCCGATTCCCAG
AAGCATACCCTATTCCCCTCTTCGCGTCTCATCTTCTTCTTCAGACAACCCGGCCGTCACCGTTCCATCTCCACCCACCGATGCTCCTCTTGACACCCTCCGATCCGCTT
CTCATGTCGTAAGGGACTTTTACCATGGAATCAACCGCCATGACCTCGCCTCCGTTGAAGCCCTCATTGCTGAGAATTGCGTTTACGAGGACCTTATCTTTTCTCGCCCT
TTCGTCGGTCGCAAGGACATTCTTCTTTTCTTCAAAAAGTTTAACGATTCGATCAGTAAAGATCTTCAGTTTGTTATTGACGATATATCCACCGAAGACTCGTCCGCTCT
GGGTGTACTTTGGCATTTAGAATGGAAAGGGAAAGAGTTTCCTTTTAGTAAGGGATGCAGCTTTTATCGCTTGGTTGATGTTGATGGCAAGAAACAAATAATCTATGCAA
GAGACAGCGTTGAGCCTGCATTCAAGCCTGGGGAGATGGCTTTGCACTCCAAAATGAAGAGGGAAGGTCGCCAACATGGTATGGTAAGAACCTACCGGATCATCCCTTCC
CCATGGAACCCAAGACCCGAAACCCGATTCGTTAACGAGCTCGATTTCCCTCCTACTGCTGGCCTTTTCACAAAGGTCTCATCCAAGCCTACCAACCACTCCAAATTCAC
CGGAAAATGCGGCAAGCCCCGATGCTCTGGCTGCAGGCTACACCCGGTTCAAAAGGCCAGGGATAAAACCAAGGGATCTCATAAGTTCAAGTCACACCATCTCCTCATTG
ACTGCTCACTGGATGCCTTCGATGGTTGCCCCAGTTTCATCGATGATGATGGGAACGACCAGTCATTGGAAGAAATAGATATTGAATTGAGGGTTGGAAACATTCATCGC
CACTATGACGGTGACGACAAGATGCAGAAAGTAACAGATTTTGAATCGAAGGTCGAAGATATTCATCATGTGGATGATGAAGATGAAATGAATTACTTTGGAGTGGAGTT
CGTCGTGGACGAAACCATGGAGGGAGATGATGGTTGGTGTTTGATAGAAGAAAGACAAACAATCTAA
Protein sequence
MNSIVASPAPFSPIFNHKKSSPPLLISYSFTSPPIPRSIPYSPLRVSSSSSDNPAVTVPSPPTDAPLDTLRSASHVVRDFYHGINRHDLASVEALIAENCVYEDLIFSRP
FVGRKDILLFFKKFNDSISKDLQFVIDDISTEDSSALGVLWHLEWKGKEFPFSKGCSFYRLVDVDGKKQIIYARDSVEPAFKPGEMALHSKMKREGRQHGMVRTYRIIPS
PWNPRPETRFVNELDFPPTAGLFTKVSSKPTNHSKFTGKCGKPRCSGCRLHPVQKARDKTKGSHKFKSHHLLIDCSLDAFDGCPSFIDDDGNDQSLEEIDIELRVGNIHR
HYDGDDKMQKVTDFESKVEDIHHVDDEDEMNYFGVEFVVDETMEGDDGWCLIEERQTI
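
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal Python illustration, not the database's own method: it assumes the standard genetic code (the page does not state which translation table applies), and the helper name `translate` is hypothetical. The two example strings are the first 15 and last 12 nucleotides of the CDS shown above.

```python
# Standard genetic code (ASSUMED; not stated on the page), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAACTCCATCGTA"))  # start of the CDS
print(translate("CAAACAATCTAA"))     # end of the CDS, including the stop codon
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.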