; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy01g010680 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy01g010680
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationChr01:30637742..30651958
RNA-Seq ExpressionLcy01g010680
SyntenyLcy01g010680
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064985.1 DUF1997 family protein [Cucumis melo var. makuwa]3.0e-11386.42Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MV+LTTKW GQG GSFPL  A KFSS RKKFE IK SKA NS+TNTK+ANLSV R+EKI+LPSYS    GRTYHI EFLNHPSG+EAMLNKNALKSFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLR+IPTD+DFTVEMLSCKFEGSELVERQN+HFSA+MINHLTW+TIDSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI+QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

XP_022131393.1 uncharacterized protein LOC111004622 isoform X1 [Momordica charantia]2.7e-11486.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLT+KWCG+G+  FPL A QK SS RKKFEVIKLSKA NS+TNTKRANL VTRKEKI+LPSYSD RGGRTYHISEFL HPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLP LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQN+HFSALMINHLTW+T+DSNSFLE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQAL+D+LVPLLLRQ+VQDYEKWI QQ DHS +S S
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

XP_022929609.1 uncharacterized protein LOC111436145 [Cucurbita moschata]1.1e-11287.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IK+SKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

XP_022997460.1 uncharacterized protein LOC111492370 [Cucurbita maxima]1.5e-11287.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IKLSKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKL LSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

XP_038885685.1 uncharacterized protein LOC120075989 isoform X1 [Benincasa hispida]9.1e-11890.12Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLTTKW GQGKGSFPL AA KFSS RKKFEVIKLSKA NS+TNTKRANLSVTR+EKI+LPSYS  R GR YHI EFLNHPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+T+DSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

TrEMBL top hitse value%identityAlignment
A0A0A0LSA8 Uncharacterized protein2.4e-11680.43Show/hide
Query:  DFRIPVADLRLLRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDG
        DFR+P A    LRRP +        S   + +RMVVLTTK  GQG GSFPL  A KFSS RKKFE  KLSKA NS+TNTK+ANLSV ++EKI+LPSYS  
Subjt:  DFRIPVADLRLLRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDG

Query:  RGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWE
          GRTYHI EFLNHPSG+EAMLNKNALKSFQL+DANTYRCTLPKLQLLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+
Subjt:  RGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWE

Query:  TIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        TIDSNS+LE DVKLNLSLEIYTLPFTLMPTAAVE+PGNLMLQALLDNLVPLLLRQL+QDYEKWI QQLDHS++SIS
Subjt:  TIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A5A7VGG9 DUF1997 family protein1.5e-11386.42Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MV+LTTKW GQG GSFPL  A KFSS RKKFE IK SKA NS+TNTK+ANLSV R+EKI+LPSYS    GRTYHI EFLNHPSG+EAMLNKNALKSFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLR+IPTD+DFTVEMLSCKFEGSELVERQN+HFSA+MINHLTW+TIDSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI+QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A6J1BT83 uncharacterized protein LOC111004622 isoform X11.3e-11486.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLT+KWCG+G+  FPL A QK SS RKKFEVIKLSKA NS+TNTKRANL VTRKEKI+LPSYSD RGGRTYHISEFL HPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLP LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQN+HFSALMINHLTW+T+DSNSFLE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQAL+D+LVPLLLRQ+VQDYEKWI QQ DHS +S S
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A6J1EP97 uncharacterized protein LOC1114361455.6e-11387.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IK+SKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

A0A6J1K9Q3 uncharacterized protein LOC1114923707.3e-11387.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IKLSKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKL LSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31115.1 Protein of unknown function (DUF1997)6.8e-5553.81Show/hide
Query:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC
        ++ K+AN+S +RK++I+L    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK+QL++FE  P L LRV PT +D TVE+LSC
Subjt:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC

Query:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS
        K EGSEL+E Q+E FSA+M N +TW       FLE DV+LN++LEI T PFT++P +AVE+PGNL++Q L+D LVPLLL+QL++DY++WI +Q  +S
Subjt:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS

AT4G31115.2 Protein of unknown function (DUF1997)6.8e-5553.81Show/hide
Query:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC
        ++ K+AN+S +RK++I+L    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK+QL++FE  P L LRV PT +D TVE+LSC
Subjt:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC

Query:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS
        K EGSEL+E Q+E FSA+M N +TW       FLE DV+LN++LEI T PFT++P +AVE+PGNL++Q L+D LVPLLL+QL++DY++WI +Q  +S
Subjt:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS

AT5G04440.1 Protein of unknown function (DUF1997)7.4e-1728.83Show/hide
Query:  SFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKI---QLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPK
        SF + ++  F  S K       S A +S T+  R + S T K +    Q  S S  +  R   + E+++ P+   ++L+   +   + +D NT+RC +  
Subjt:  SFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKI---QLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPK

Query:  LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSF---LEADVKLNLSLEIYTLPFTLMPTAAVESPGNLML
         +  NFE  P L +RV        +++LSCK EGS +V  QN+ F A M+N ++ ++    S    + +D  + +++EI    F + P  A+E+ G  +L
Subjt:  LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSF---LEADVKLNLSLEIYTLPFTLMPTAAVESPGNLML

Query:  QALLDNLVPLLLRQLVQDYEKW
          +L  ++P  L QL +DY  W
Subjt:  QALLDNLVPLLLRQLVQDYEKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTTGACTTCAGGATTCCGGTCGCCGACCTCCGCCTCCTCCGCCGCCCGAGTGAGATTCCGGACAAGGCAGATATTTCGAGCACAAAATTCCAGTTGAAGAGGAT
GGTGGTTTTGACTACTAAATGGTGTGGGCAAGGAAAAGGCTCGTTTCCATTGTTTGCTGCCCAGAAGTTTAGTTCATCGAGGAAGAAATTTGAAGTGATCAAGCTATCTA
AGGCCATCAATTCTGATACTAATACCAAGAGGGCAAATTTATCTGTCACAAGGAAGGAAAAGATCCAGTTGCCGAGTTACAGCGACGGTCGTGGAGGCAGGACGTATCAT
ATCAGTGAATTCCTGAATCACCCTTCAGGAGTTGAAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTTCAGTTGATTGATGCTAACACATACAGATGCACTCTGCCAAA
ATTGCAACTTTTGAACTTTGAAGCTGCTCCTACACTGGATCTACGAGTGATCCCGACAGACAAAGATTTTACAGTTGAGATGCTTTCGTGCAAGTTCGAAGGTTCAGAAT
TGGTGGAACGTCAAAACGAACATTTTTCGGCCTTGATGATTAATCACTTAACATGGGAGACAATTGATTCGAATTCGTTTCTGGAAGCCGATGTGAAGTTGAATCTGTCC
CTGGAGATTTATACACTGCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGAGTCCAGGGAATTTGATGTTACAAGCTCTCTTGGACAACCTTGTACCTCTGCTGCTGCG
GCAATTAGTGCAAGATTATGAAAAGTGGATCCATCAGCAGCTAGATCATTCCCGTGTTTCCATCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTTGACTTCAGGATTCCGGTCGCCGACCTCCGCCTCCTCCGCCGCCCGAGTGAGATTCCGGACAAGGCAGATATTTCGAGCACAAAATTCCAGTTGAAGAGGAT
GGTGGTTTTGACTACTAAATGGTGTGGGCAAGGAAAAGGCTCGTTTCCATTGTTTGCTGCCCAGAAGTTTAGTTCATCGAGGAAGAAATTTGAAGTGATCAAGCTATCTA
AGGCCATCAATTCTGATACTAATACCAAGAGGGCAAATTTATCTGTCACAAGGAAGGAAAAGATCCAGTTGCCGAGTTACAGCGACGGTCGTGGAGGCAGGACGTATCAT
ATCAGTGAATTCCTGAATCACCCTTCAGGAGTTGAAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTTCAGTTGATTGATGCTAACACATACAGATGCACTCTGCCAAA
ATTGCAACTTTTGAACTTTGAAGCTGCTCCTACACTGGATCTACGAGTGATCCCGACAGACAAAGATTTTACAGTTGAGATGCTTTCGTGCAAGTTCGAAGGTTCAGAAT
TGGTGGAACGTCAAAACGAACATTTTTCGGCCTTGATGATTAATCACTTAACATGGGAGACAATTGATTCGAATTCGTTTCTGGAAGCCGATGTGAAGTTGAATCTGTCC
CTGGAGATTTATACACTGCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGAGTCCAGGGAATTTGATGTTACAAGCTCTCTTGGACAACCTTGTACCTCTGCTGCTGCG
GCAATTAGTGCAAGATTATGAAAAGTGGATCCATCAGCAGCTAGATCATTCCCGTGTTTCCATCTCTTGA
Protein sequenceShow/hide protein sequence
MEVDFRIPVADLRLLRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYH
ISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLS
LEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS