; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025735 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025735
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationscaffold13:30963136..30968127
RNA-Seq ExpressionSpg025735
SyntenySpg025735
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064985.1 DUF1997 family protein [Cucumis melo var. makuwa]3.1e-11386.42Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MV+LTTKW GQG GSFPL  A KFSS RKKFE IK SKA NS+TNTK+ANLSV R+EKI+LPSYS    GRTYHI EFLNHPSG+EAMLNKNALKSFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLR+IPTD+DFTVEMLSCKFEGSELVERQN+HFSA+MINHLTW+TIDSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI+QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

XP_022131393.1 uncharacterized protein LOC111004622 isoform X1 [Momordica charantia]2.8e-11486.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLT+KWCG+G+  FPL A QK SS RKKFEVIKLSKA NS+TNTKRANL VTRKEKI+LPSYSD RGGRTYHISEFL HPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLP LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQN+HFSALMINHLTW+T+DSNSFLE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQAL+D+LVPLLLRQ+VQDYEKWI QQ DHS +S S
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

XP_022929609.1 uncharacterized protein LOC111436145 [Cucurbita moschata]1.2e-11287.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IK+SKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

XP_022997460.1 uncharacterized protein LOC111492370 [Cucurbita maxima]1.5e-11287.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IKLSKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKL LSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

XP_038885685.1 uncharacterized protein LOC120075989 isoform X1 [Benincasa hispida]9.3e-11890.12Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLTTKW GQGKGSFPL AA KFSS RKKFEVIKLSKA NS+TNTKRANLSVTR+EKI+LPSYS  R GR YHI EFLNHPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+T+DSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

TrEMBL top hitse value%identityAlignment
A0A0A0LSA8 Uncharacterized protein4.7e-11580.36Show/hide
Query:  VADLRL----LRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGR
        +AD RL    LRRP +        S   + +RMVVLTTK  GQG GSFPL  A KFSS RKKFE  KLSKA NS+TNTK+ANLSV ++EKI+LPSYS   
Subjt:  VADLRL----LRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGR

Query:  GGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWET
         GRTYHI EFLNHPSG+EAMLNKNALKSFQL+DANTYRCTLPKLQLLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+T
Subjt:  GGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWET

Query:  IDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        IDSNS+LE DVKLNLSLEIYTLPFTLMPTAAVE+PGNLMLQALLDNLVPLLLRQL+QDYEKWI QQLDHS++SIS
Subjt:  IDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A5A7VGG9 DUF1997 family protein1.5e-11386.42Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MV+LTTKW GQG GSFPL  A KFSS RKKFE IK SKA NS+TNTK+ANLSV R+EKI+LPSYS    GRTYHI EFLNHPSG+EAMLNKNALKSFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLPKLQLLNFEAAPTLDLR+IPTD+DFTVEMLSCKFEGSELVERQN+HFSA+MINHLTW+TIDSNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI+QQLDHS++SIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A6J1BT83 uncharacterized protein LOC111004622 isoform X11.4e-11486.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVLT+KWCG+G+  FPL A QK SS RKKFEVIKLSKA NS+TNTKRANL VTRKEKI+LPSYSD RGGRTYHISEFL HPSG+EAMLNKNAL+SFQL+
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTLP LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQN+HFSALMINHLTW+T+DSNSFLE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS
        E+PGNLMLQAL+D+LVPLLLRQ+VQDYEKWI QQ DHS +S S
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS

A0A6J1EP97 uncharacterized protein LOC1114361455.7e-11387.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IK+SKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKLNLSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

A0A6J1K9Q3 uncharacterized protein LOC1114923707.5e-11387.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI
        MVVL TKW GQGKGSFPL A  KFSS RKKFE IKLSKA NS+TNTKRANLSVTRKEKI+LPSYS GRGGRTYHI EFLNHPSG+EAM+NKNALKSFQ +
Subjt:  MVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLI

Query:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV
        DANTYRCTL KLQLLNFEAAPTLDLRVIPT++DFTVEMLSCKFEGSELVERQNEHFSALMINHLTW+++ SNS+LE DVKL LSLEIYTLPFTLMPTAAV
Subjt:  DANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAV

Query:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS
        E+PGNLMLQALLDNLVPLLLRQLVQDYEKWI  QQ+DHS VSIS
Subjt:  ESPGNLMLQALLDNLVPLLLRQLVQDYEKWI-HQQLDHSRVSIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31115.1 Protein of unknown function (DUF1997)7.0e-5553.81Show/hide
Query:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC
        ++ K+AN+S +RK++I+L    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK+QL++FE  P L LRV PT +D TVE+LSC
Subjt:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC

Query:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS
        K EGSEL+E Q+E FSA+M N +TW       FLE DV+LN++LEI T PFT++P +AVE+PGNL++Q L+D LVPLLL+QL++DY++WI +Q  +S
Subjt:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS

AT4G31115.2 Protein of unknown function (DUF1997)7.0e-5553.81Show/hide
Query:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC
        ++ K+AN+S +RK++I+L    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK+QL++FE  P L LRV PT +D TVE+LSC
Subjt:  TNTKRANLSVTRKEKIQLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLID--ANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSC

Query:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS
        K EGSEL+E Q+E FSA+M N +TW       FLE DV+LN++LEI T PFT++P +AVE+PGNL++Q L+D LVPLLL+QL++DY++WI +Q  +S
Subjt:  KFEGSELVERQNEHFSALMINHLTWETIDSNSFLEADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHS

AT5G04440.1 Protein of unknown function (DUF1997)7.6e-1728.83Show/hide
Query:  SFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKI---QLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPK
        SF + ++  F  S K       S A +S T+  R + S T K +    Q  S S  +  R   + E+++ P+   ++L+   +   + +D NT+RC +  
Subjt:  SFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKI---QLPSYSDGRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPK

Query:  LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSF---LEADVKLNLSLEIYTLPFTLMPTAAVESPGNLML
         +  NFE  P L +RV        +++LSCK EGS +V  QN+ F A M+N ++ ++    S    + +D  + +++EI    F + P  A+E+ G  +L
Subjt:  LQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSF---LEADVKLNLSLEIYTLPFTLMPTAAVESPGNLML

Query:  QALLDNLVPLLLRQLVQDYEKW
          +L  ++P  L QL +DY  W
Subjt:  QALLDNLVPLLLRQLVQDYEKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCCTGAATTTGCTTTTTGATCCTGTAAAGATGAGGATTCCGGTCGCCGACCTCCGCCTCCTCCGCCGCCCGAGTGAGATTCCGGACAAGGCAGATATTTCGAG
CACAAAATTCCAGTTGAAGAGGATGGTGGTTTTGACTACTAAATGGTGTGGGCAAGGAAAAGGCTCGTTTCCATTGTTTGCTGCCCAGAAGTTTAGTTCATCGAGGAAGA
AATTTGAAGTGATCAAGCTATCTAAGGCCATCAATTCTGATACTAATACCAAGAGGGCAAATTTATCTGTCACAAGGAAGGAAAAGATCCAGTTGCCGAGTTACAGCGAC
GGTCGTGGAGGCAGGACGTATCATATCAGTGAATTCCTGAATCACCCTTCAGGAGTTGAAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTTCAGTTGATTGATGCTAA
CACATACAGATGCACTCTGCCAAAATTGCAACTTTTGAACTTTGAAGCTGCTCCTACACTGGATCTACGAGTGATCCCGACAGACAAAGATTTTACAGTTGAGATGCTTT
CGTGCAAGTTCGAAGGTTCAGAATTGGTGGAACGTCAAAACGAACATTTTTCGGCCTTGATGATTAATCACTTAACATGGGAGACAATTGATTCGAATTCGTTTCTGGAA
GCCGATGTGAAGTTGAATCTGTCCCTGGAGATTTATACACTGCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGAGTCCAGGGAATTTGATGTTACAAGCTCTCTTGGA
CAACCTTGTACCTCTGCTGCTGCGGCAATTAGTGCAAGATTATGAAAAGTGGATCCATCAGCAGCTAGATCATTCCCGTGTTTCCATCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCCTGAATTTGCTTTTTGATCCTGTAAAGATGAGGATTCCGGTCGCCGACCTCCGCCTCCTCCGCCGCCCGAGTGAGATTCCGGACAAGGCAGATATTTCGAG
CACAAAATTCCAGTTGAAGAGGATGGTGGTTTTGACTACTAAATGGTGTGGGCAAGGAAAAGGCTCGTTTCCATTGTTTGCTGCCCAGAAGTTTAGTTCATCGAGGAAGA
AATTTGAAGTGATCAAGCTATCTAAGGCCATCAATTCTGATACTAATACCAAGAGGGCAAATTTATCTGTCACAAGGAAGGAAAAGATCCAGTTGCCGAGTTACAGCGAC
GGTCGTGGAGGCAGGACGTATCATATCAGTGAATTCCTGAATCACCCTTCAGGAGTTGAAGCAATGCTTAACAAAAATGCCTTGAAAAGTTTTCAGTTGATTGATGCTAA
CACATACAGATGCACTCTGCCAAAATTGCAACTTTTGAACTTTGAAGCTGCTCCTACACTGGATCTACGAGTGATCCCGACAGACAAAGATTTTACAGTTGAGATGCTTT
CGTGCAAGTTCGAAGGTTCAGAATTGGTGGAACGTCAAAACGAACATTTTTCGGCCTTGATGATTAATCACTTAACATGGGAGACAATTGATTCGAATTCGTTTCTGGAA
GCCGATGTGAAGTTGAATCTGTCCCTGGAGATTTATACACTGCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGAGTCCAGGGAATTTGATGTTACAAGCTCTCTTGGA
CAACCTTGTACCTCTGCTGCTGCGGCAATTAGTGCAAGATTATGAAAAGTGGATCCATCAGCAGCTAGATCATTCCCGTGTTTCCATCTCTTGA
Protein sequenceShow/hide protein sequence
MEFLNLLFDPVKMRIPVADLRLLRRPSEIPDKADISSTKFQLKRMVVLTTKWCGQGKGSFPLFAAQKFSSSRKKFEVIKLSKAINSDTNTKRANLSVTRKEKIQLPSYSD
GRGGRTYHISEFLNHPSGVEAMLNKNALKSFQLIDANTYRCTLPKLQLLNFEAAPTLDLRVIPTDKDFTVEMLSCKFEGSELVERQNEHFSALMINHLTWETIDSNSFLE
ADVKLNLSLEIYTLPFTLMPTAAVESPGNLMLQALLDNLVPLLLRQLVQDYEKWIHQQLDHSRVSIS