CuGenDBv2

Tan0012515 (gene) of Snake gourd v1 genome

Gene ID: Tan0012515
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG01:105735652..105744199
RNA-Seq Expression: Tan0012515
Synteny: Tan0012515
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits: hit | e-value | % identity (alignment follows each hit)
KAG7024718.1 hypothetical protein SDJN02_13536, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.9e-111 | 73.62
Query:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL
        TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLMWIIAF                LQ+NLQ L IEFDNV+            AMKEQK+MELMLDEL
Subjt:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL

Query:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGS---------------SSVVQDLFQSDAWKDGKISKAKLIKMLE
        EMIHEKATNKIALLESE+QKL+NENLRLQEIKGK YWSLKGLD K+EAQK A  +GS               SS+VQDL +SDA KDG +SK KLI +LE
Subjt:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGS---------------SSVVQDLFQSDAWKDGKISKAKLIKMLE

Query:  TGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNW
        +G +SGVLI +HTS+I S+DED TEILDEQRE+A+ RSLFSTLLSLLVGVIIWKAEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAVALLSFNW
Subjt:  TGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNW

Query:  FVLGILAYPTLPNIARLLAPLASRFV
        FVLGILAYPTLPN+AR+LAPLASR V
Subjt:  FVLGILAYPTLPNIARLLAPLASRFV

XP_022137881.1 uncharacterized protein LOC111009200 isoform X1 [Momordica charantia] | 2.9e-98 | 67.08
Query:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD
        M  CS ILK  ++VV TW ELLKTS+++HLNIFWT LMW+IAF                L+Q+LQ LEIE DNV+            AMKEQK+MELMLD
Subjt:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD

Query:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTE-----------AQKVADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETG
        ELEMIHEKATNKIALLESE+Q L+NE LR QEIKGK YWSLKG   KT            + + + Y G SSV+QDL QSDAWKDG IS  KLIK+LE+G
Subjt:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTE-----------AQKVADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETG

Query:  LKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFV
        LKS V+I   TSEI SKDED  EILD+QRE+A+SRSLFST+LSLLVGV+IW+AEE HLCL++ALL VVSISLKSVVEFF TIKNKPALDAVALLS N FV
Subjt:  LKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFV

Query:  LGILAYPTLPNIARLLAPLASRFVG
        LGILAYPTLP IA LLAPLASRFVG
Subjt:  LGILAYPTLPNIARLLAPLASRFVG

XP_031740628.1 uncharacterized protein LOC101204571 isoform X1 [Cucumis sativus] | 2.1e-112 | 71.6
Query:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD
        MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIA                 LQQ LQ LEI+FDNV+            AMKE K+MELMLD
Subjt:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD

Query:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKML
        ELEMIHEKATNKIALLESE+Q+L+N+NLRLQEIKGK YWSLKGLD K+EAQK               +    SSS+VQDL Q DA KD  ISK KLIK+L
Subjt:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKML

Query:  ETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFN
        E+GLKSGVLI SHT EI SKDE  T++LDEQRE+A+SRSLFSTLLSLLVGVIIW+AEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAVALLSFN
Subjt:  ETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFN

Query:  WFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFSI
        WFVLGILAYPTLPNI+R LA   +    ++VEWFGFSI
Subjt:  WFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFSI

XP_038898361.1 uncharacterized protein LOC120086031 isoform X1 [Benincasa hispida] | 3.8e-114 | 72.78
Query:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD
        MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMW++A                 L+Q LQ LEIEF+NV+            AM+E K+MELMLD
Subjt:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD

Query:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKML
        ELEMIHEKATNKI+LLESE+QKL+NENLRLQEIKGK YWSLKGLD K+EAQK               +   GSSS++QDLFQSDA KDG ISK KLIK+L
Subjt:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKML

Query:  ETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFN
        ++GLKSGV I SHT EI SKDED TEILDEQRE+AISRSLFSTLLSLLVGVIIW+AEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAV+LLSFN
Subjt:  ETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFN

Query:  WFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFSI
        WFVLGILAYPTLP IARLLAP   R    IVEWF FSI
Subjt:  WFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFSI

XP_038898364.1 uncharacterized protein LOC120086031 isoform X3 [Benincasa hispida] | 1.7e-101 | 70.79
Query:  TSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDELEMIHEKATNKIALLESELQKL
        TSVSLHLNIFWTTLMW++A                 L+Q LQ LEIEF+NV+            AM+E K+MELMLDELEMIHEKATNKI+LLESE+QKL
Subjt:  TSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDELEMIHEKATNKIALLESELQKL

Query:  KNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETGLKSGVLICSHTSEIPSKDED
        +NENLRLQEIKGK YWSLKGLD K+EAQK               +   GSSS++QDLFQSDA KDG ISK KLIK+L++GLKSGV I SHT EI SKDED
Subjt:  KNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETGLKSGVLICSHTSEIPSKDED

Query:  ATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARLLAPLA
         TEILDEQRE+AISRSLFSTLLSLLVGVIIW+AEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAV+LLSFNWFVLGILAYPTLP IARLLAP  
Subjt:  ATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARLLAPLA

Query:  SRFVGQIVEWFGFSI
         R    IVEWF FSI
Subjt:  SRFVGQIVEWFGFSI

TrEMBL top hits: hit | e-value | % identity (alignment follows each hit)
A0A1S3B9G5 uncharacterized protein LOC103487633 | 3.1e-93 | 70.33
Query:  MWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKT
        MWIIA                 LQQ LQ LEIEFDNV+            A+KE K+MELMLDELEMIHEKATNKIALLESE+QKL+NENLRLQEIKGK 
Subjt:  MWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKT

Query:  YWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAIS
        YWSLKGLD K+E QK               +     SSVVQDL Q DA KDG ISK KL+K+LE+GLKSGVLI SHT EI SKDE  TE+LDEQRE+AIS
Subjt:  YWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAIS

Query:  RSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFS
        RSLFS LLSLLVGVIIW+AEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAVALLSFNWFVLGILAYPTLPNIAR LAPLASR    +VEW GFS
Subjt:  RSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFS

A0A6J1C7X7 uncharacterized protein LOC111009200 isoform X1 | 1.4e-98 | 67.08
Query:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD
        M  CS ILK  ++VV TW ELLKTS+++HLNIFWT LMW+IAF                L+Q+LQ LEIE DNV+            AMKEQK+MELMLD
Subjt:  MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLD

Query:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTE-----------AQKVADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETG
        ELEMIHEKATNKIALLESE+Q L+NE LR QEIKGK YWSLKG   KT            + + + Y G SSV+QDL QSDAWKDG IS  KLIK+LE+G
Subjt:  ELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTE-----------AQKVADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETG

Query:  LKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFV
        LKS V+I   TSEI SKDED  EILD+QRE+A+SRSLFST+LSLLVGV+IW+AEE HLCL++ALL VVSISLKSVVEFF TIKNKPALDAVALLS N FV
Subjt:  LKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFV

Query:  LGILAYPTLPNIARLLAPLASRFVG
        LGILAYPTLP IA LLAPLASRFVG
Subjt:  LGILAYPTLPNIARLLAPLASRFVG

A0A6J1F5N4 uncharacterized protein LOC111442593 isoform X1 | 6.6e-96 | 67.08
Query:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL
        TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLMWIIAF                LQ+NLQ L IEFDNV+            AMKEQK+MELMLDEL
Subjt:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL

Query:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLET
        EMIHEKATNKIALLESE+QKL+NENLRLQEIKGK YWSLKGLD K+EAQK               +     SS+VQDL +SDA KD              
Subjt:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLET

Query:  GLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWF
                         +DED TEILDEQRE+A+ RSLFSTLLSLLVGVIIWKAEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAVALLSFNWF
Subjt:  GLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWF

Query:  VLGILAYPTLPNIARLLAPLASRFV
        VLGILAYPTLPN+AR+LAPLASR V
Subjt:  VLGILAYPTLPNIARLLAPLASRFV

A0A6J1F6D4 uncharacterized protein LOC111442593 isoform X2 | 6.6e-96 | 67.08
Query:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL
        TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLMWIIAF                LQ+NLQ L IEFDNV+            AMKEQK+MELMLDEL
Subjt:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL

Query:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLET
        EMIHEKATNKIALLESE+QKL+NENLRLQEIKGK YWSLKGLD K+EAQK               +     SS+VQDL +SDA KD              
Subjt:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKV--------------ADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLET

Query:  GLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWF
                         +DED TEILDEQRE+A+ RSLFSTLLSLLVGVIIWKAEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAVALLSFNWF
Subjt:  GLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWF

Query:  VLGILAYPTLPNIARLLAPLASRFV
        VLGILAYPTLPN+AR+LAPLASR V
Subjt:  VLGILAYPTLPNIARLLAPLASRFV

A0A6J1IHW7 uncharacterized protein LOC111477072 isoform X2 | 4.0e-93 | 65.95
Query:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL
        TCS ILKTFVVVVQTWLELLKTSVSLHLNI WTTLMWIIAF                LQ+NLQ L IEFDNV+            AMKEQK+MELMLDEL
Subjt:  TCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAF----------------LQQNLQLLEIEFDNVV------------AMKEQKVMELMLDEL

Query:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGS---------------SSVVQDLFQSDAWKDGKISKAKLIKMLE
        EMI+EKATNKIALLESE+QKL+NEN RLQEIKGK YWSLKG D K+EAQK +  +GS               SS++QDL +S+A KD             
Subjt:  EMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGS---------------SSVVQDLFQSDAWKDGKISKAKLIKMLE

Query:  TGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNW
                          +DED TEILDEQRE+A+ RSLFSTLLSLLVGVIIWKAEEPHLCLV+AL+FVVSISLKSVVEFF TIKNKPALDAV+LLSFNW
Subjt:  TGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLCLVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNW

Query:  FVLGILAYPTLPNIARLLAPLASRFV
        FVLGILAYPTLPN+ARLLAPLASR V
Subjt:  FVLGILAYPTLPNIARLLAPLASRFV

SwissProt top hits: hit | e-value | % identity (alignment follows each hit)
No hits found

Arabidopsis top hits: hit | e-value | % identity (alignment follows each hit)
AT5G45310.1 unknown protein | 2.5e-39 | 44.35
Query:  EIEFDNVVAMKEQKVMELMLDELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGSSSVVQDLFQSDAWKDGKISK
        EIE +   A+KE ++ME  LDELE  H++A +KI  LE+ELQ+LK ENL+L E+ GK Y S KG    +E           S ++ +      K   I  
Subjt:  EIEFDNVVAMKEQKVMELMLDELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTYWSLKGLDDKTEAQKVADYIGSSSVVQDLFQSDAWKDGKISK

Query:  AKLIKMLETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLC--LVMALLFVVSISLKSVVEFFMTIKNKPAL
        A   K   T +KS  L     S IP  +E    +L  ++ +A+SRS+FS +L+L+VG+++++A+E  LC  L+ AL  VV ISLKSVV+FF T+KNKPAL
Subjt:  AKLIKMLETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLC--LVMALLFVVSISLKSVVEFFMTIKNKPAL

Query:  DAVALLSFNWFVLGILAYPTLPNIARLLAPLASRFVGQI
        DAVAL+S NWF++G L YPTLP +AR++ P     VG +
Subjt:  DAVALLSFNWFVLGILAYPTLPNIARLLAPLASRFVGQI


Sequences
CDS sequence
ATGCGTACATGCTCATTTATCTTGAAAACCTTTGTTGTTGTTGTTCAAACTTGGTTGGAGCTGTTGAAGACCTCGGTCAGTCTTCACTTAAATATATTTTGGACAACTTT
GATGTGGATAATCGCGTTTTTGCAACAAAATTTGCAATTGCTGGAAATTGAGTTCGATAATGTTGTTGCTATGAAAGAGCAGAAGGTGATGGAATTGATGTTGGACGAAC
TTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTGAGCTGCAGAAATTGAAAAATGAAAATCTTCGACTGCAAGAAATCAAGGGTAAGACATAT
TGGAGCTTAAAAGGTCTTGATGACAAAACAGAAGCACAAAAGGTGGCAGACTATATTGGCAGCAGCAGCGTTGTTCAAGACCTCTTTCAAAGTGACGCTTGGAAAGACGG
TAAGATATCTAAAGCAAAATTGATCAAAATGTTAGAAACCGGGTTAAAATCCGGTGTGCTAATTTGCTCTCATACTTCTGAAATCCCATCAAAAGATGAAGATGCCACTG
AAATTCTTGATGAACAAAGAGAGCTTGCAATTTCCCGAAGTCTTTTCAGTACCTTATTGTCACTTTTGGTTGGAGTGATTATATGGAAAGCTGAAGAGCCTCACTTGTGC
CTTGTAATGGCTCTCTTGTTTGTGGTTAGCATCTCATTGAAGAGCGTCGTTGAGTTTTTCATGACTATTAAGAATAAACCTGCTTTGGATGCTGTTGCTCTTTTGAGCTT
CAACTGGTTCGTACTTGGAATTCTGGCTTACCCAACGCTGCCAAATATTGCTCGTTTGCTTGCTCCTCTGGCCTCGAGGTTCGTCGGACAAATAGTGGAATGGTTTGGTT
TCTCAATTTTCTGA
mRNA sequence
ATGCGTACATGCTCATTTATCTTGAAAACCTTTGTTGTTGTTGTTCAAACTTGGTTGGAGCTGTTGAAGACCTCGGTCAGTCTTCACTTAAATATATTTTGGACAACTTT
GATGTGGATAATCGCGTTTTTGCAACAAAATTTGCAATTGCTGGAAATTGAGTTCGATAATGTTGTTGCTATGAAAGAGCAGAAGGTGATGGAATTGATGTTGGACGAAC
TTGAAATGATACATGAAAAGGCGACCAACAAGATTGCACTCTTAGAAAGTGAGCTGCAGAAATTGAAAAATGAAAATCTTCGACTGCAAGAAATCAAGGGTAAGACATAT
TGGAGCTTAAAAGGTCTTGATGACAAAACAGAAGCACAAAAGGTGGCAGACTATATTGGCAGCAGCAGCGTTGTTCAAGACCTCTTTCAAAGTGACGCTTGGAAAGACGG
TAAGATATCTAAAGCAAAATTGATCAAAATGTTAGAAACCGGGTTAAAATCCGGTGTGCTAATTTGCTCTCATACTTCTGAAATCCCATCAAAAGATGAAGATGCCACTG
AAATTCTTGATGAACAAAGAGAGCTTGCAATTTCCCGAAGTCTTTTCAGTACCTTATTGTCACTTTTGGTTGGAGTGATTATATGGAAAGCTGAAGAGCCTCACTTGTGC
CTTGTAATGGCTCTCTTGTTTGTGGTTAGCATCTCATTGAAGAGCGTCGTTGAGTTTTTCATGACTATTAAGAATAAACCTGCTTTGGATGCTGTTGCTCTTTTGAGCTT
CAACTGGTTCGTACTTGGAATTCTGGCTTACCCAACGCTGCCAAATATTGCTCGTTTGCTTGCTCCTCTGGCCTCGAGGTTCGTCGGACAAATAGTGGAATGGTTTGGTT
TCTCAATTTTCTGA
Protein sequence
MRTCSFILKTFVVVVQTWLELLKTSVSLHLNIFWTTLMWIIAFLQQNLQLLEIEFDNVVAMKEQKVMELMLDELEMIHEKATNKIALLESELQKLKNENLRLQEIKGKTY
WSLKGLDDKTEAQKVADYIGSSSVVQDLFQSDAWKDGKISKAKLIKMLETGLKSGVLICSHTSEIPSKDEDATEILDEQRELAISRSLFSTLLSLLVGVIIWKAEEPHLC
LVMALLFVVSISLKSVVEFFMTIKNKPALDAVALLSFNWFVLGILAYPTLPNIARLLAPLASRFVGQIVEWFGFSIF
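
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the table layout are illustrative and not part of the database record.

```python
# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with each position running over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base up to the first stop codon.

    Whitespace (for example line breaks in a pasted sequence) is ignored,
    unknown codons become 'X', and a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and final 24 nt of the CDS listed in this record.
print(translate("ATGCGTACATGCTCATTTATCTTGAAAACC"))  # MRTCSFILKT
print(translate("TGGTTTGGTTTCTCAATTTTCTGA"))        # WFGFSIF
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence gives a quick consistency check on the record; the two fragments shown match the start and end of the listed protein.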