CuGenDBv2

Tan0021080 (gene) of Snake gourd v1 genome

Gene ID: Tan0021080
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Survival motor neuron
Genome location: LG01:26477595..26486228
RNA-Seq Expression: Tan0021080
Synteny: Tan0021080
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
                     GO:0016020 - membrane (cellular component)
InterPro domains: IPR040424 - Survival motor neuron-like protein 1
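
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the field can be split and the gene span measured as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG01:26477595..26486228' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return abs(end - start) + 1

scaffold, start, end = parse_location("LG01:26477595..26486228")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 8634 bp.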


Homology
GenBank top hits (e value, % identity, alignment)
KAG6577733.1 hypothetical protein SDJN03_25307, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-116; identity: 79.58%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDR+YWDDSMIVKAMDEAM+KYK MHG EV  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNETTNTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE +E SNLN+KGE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSAS PS +  I SY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSS ADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

XP_022923599.1 uncharacterized protein LOC111431235 [Cucurbita moschata] (e value: 2.7e-117; identity: 79.58%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDR++WDDSMIV+AMDEAM+KYK MHG EV  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNETTNTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE +E SNLN+KGE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS +  IPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSS ADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

XP_022965361.1 uncharacterized protein LOC111465241 isoform X1 [Cucurbita maxima] (e value: 1.4e-113; identity: 78.78%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDRMYWDDSMIVKAMDEAM+KYK MHG ++  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNET NTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE IE SNLN++GE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS + AIPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVE
        AGPQSSSLADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGK +++
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVE

XP_022965362.1 uncharacterized protein LOC111465241 isoform X2 [Cucurbita maxima] (e value: 2.4e-118; identity: 80.28%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDRMYWDDSMIVKAMDEAM+KYK MHG ++  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNET NTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE IE SNLN++GE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS + AIPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSSLADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

XP_023551935.1 uncharacterized protein LOC111809760 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 1.2e-117; identity: 79.93%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDRMYWDDSMIVKAMDEAM+KYK MHG EV  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNE TNTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE ++ SNLN+KGE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS + AIPSY PSS+P+ 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSS ADGDIIKTA+DSAARAISS+KTVN+   EKESE H GIMPQSGASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
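
In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch only, identities and positives can be counted from that line as shown below. `midline_stats` is a hypothetical helper, and its result is not guaranteed to reproduce the % identity reported on this page, which was presumably computed by the search tool over the full alignment.

```python
def midline_stats(midline, aligned_length=None):
    """Count identities and positives in a BLAST-style match line.

    Letters are identical residues, '+' is a conservative substitution,
    and spaces are mismatches or gaps. Pass aligned_length explicitly when
    trailing spaces of the match line may have been stripped.
    """
    length = aligned_length if aligned_length is not None else len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / length

# Toy example (not taken from the record): five aligned columns.
query = "MGLDR"
midline = "M LD+"
ids, pos, pct = midline_stats(midline, len(query))
print(ids, pos, round(pct, 2))
```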

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BL18 uncharacterized protein LOC103490751 isoform X3 (e value: 1.5e-105; identity: 75.53%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        MGLD+MYWD+SMIVKAMDEAM+KYKIMHG EV C+ AEGGGV   CGK DE KR VDEES    NNV  EV ETT+T EA ENI VE   I+C  FSDAL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        HV+ETQ+EP+E S+L     E YN LLKQYYELEEKRQKVLEQLYQCG G WNYQDV+AGSD+GTQWGTS+A  EHPVSASQPSH   IPSY P+ +PIL
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAI-SSVKTVNEEKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSSL D DIIKTA+DSA RAI SS+KTVN+ KES+RHD IMPQSG SSETDLA VLNAWYSAGFYTGKYL+EQSHAKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAI-SSVKTVNEEKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

A0A6J1CU98 uncharacterized protein LOC111014833 isoform X3 (e value: 2.1e-104; identity: 74.56%)
Query:  MYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDALHVKET
        M  DDS +V AM+EAM+KYKIMHG E+  +  EGG  F G G+ DEPKRG DE+SNIEANNV  EV+E TNTS  NENI+VEPCPISCA FSDALHVKET
Subjt:  MYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDALHVKET

Query:  QQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPILAGPQS
        QQ PIE SNLN+KG EGYN+LL+QYYELEEKRQKVL+QLY    G WNY DVSAGS +GTQWGTSSAY EHPV ASQ SH+HAI + WPSS+PI  GPQS
Subjt:  QQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPILAGPQS

Query:  SSLADGDIIKTAIDSAARAISSVKT-------VNEEKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        SSLADGDIIKTA+D+AARAISS+ T       VN+EK SER DGIMPQS ASSETDLAAV NAWYSAGFYTGKYLVEQS+AKK
Subjt:  SSLADGDIIKTAIDSAARAISSVKT-------VNEEKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

A0A6J1E6W0 uncharacterized protein LOC111431235 (e value: 1.3e-117; identity: 79.58%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDR++WDDSMIV+AMDEAM+KYK MHG EV  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNETTNTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE +E SNLN+KGE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS +  IPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSS ADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

A0A6J1HK48 uncharacterized protein LOC111465241 isoform X2 (e value: 1.2e-118; identity: 80.28%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDRMYWDDSMIVKAMDEAM+KYK MHG ++  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNET NTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE IE SNLN++GE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS + AIPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK
        AGPQSSSLADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGKYLVEQS AKK
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKK

A0A6J1HLH3 uncharacterized protein LOC111465241 isoform X1 (e value: 6.6e-114; identity: 78.78%)
Query:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL
        M LDRMYWDDSMIVKAMDEAM+KYK MHG ++  + AEGGGVF GCGK DEP+R VDEES I AN+V  EVNET NTSEA ENI+VEPCPISC  FS AL
Subjt:  MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNV--EVNETTNTSEANENITVEPCPISCAVFSDAL

Query:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL
        +VKET+QE IE SNLN++GE+GYN+LLKQYYELEEKRQKVLEQLYQCG G WNYQDV AGSDIG QWGTS+AY EHPVSASQPS + AIPSY PSS+PI 
Subjt:  HVKETQQEPIEHSNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPIL

Query:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVE
        AGPQSSSLADGDIIKTA+DSAARAISS+KTVN+   EKESE HDGIMPQ GASSETDL  VLNAWYSAGFYTGK +++
Subjt:  AGPQSSSLADGDIIKTAIDSAARAISSVKTVNE---EKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGGGCTCGACAGAATGTACTGGGATGATTCCATGATCGTCAAAGCCATGGATGAAGCTATGATGAAGTATAAGATAATGCATGGACTTGAAGTCCTCTGTCTTCCAGC
CGAGGGAGGAGGAGTTTTTACCGGCTGCGGTAAGAGAGATGAACCGAAAAGGGGTGTAGATGAAGAGAGCAATATCGAAGCAAATAATGTTGAAGTCAATGAGACTACAA
ATACCTCAGAAGCTAATGAAAATATCACCGTAGAGCCATGCCCTATATCTTGTGCGGTTTTTTCAGATGCTCTACATGTGAAAGAGACGCAACAGGAGCCCATTGAACAC
TCCAATTTAAATATAAAAGGTGAGGAGGGCTATAACCAGCTACTCAAGCAGTATTATGAGCTTGAGGAGAAGAGGCAGAAGGTTCTAGAACAGCTGTATCAATGCGGTGA
TGGTGTTTGGAACTACCAGGATGTCAGCGCAGGGTCTGACATTGGAACTCAATGGGGAACATCTTCTGCTTATCTAGAACACCCAGTCTCTGCAAGCCAACCTTCTCATA
GCCATGCAATACCCTCCTATTGGCCCTCCAGTTTTCCAATTTTAGCTGGTCCTCAAAGTTCGTCCCTTGCTGATGGCGACATTATCAAAACTGCAATTGATTCTGCAGCA
AGAGCTATATCCTCCGTGAAGACTGTAAATGAAGAGAAAGAGAGCGAGAGACATGATGGGATAATGCCTCAAAGTGGTGCTAGCTCTGAAACAGACCTTGCTGCTGTTTT
AAATGCCTGGTATTCTGCAGGCTTCTACACTGGCAAATACCTTGTGGAGCAATCTCATGCCAAGAAACAGTGA
mRNA sequence
GGGTCATCAAACCCGAAAATCGTTTCGTCTTGCTATTGCCACTCTATAGTTCTATAGACAGGTCGCGGCAAGCTGAGGCGGGAACACATTTCCTTCAGCTTTGTTTTTTC
TTTCTTCTTTGCAATCGCATCATTCTCAGAATTCAATCGAGAAGATGGGGCTCGACAGAATGTACTGGGATGATTCCATGATCGTCAAAGCCATGGATGAAGCTATGATG
AAGTATAAGATAATGCATGGACTTGAAGTCCTCTGTCTTCCAGCCGAGGGAGGAGGAGTTTTTACCGGCTGCGGTAAGAGAGATGAACCGAAAAGGGGTGTAGATGAAGA
GAGCAATATCGAAGCAAATAATGTTGAAGTCAATGAGACTACAAATACCTCAGAAGCTAATGAAAATATCACCGTAGAGCCATGCCCTATATCTTGTGCGGTTTTTTCAG
ATGCTCTACATGTGAAAGAGACGCAACAGGAGCCCATTGAACACTCCAATTTAAATATAAAAGGTGAGGAGGGCTATAACCAGCTACTCAAGCAGTATTATGAGCTTGAG
GAGAAGAGGCAGAAGGTTCTAGAACAGCTGTATCAATGCGGTGATGGTGTTTGGAACTACCAGGATGTCAGCGCAGGGTCTGACATTGGAACTCAATGGGGAACATCTTC
TGCTTATCTAGAACACCCAGTCTCTGCAAGCCAACCTTCTCATAGCCATGCAATACCCTCCTATTGGCCCTCCAGTTTTCCAATTTTAGCTGGTCCTCAAAGTTCGTCCC
TTGCTGATGGCGACATTATCAAAACTGCAATTGATTCTGCAGCAAGAGCTATATCCTCCGTGAAGACTGTAAATGAAGAGAAAGAGAGCGAGAGACATGATGGGATAATG
CCTCAAAGTGGTGCTAGCTCTGAAACAGACCTTGCTGCTGTTTTAAATGCCTGGTATTCTGCAGGCTTCTACACTGGCAAATACCTTGTGGAGCAATCTCATGCCAAGAA
ACAGTGAAAGGCAAGCTTCCTCATGCCTAGGCTCATGTTTGTTACTAATTTTGCCCACTTTTCACACCCATGGTGTGCTATGCTACAACTCTATAAATCTTTTCACTGAA
TTTTGCTTGTACAGAAGAAAGCCAGCTCTGATCTAAATGAGTTTGCATTTTGCAGAATTTTTCTACTGTCAATTTTTGTTGTATGCTATTCATCTCCCACTATACTGCAC
CAACTCACCGTAATTAGAGCCGAGCACGAGCTCTATCAACTGAGCTATATCCTCTCCAACTCACCGTAATTAGAGCCGAGCACGAGCTCTATCAACTGAGCTATATCCTC
TCGAACCATGTAGGCCTTGCATGAAGAAGTTAGATGTTTCTTCTATTCTTTTTCTTCCATAGTGAGCCCCTTTAGGCTTGAGTTAGGTTGCAACGTAATTTGATTTTACA
TGTTAATAAGGTCGCGTTTGTGACTTCTTCAATCCCTTAGGCGTGTTATTGATTATTGAAGTCTTGTTTGATAACTATTTGGTTTTTGAAAATTGTG
Protein sequence
MGLDRMYWDDSMIVKAMDEAMMKYKIMHGLEVLCLPAEGGGVFTGCGKRDEPKRGVDEESNIEANNVEVNETTNTSEANENITVEPCPISCAVFSDALHVKETQQEPIEH
SNLNIKGEEGYNQLLKQYYELEEKRQKVLEQLYQCGDGVWNYQDVSAGSDIGTQWGTSSAYLEHPVSASQPSHSHAIPSYWPSSFPILAGPQSSSLADGDIIKTAIDSAA
RAISSVKTVNEEKESERHDGIMPQSGASSETDLAAVLNAWYSAGFYTGKYLVEQSHAKKQ
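
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGGGCTCGACAGAATGTACTGGGATGAT"))
```

The first ten codons translate to MGLDRMYWDD, matching the start of the listed protein sequence. Passing the full CDS (line breaks are ignored) allows the whole protein to be compared in the same way.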