; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020130 (gene) of Snake gourd v1 genome

Gene IDTan0020130
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionStress response NST1-like protein
Genome locationLG04:84227892..84237615
RNA-Seq ExpressionTan0020130
SyntenyTan0020130
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152172.1 uncharacterized protein LOC101207869 [Cucumis sativus]8.7e-10784.05Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARASWS FSKRLKP ET+SFCSKSHI TNK++NNGKINGDNKVE DLSSYNEAYKQLDNLD MTASKILFT+P KKKKFG+DFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHE  PELQEVKTRLDKLE TIKEIAVESRKQSG+G ITKNSEKG++  KTKHG NI  
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
               +S+KSMDDHLGGQK+VPAPVLPKGR SEST+R+D KH+NHG GSSPDA+R
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

XP_008454169.1 PREDICTED: uncharacterized protein LOC103494654 [Cucumis melo]5.8e-10382.1Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARASWS FSKRLKP ET+SFCSK HI TNK++NNGKINGDNKV+ DLSSY+EAYKQLDNLDFMTASKILFT+P KKKKFG+DFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEE EKIHE  PELQEVKTRLDKLE+TIKEIAVESRKQSG+G ITKNSEKG++  KTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 +KSMDDHLGGQK+VPAPVLPK  +SEST+RED KH N GEGSS D KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

XP_022956225.1 uncharacterized protein LOC111457985 [Cucurbita moschata]3.9e-9982.1Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARAS SRFSKRLKPF+T  FCSKS I TNKN+NNG+ING NKVESDLSSY EAYKQLDNLDFMTASKILFT+PPKKKKFGIDFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKK EEE AKQI+LEE E+IH+K  ELQEVKTRLDKLEETIKEIAVESRKQSGSGI+TKNSEK Q VDKTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 SKSMDDHLGGQK+VPAPVLPK R+  ST+ ED KHQN G  SSPD+KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

XP_022980157.1 uncharacterized protein LOC111479631 [Cucurbita maxima]1.9e-9881.71Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARAS SRFSKRLKPF+T  FCSKS I TNKN+NNG+ING NKVESDLSSY EAYKQLDNLDFMTA KILFTEPPKKKKFGIDFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKK EEE AKQI+L+E E+IH+K  ELQEVKTRLDKLEETIKEIAVESRKQSGSGI+TKNSEK Q VDKTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 SKSMDDHLGGQK+VPAPVLPK R+  ST+ ED KHQN G  SSPD+KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

XP_038901255.1 uncharacterized protein LOC120088201 [Benincasa hispida]2.4e-10484.77Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        M RARASW+RFSKRLKPFET SFCSKSHI  NKN+   KINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFT+P  KKKFGIDFHLVQLFFACMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQ+ELEETE+IHEK  ELQEVK RLDKLEETIKEIAVE RKQSG+GIITKNSEKGQ+VDKTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAK
                 SKSMDD LGGQK+VPAPVLPKGR+SEST+REDGKHQN   GSSP AK
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAK

TrEMBL top hitse value%identityAlignment
A0A0A0KTR7 Uncharacterized protein4.2e-10784.05Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARASWS FSKRLKP ET+SFCSKSHI TNK++NNGKINGDNKVE DLSSYNEAYKQLDNLD MTASKILFT+P KKKKFG+DFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHE  PELQEVKTRLDKLE TIKEIAVESRKQSG+G ITKNSEKG++  KTKHG NI  
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
               +S+KSMDDHLGGQK+VPAPVLPKGR SEST+R+D KH+NHG GSSPDA+R
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

A0A1S3BZ75 uncharacterized protein LOC1034946542.8e-10382.1Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARASWS FSKRLKP ET+SFCSK HI TNK++NNGKINGDNKV+ DLSSY+EAYKQLDNLDFMTASKILFT+P KKKKFG+DFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEE EKIHE  PELQEVKTRLDKLE+TIKEIAVESRKQSG+G ITKNSEKG++  KTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 +KSMDDHLGGQK+VPAPVLPK  +SEST+RED KH N GEGSS D KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

A0A6J1DGT5 uncharacterized protein LOC1110209342.0e-9679.77Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MF AR SW RFSKR KPF+T+SFCSKSH PTN        NG+NKVESDLSSY EAYKQLDNLDFMTASKILFT+PPKKKKFGIDFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELK+KKEEEEKAKQ+ELEETE+I EK PELQEVK RLDKLEETIKEIAVESRK SGSG   KNSEK +E  K KHGEN   
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
         NMGN SESSKS++DHLG QK+  APVLPKGR SESTS+E+G+H N G GSSPDAKR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

A0A6J1GVZ5 uncharacterized protein LOC1114579851.9e-9982.1Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARAS SRFSKRLKPF+T  FCSKS I TNKN+NNG+ING NKVESDLSSY EAYKQLDNLDFMTASKILFT+PPKKKKFGIDFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKK EEE AKQI+LEE E+IH+K  ELQEVKTRLDKLEETIKEIAVESRKQSGSGI+TKNSEK Q VDKTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 SKSMDDHLGGQK+VPAPVLPK R+  ST+ ED KHQN G  SSPD+KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

A0A6J1IQM5 uncharacterized protein LOC1114796319.4e-9981.71Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        MFRARAS SRFSKRLKPF+T  FCSKS I TNKN+NNG+ING NKVESDLSSY EAYKQLDNLDFMTA KILFTEPPKKKKFGIDFHLVQLFF CMPSLA
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP
        VYLVAQYARYEMRKMEADLELKKKK EEE AKQI+L+E E+IH+K  ELQEVKTRLDKLEETIKEIAVESRKQSGSGI+TKNSEK Q VDKTKHG NI+P
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINP

Query:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR
                 SKSMDDHLGGQK+VPAPVLPK R+  ST+ ED KHQN G  SSPD+KR
Subjt:  SNMGNTSESSKSMDDHLGGQKVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80700.1 unknown protein2.0e-3749.25Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        M R R SWS  S RLK + T+ FC+K      K  ++   + D   ES +S Y+E YK+LD LDF+TA+KILFTEPPKK KFG D+H+VQ    C+PS+A
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQ------IELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTK
        VYLVAQYAR +M+ M+A+L  KK+KEEE+K K+      +++E   K HE   EL E++ RL K+EETIKEI +E++K SG+        K QE   TK
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQ------IELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTK

AT1G80980.1 unknown protein2.7e-3748.74Show/hide
Query:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA
        M R R SWS  S RLK + T+ FC+K      K  ++  +   ++ ES +S Y+E YK+LD LDF+TA+KILFTEPPKK KFG D+H+VQ    C+PS+A
Subjt:  MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLA

Query:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQ------IELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTK
        VYLVAQYAR +M+ M+A+L  KK+KEEE+K K+      +++E   K HE   EL E++ RL K+EETIKEI +E++K SG+        K QE   TK
Subjt:  VYLVAQYARYEMRKMEADLELKKKKEEEEKAKQ------IELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCGCGCCAGAGCCAGTTGGAGTCGATTTTCAAAGCGATTGAAGCCATTCGAAACCAAATCATTTTGCTCCAAATCCCACATTCCGACCAACAAGAACAACAACAA
CGGCAAGATCAATGGAGACAACAAGGTTGAGTCGGATCTGAGCAGCTACAACGAAGCTTACAAGCAGCTGGATAACTTGGACTTCATGACCGCATCCAAGATCCTCTTCA
CTGAACCTCCCAAGAAAAAGAAATTTGGGATTGATTTCCATCTGGTGCAACTTTTCTTTGCTTGCATGCCGTCTTTGGCTGTGTATCTGGTGGCCCAATATGCTCGTTAT
GAAATGAGGAAGATGGAAGCGGACCTGGAGCTGAAAAAGAAGAAAGAAGAAGAAGAGAAAGCGAAACAAATAGAGTTAGAAGAGACTGAAAAAATTCATGAAAAGTATCC
GGAACTTCAGGAGGTAAAAACAAGACTTGATAAACTCGAGGAGACCATAAAGGAAATTGCTGTTGAATCTAGAAAACAATCGGGGAGTGGTATTATAACAAAGAACTCTG
AAAAGGGTCAAGAAGTTGATAAAACCAAACATGGGGAAAATATTAATCCAAGCAACATGGGGAATACGTCAGAGTCAAGCAAGTCTATGGATGACCATCTTGGTGGACAA
AAAGTAGTACCGGCTCCAGTTTTGCCCAAAGGTCGTTCAAGTGAGTCTACATCACGAGAAGATGGTAAGCATCAAAACCACGGTGAAGGATCTTCTCCAGATGCCAAAAG
ATGA
mRNA sequenceShow/hide mRNA sequence
AAGGGCAATGTTTCGCGCCAGAGCCAGTTGGAGTCGATTTTCAAAGCGATTGAAGCCATTCGAAACCAAATCATTTTGCTCCAAATCCCACATTCCGACCAACAAGAACA
ACAACAACGGCAAGATCAATGGAGACAACAAGGTTGAGTCGGATCTGAGCAGCTACAACGAAGCTTACAAGCAGCTGGATAACTTGGACTTCATGACCGCATCCAAGATC
CTCTTCACTGAACCTCCCAAGAAAAAGAAATTTGGGATTGATTTCCATCTGGTGCAACTTTTCTTTGCTTGCATGCCGTCTTTGGCTGTGTATCTGGTGGCCCAATATGC
TCGTTATGAAATGAGGAAGATGGAAGCGGACCTGGAGCTGAAAAAGAAGAAAGAAGAAGAAGAGAAAGCGAAACAAATAGAGTTAGAAGAGACTGAAAAAATTCATGAAA
AGTATCCGGAACTTCAGGAGGTAAAAACAAGACTTGATAAACTCGAGGAGACCATAAAGGAAATTGCTGTTGAATCTAGAAAACAATCGGGGAGTGGTATTATAACAAAG
AACTCTGAAAAGGGTCAAGAAGTTGATAAAACCAAACATGGGGAAAATATTAATCCAAGCAACATGGGGAATACGTCAGAGTCAAGCAAGTCTATGGATGACCATCTTGG
TGGACAAAAAGTAGTACCGGCTCCAGTTTTGCCCAAAGGTCGTTCAAGTGAGTCTACATCACGAGAAGATGGTAAGCATCAAAACCACGGTGAAGGATCTTCTCCAGATG
CCAAAAGATGAGGAGAATGCCTCTCTCAATATTACTGCAATATCCTGCCCATTTCCTTTTTGATGAAGTTAGAGAAATGGAGTGAACCATGTTGCCAAGAATCCAATGGT
TTCAGGTTTTGTTTAATAATTATATTGCTCGAGGCAGTAGCCTCTATCATTAACAATGTAAATGCTTAATATGTAAAGGATCGATACCCATTTTGTTACGACAATTAATT
ACCTCATTATTAGCCCCA
Protein sequenceShow/hide protein sequence
MFRARASWSRFSKRLKPFETKSFCSKSHIPTNKNNNNGKINGDNKVESDLSSYNEAYKQLDNLDFMTASKILFTEPPKKKKFGIDFHLVQLFFACMPSLAVYLVAQYARY
EMRKMEADLELKKKKEEEEKAKQIELEETEKIHEKYPELQEVKTRLDKLEETIKEIAVESRKQSGSGIITKNSEKGQEVDKTKHGENINPSNMGNTSESSKSMDDHLGGQ
KVVPAPVLPKGRSSESTSREDGKHQNHGEGSSPDAKR