; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021418 (gene) of Chayote v1 genome

Gene IDSed0021418
OrganismSechium edule (Chayote v1)
DescriptionWAS/WASL-interacting protein family member 2, putative isoform 1
Genome locationLG11:35582849..35584330
RNA-Seq ExpressionSed0021418
SyntenySed0021418
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441420.1 PREDICTED: uncharacterized protein LOC103485542 [Cucumis melo]1.3e-6049.1Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PFQ    FM GF+GA  ++PVKSTNN++Y +FSNQN ARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        VQ ++ +LVL+AL ASS  GRQAKV W +AQPH K EI+NRE N+VN + R G   GGLYH P +PP QN    PN++ +R +HP    VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR++NPTD RQKQG P++RF EEMK+PIQAP+        D I+ RRNN   P+PR F+ EG + QE H LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

XP_022134429.1 uncharacterized protein LOC111006679 [Momordica charantia]4.0e-6250Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MS +  PFQ N  F+ GFMGA  ++PVKST+N++Y LFSN+ CARNLA++AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVFL
        VQ +R +LVL+A+ ASS+GRQAKV W AA PHRKPEI+NRE NI     R  G+  GLY    +PPPQ+ P PPN+SA+R +HPGGPAVKRASSGTGVFL
Subjt:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVFL

Query:  PRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        PRR+VNP++ RQKQG P +RFPEEM  PIQAP         D+++ RRN +  P+PR  + E A+ QELH LPQEWTY
Subjt:  PRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

XP_022957513.1 uncharacterized protein LOC111458886 [Cucurbita moschata]1.1e-5951.64Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        M GQ  P+       NGF+GA  ++PVKST+N+EY LFS QNCARNLA++AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHP-GGPAVKRASSGTGVF
        VQ +  +LVL+AL AS++ RQAKVGW  AQPHRKP+I++RE NIVNVT R  G  G LYH P IPP QN   PPN+SAMR + P GG A+KRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHP-GGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI---MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRRHVNP+D R KQG+P I F EEMK+ IQAP    + S    R N   P+PR  +AEGAVKQELH LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI---MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

XP_038884388.1 uncharacterized protein LOC120075245 isoform X1 [Benincasa hispida]4.8e-6349.1Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PF+ N  FM  ++GA  ++PVKSTNN++Y +FSNQNCARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        +Q ++ +LVL+AL ASS+ GRQAKV W +AQPH K EI++RE N++N + R GG+ GGLYH P +PPPQN   P N S MR +HPG   VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR+++P++ RQKQG+P +RF EEMK+PIQAP+       +DS++ RRNN   P+PR F+ EGA+ QELH LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

XP_038884389.1 uncharacterized protein LOC120075245 isoform X2 [Benincasa hispida]4.8e-6349.1Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PF+ N  FM  ++GA  ++PVKSTNN++Y +FSNQNCARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        +Q ++ +LVL+AL ASS+ GRQAKV W +AQPH K EI++RE N++N + R GG+ GGLYH P +PPPQN   P N S MR +HPG   VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR+++P++ RQKQG+P +RF EEMK+PIQAP+       +DS++ RRNN   P+PR F+ EGA+ QELH LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

TrEMBL top hitse value%identityAlignment
A0A0A0KA94 Uncharacterized protein7.7e-5948.03Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PFQ N  FM GF+GA  +VPVKSTNN++Y +FS QN ARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        VQ ++ +LVL+AL ASS   RQAK  W +AQPH K EI+NRE N+VN + R GG  GGLYH P +PP QN     N + +R +HP    VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR++NP++ RQKQG P++RF EEMK+PIQAP+        D I+ RRNN   P+PR F+ EG + QE H LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

A0A1S3B425 uncharacterized protein LOC1034855426.3e-6149.1Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PFQ    FM GF+GA  ++PVKSTNN++Y +FSNQN ARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        VQ ++ +LVL+AL ASS  GRQAKV W +AQPH K EI+NRE N+VN + R G   GGLYH P +PP QN    PN++ +R +HP    VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR++NPTD RQKQG P++RF EEMK+PIQAP+        D I+ RRNN   P+PR F+ EG + QE H LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

A0A5A7TJ83 Uncharacterized protein6.3e-6149.1Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MSG+  PFQ    FM GF+GA  ++PVKSTNN++Y +FSNQN ARNLA+ AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF
        VQ ++ +LVL+AL ASS  GRQAKV W +AQPH K EI+NRE N+VN + R G   GGLYH P +PP QN    PN++ +R +HP    VKRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSF-GRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRR++NPTD RQKQG P++RF EEMK+PIQAP+        D I+ RRNN   P+PR F+ EG + QE H LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

A0A6J1BYQ6 uncharacterized protein LOC1110066791.9e-6250Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        MS +  PFQ N  F+ GFMGA  ++PVKST+N++Y LFSN+ CARNLA++AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVFL
        VQ +R +LVL+A+ ASS+GRQAKV W AA PHRKPEI+NRE NI     R  G+  GLY    +PPPQ+ P PPN+SA+R +HPGGPAVKRASSGTGVFL
Subjt:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVFL

Query:  PRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        PRR+VNP++ RQKQG P +RFPEEM  PIQAP         D+++ RRN +  P+PR  + E A+ QELH LPQEWTY
Subjt:  PRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI-------MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

A0A6J1H0F4 uncharacterized protein LOC1114588865.3e-6051.64Show/hide
Query:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL
        M GQ  P+       NGF+GA  ++PVKST+N+EY LFS QNCARNLA++AQ                                                
Subjt:  MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFL

Query:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHP-GGPAVKRASSGTGVF
        VQ +  +LVL+AL AS++ RQAKVGW  AQPHRKP+I++RE NIVNVT R  G  G LYH P IPP QN   PPN+SAMR + P GG A+KRASSGTGVF
Subjt:  VQLMRDNLVLRALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHP-GGPAVKRASSGTGVF

Query:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI---MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY
        LPRRHVNP+D R KQG+P I F EEMK+ IQAP    + S    R N   P+PR  +AEGAVKQELH LPQEWTY
Subjt:  LPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQAPI---MDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.1e-0431.86Show/hide
Query:  PNSSAMRRVHPGGPAVKRASSGTGVFLPR--RHVNPTDFRQKQGNPTIRFPEEMK--------TPIQ--APIMDSIIYRRNNT---FQPIPRGFQAEGAV
        P    MR V  G    KR S+GTGVFLPR   H + T+ R+K    T+  P  +          P++  A + D    +R+N       +  G +AE +V
Subjt:  PNSSAMRRVHPGGPAVKRASSGTGVFLPR--RHVNPTDFRQKQGNPTIRFPEEMK--------TPIQ--APIMDSIIYRRNNT---FQPIPRGFQAEGAV

Query:  KQELHQLPQEWTY
        ++   +LP EW Y
Subjt:  KQELHQLPQEWTY

AT5G59050.1 unknown protein8.2e-0527.2Show/hide
Query:  SAMRRVHPGGPAVKRASSGTGVFLPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQA----------------PIMDSIIYRRNN---------TFQPIPRG
        S ++ V   G   +  S GTGVFLPR H    + R+K G  T+  P  +   ++                 P  D+++   NN         +   +  G
Subjt:  SAMRRVHPGGPAVKRASSGTGVFLPRRHVNPTDFRQKQGNPTIRFPEEMKTPIQA----------------PIMDSIIYRRNN---------TFQPIPRG

Query:  FQAEGAVKQELHQ-----LPQEWTY
           E  +  E HQ     LPQEWTY
Subjt:  FQAEGAVKQELHQ-----LPQEWTY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGGCCAAGCAGCTCCATTTCAAAAGAATGGCTCTTTTATGAATGGATTTATGGGTGCTCCGTGTACTGTTCCGGTTAAATCTACGAACAATATGGAATATGAGCT
GTTTTCGAATCAGAACTGTGCTAGGAATCTTGCATATACCGCTCAGGTAACTTATTTGTGTTGTTATCCTCCAAAACAGAGCATTTTAACAATTGGGTTCTCTGTTTTAC
TCTGTTTTGCTCTGATTTTCTGTGTTTCTTTGAAGAGTTGTTTTCCATTTTTGAAACTTTGGAGATTTCTGGTTTTTCTGGTGCAGCTAATGAGAGATAATCTTGTATTG
CGAGCTCTCTATGCCTCTTCTTTCGGAAGACAAGCGAAAGTCGGCTGGACGGCAGCTCAGCCGCACCGGAAGCCGGAGATTCGAAACAGAGAGATGAATATTGTTAACGT
TACTCGCCGGAGCGGTGGAAGTCCCGGCGGTTTGTACCATCCTCCAAGGATTCCGCCGCCGCAGAATCATCCTGTTCCTCCCAATTCGTCCGCCATGCGTCGTGTTCATC
CCGGCGGACCCGCCGTCAAAAGGGCTTCCTCCGGCACCGGCGTTTTCCTTCCTCGCCGCCATGTGAACCCTACGGATTTCCGTCAAAAACAAGGTAACCCAACAATTCGA
TTTCCAGAAGAAATGAAAACCCCCATTCAAGCACCAATCATGGATTCCATAATATATAGAAGAAACAATACATTTCAACCTATTCCAAGGGGTTTTCAAGCAGAGGGAGC
TGTAAAACAAGAACTTCATCAACTACCTCAGGAATGGACTTACTAA
mRNA sequenceShow/hide mRNA sequence
ACCTTTGAAAAAAGCTATGAAAGGTCTATTATAGCTATTTTCGCTAGCTATAAAATCTGTTATTTCTTGTAGTGAACTCCTTCCTCTCAGCTCAAATCGGTTTTGTGGGC
TGAGAATGAAGCTAGAGAGGCTGCGCGGTTGAAGATGAGCGGCCAAGCAGCTCCATTTCAAAAGAATGGCTCTTTTATGAATGGATTTATGGGTGCTCCGTGTACTGTTC
CGGTTAAATCTACGAACAATATGGAATATGAGCTGTTTTCGAATCAGAACTGTGCTAGGAATCTTGCATATACCGCTCAGGTAACTTATTTGTGTTGTTATCCTCCAAAA
CAGAGCATTTTAACAATTGGGTTCTCTGTTTTACTCTGTTTTGCTCTGATTTTCTGTGTTTCTTTGAAGAGTTGTTTTCCATTTTTGAAACTTTGGAGATTTCTGGTTTT
TCTGGTGCAGCTAATGAGAGATAATCTTGTATTGCGAGCTCTCTATGCCTCTTCTTTCGGAAGACAAGCGAAAGTCGGCTGGACGGCAGCTCAGCCGCACCGGAAGCCGG
AGATTCGAAACAGAGAGATGAATATTGTTAACGTTACTCGCCGGAGCGGTGGAAGTCCCGGCGGTTTGTACCATCCTCCAAGGATTCCGCCGCCGCAGAATCATCCTGTT
CCTCCCAATTCGTCCGCCATGCGTCGTGTTCATCCCGGCGGACCCGCCGTCAAAAGGGCTTCCTCCGGCACCGGCGTTTTCCTTCCTCGCCGCCATGTGAACCCTACGGA
TTTCCGTCAAAAACAAGGTAACCCAACAATTCGATTTCCAGAAGAAATGAAAACCCCCATTCAAGCACCAATCATGGATTCCATAATATATAGAAGAAACAATACATTTC
AACCTATTCCAAGGGGTTTTCAAGCAGAGGGAGCTGTAAAACAAGAACTTCATCAACTACCTCAGGAATGGACTTACTAAACAGGGCAAAAAAAAAGAAGACTTGAAATC
CAATTGTTTTTATTTTTGTAGGGTTTGGAAAAAGAAACCATTAAAGTAGGGATTAGAAGCATAATTTTATGAAGAACATGAAAAGAAAGGGAGATTTTAGTTGTAGGAAT
GATGGTGTTATTTTGTTGCAGCAGCTTTAGAATAAAGAAACAGTGAAAGTGGTAGAAGGGGTAGCAGTGGAAGAAGCTTGAAAAAACAAAAGGACAGGGATGTCAGTTGT
GAGTCCTTGTAATAATTAATCTGTAATTTTGTGTTATAGTCATTTTAGTTATGTTATTTTGAGTGAGGGTTAAAATATATTTATATAAAAAAAAAC
Protein sequenceShow/hide protein sequence
MSGQAAPFQKNGSFMNGFMGAPCTVPVKSTNNMEYELFSNQNCARNLAYTAQVTYLCCYPPKQSILTIGFSVLLCFALIFCVSLKSCFPFLKLWRFLVFLVQLMRDNLVL
RALYASSFGRQAKVGWTAAQPHRKPEIRNREMNIVNVTRRSGGSPGGLYHPPRIPPPQNHPVPPNSSAMRRVHPGGPAVKRASSGTGVFLPRRHVNPTDFRQKQGNPTIR
FPEEMKTPIQAPIMDSIIYRRNNTFQPIPRGFQAEGAVKQELHQLPQEWTY