; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012732 (gene) of Snake gourd v1 genome

Gene IDTan0012732
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNA-binding protein 7-like isoform X1
Genome locationLG10:60880865..60892817
RNA-Seq ExpressionTan0012732
SyntenyTan0012732
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141807.1 RNA-binding protein 7 [Momordica charantia]8.1e-9285.07Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSN CTVYVGNLD+RVSDRVLYDILIQAGR+VDLHIPRDKESGKP+GYAFAEYE+EEI+ YAVKLFSGLV LHNR LKFAVSGQDKPSPSS AI SS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHK RSQ VPGYS+EISQYSNRLS SCRFSAYPENHLEAP YPGLVNQSNGYGSH+  +NN+YSQRY GTTLDSFNH + RR++TS PVYYPPYE L
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

XP_022932504.1 RNA-binding protein 7 isoform X1 [Cucurbita moschata]2.0e-10393.03Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKSRSQIVPGYSNEIS YSNRLSNSCRFSAYPENH+EAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

XP_022973396.1 RNA-binding protein 7-like isoform X1 [Cucurbita maxima]9.2e-10493.53Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKSRSQIVPGYSNEIS YSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

XP_022973397.1 RNA-binding protein 7-like isoform X2 [Cucurbita maxima]6.0e-8782.59Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKS                       RFSAYPENHLEAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

XP_023523852.1 RNA-binding protein 7-like isoform X1 [Cucurbita pepo subsp. pepo]3.9e-10292.54Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKSRSQIVPGYSNEIS YSNRLSNSCRFSAY ENHLEAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPV+YPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

TrEMBL top hitse value%identityAlignment
A0A6J1CKW6 RNA-binding protein 73.9e-9285.07Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSN CTVYVGNLD+RVSDRVLYDILIQAGR+VDLHIPRDKESGKP+GYAFAEYE+EEI+ YAVKLFSGLV LHNR LKFAVSGQDKPSPSS AI SS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHK RSQ VPGYS+EISQYSNRLS SCRFSAYPENHLEAP YPGLVNQSNGYGSH+  +NN+YSQRY GTTLDSFNH + RR++TS PVYYPPYE L
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

A0A6J1EWJ3 RNA-binding protein 7 isoform X26.4e-8782.09Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKS                       RFSAYPENH+EAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

A0A6J1EX61 RNA-binding protein 7 isoform X19.9e-10493.03Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKSRSQIVPGYSNEIS YSNRLSNSCRFSAYPENH+EAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

A0A6J1I7F0 RNA-binding protein 7-like isoform X22.9e-8782.59Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKS                       RFSAYPENHLEAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

A0A6J1I8J0 RNA-binding protein 7-like isoform X14.4e-10493.53Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSGRSNGCT+Y+GNLDERVSDRVLYDILIQAGRVVDLHIPRDKESG+PRGYAFAEYENEEI+ YAVKLFSGLVTLHNRTLKFAVSGQDKPSPSS+AITSS
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL
         NISHKSRSQIVPGYSNEIS YSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGS+ D+ NNQYSQRYSGT LDSFNHPKSRRHDTSYPVYYPPYELL
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELL

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
O14102 Spliceosome-associated protein 491.9e-1136.05Show/hide
Query:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDK
        R+   T+Y+GNLDE+V+D +L+++ +QAG VV++HIPRD+      G+ F E+ +E+  +YA ++ +  V L  + ++   + QD+
Subjt:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDK

Q09442 Splicing factor 3B subunit 41.9e-1135.9Show/hide
Query:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK
        R+   T+YVG LDE+VS+ +L+++++QAG VV +++P+D+ +   +G+ F E+  EE + YA+K+ + ++ L+ + +K
Subjt:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK

Q15427 Splicing factor 3B subunit 47.7e-1339.51Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK
        +S R+   TVYVG LDE+VS+ +L+++ +QAG VV+ H+P+D+ +G+ +GY F E+ +EE + YA+K+ + ++ L+ + ++
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK

Q6AYL5 Splicing factor 3B subunit 47.7e-1339.51Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK
        +S R+   TVYVG LDE+VS+ +L+++ +QAG VV+ H+P+D+ +G+ +GY F E+ +EE + YA+K+ + ++ L+ + ++
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK

Q8QZY9 Splicing factor 3B subunit 47.7e-1339.51Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK
        +S R+   TVYVG LDE+VS+ +L+++ +QAG VV+ H+P+D+ +G+ +GY F E+ +EE + YA+K+ + ++ L+ + ++
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLK

Arabidopsis top hitse value%identityAlignment
AT2G18510.1 RNA-binding (RRM/RBD/RNP motifs) family protein9.4e-1436.36Show/hide
Query:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPS
        R+   TVYVG LD ++S+ +L+++ +QAG VV++++P+D+ +   + Y F EY +EE + YA+K+ + ++ LH + ++   + QDK S
Subjt:  RSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPS

AT2G37220.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.9e-0734.29Show/hide
Query:  SGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG
        SG  +G  VYVGNL   V D  L  +  + G+VV+  +  D++SG+ +G+ F  Y++ +  + A+K   G
Subjt:  SGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG

AT4G10110.1 RNA-binding (RRM/RBD/RNP motifs) family protein5.1e-4452.63Show/hide
Query:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS
        MSG SN CTVY+GN+DERVSDRVLYDI+IQAGRV+DLHIPRDKE+ KP+G+AFAEYE EEI+ YAVKLFSGLV+L+NRTLKFA+SGQDK        ++S
Subjt:  MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSS

Query:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRF----SAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRH
         N  H++R Q +    ++ + Y +    S +     S  P ++ + PP PG+ N     G+ L     +YS+R  G+ LDS NH + RR+
Subjt:  LNISHKSRSQIVPGYSNEISQYSNRLSNSCRF----SAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRH

AT5G64200.1 ortholog of human splicing factor SC351.5e-0630.84Show/hide
Query:  SNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG-LVTLHNRTLKFAVSGQDKPSPSSNAITSSLNI
        S+  ++ V N+  R +   LY +  + G+VVD+ IPRD+ +G  RG+AF  Y+ ++ +  AV+   G +V     T++FA  G +    S   +      
Subjt:  SNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG-LVTLHNRTLKFAVSGQDKPSPSSNAITSSLNI

Query:  SHKSRSQ
        S +SRS+
Subjt:  SHKSRSQ

AT5G64200.2 ortholog of human splicing factor SC351.5e-0630.84Show/hide
Query:  SNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG-LVTLHNRTLKFAVSGQDKPSPSSNAITSSLNI
        S+  ++ V N+  R +   LY +  + G+VVD+ IPRD+ +G  RG+AF  Y+ ++ +  AV+   G +V     T++FA  G +    S   +      
Subjt:  SNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSG-LVTLHNRTLKFAVSGQDKPSPSSNAITSSLNI

Query:  SHKSRSQ
        S +SRS+
Subjt:  SHKSRSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGAAGATCAAATGGTTGCACTGTTTACGTAGGTAATTTGGATGAAAGGGTAAGTGATAGGGTTCTGTATGACATTCTAATCCAAGCTGGGCGAGTAGTGGACTT
GCACATTCCCCGGGACAAGGAATCTGGCAAACCCAGGGGTTATGCTTTTGCAGAATATGAAAATGAAGAGATTTCCAAATATGCTGTTAAGCTATTCTCTGGTCTTGTGA
CTCTTCATAACCGTACCCTGAAGTTTGCAGTGTCTGGGCAAGACAAGCCTTCGCCTAGTAGTAATGCAATCACTTCATCATTAAATATATCTCATAAATCAAGGTCGCAG
ATTGTTCCAGGTTACTCTAATGAAATATCTCAATATTCCAATCGCTTGTCAAATTCATGCAGGTTTTCAGCATACCCCGAAAACCATCTAGAAGCGCCTCCTTACCCTGG
TTTAGTTAACCAATCTAATGGGTATGGATCACATTTGGATTACTTCAATAACCAATACAGTCAGAGATATTCTGGGACAACTTTGGATAGCTTTAACCATCCTAAATCAC
GACGCCATGATACAAGTTATCCAGTATACTATCCTCCTTATGAACTTTTGAACTAA
mRNA sequenceShow/hide mRNA sequence
CTAATCAAACGAAAAAACGATAAAATAAAAAGGCAAATCAAAGCCATAATAATTGTAGGTCAATTGGCTTACTGACTTAGCCGTCGTTCCTCTTTCTCTTTTATCCTTCC
GTTTTCCTGTTTCGCTCTCTGAAACTTCGGTTTAGGGTTTTGCATCTGAACTTGCAGAATAATCATGTCAGGAAGATCAAATGGTTGCACTGTTTACGTAGGTAATTTGG
ATGAAAGGGTAAGTGATAGGGTTCTGTATGACATTCTAATCCAAGCTGGGCGAGTAGTGGACTTGCACATTCCCCGGGACAAGGAATCTGGCAAACCCAGGGGTTATGCT
TTTGCAGAATATGAAAATGAAGAGATTTCCAAATATGCTGTTAAGCTATTCTCTGGTCTTGTGACTCTTCATAACCGTACCCTGAAGTTTGCAGTGTCTGGGCAAGACAA
GCCTTCGCCTAGTAGTAATGCAATCACTTCATCATTAAATATATCTCATAAATCAAGGTCGCAGATTGTTCCAGGTTACTCTAATGAAATATCTCAATATTCCAATCGCT
TGTCAAATTCATGCAGGTTTTCAGCATACCCCGAAAACCATCTAGAAGCGCCTCCTTACCCTGGTTTAGTTAACCAATCTAATGGGTATGGATCACATTTGGATTACTTC
AATAACCAATACAGTCAGAGATATTCTGGGACAACTTTGGATAGCTTTAACCATCCTAAATCACGACGCCATGATACAAGTTATCCAGTATACTATCCTCCTTATGAACT
TTTGAACTAAGGCAGCTAGCAACTTCCATTTTGGTTAATCTTAATGGTAAATCTAAATTAGGATAAGTTGGATCTTAGTTGGATGCAGAAGGTTTAGTGGAAATGATATG
ATCTTTGCTATGGAGCATGCTGGTTCAATATATTATAGTCATTTTGTAGGTGTATGTATTCATATGAATTGGCTGTGTTATAGTTGATGCTTTAATTAAAGTATTGTAAT
CTATTTTATTAAATAGTCAGATTTTCTTTAATTCCAGTTTAATGTTATGGGAATACAAATGATATTTTTCATGGTGACTTTAAGAAATAGCTTTTTGTTTGTGTATATTG
AGTATTTAGTTTTTTCTGAAGTGTTCTACTTAAGCCAATGTATATATGTTACTAAGGTCTGCATTTATTGGTGGTGGCTTTATTATGTTTTTCCCCCCTTCTAGAAATTG
GTTCATTTTAACTAGAAACTGTCGAGCAGACTGGTTTACGCTTCAGTGAAGTGAAAGCTAAATTACTTGTTATCCCAGGCATTTTGGTTTGAGATTATTGTCATTCTGCA
TCTGAGTATGCTGGTACTGAAATACTTAGTAGGTTAAATTGTAAGGTGGCATAGTCTAGCTTTGGCCTGGCGCCTCACTGTTATGAAAAGGTTACTTTTGCTTGGAAGAA
AAAAAAAAAGAAAGGCAAGCTCCAAAATGAGAACTGATATGACTGCTTTTGTTGCATTTTATAGACCTAGTTGATAACATTATTGTGTTTGCTTGGAAGGATAAAGGAAA
GCCAAGCTCCAAAATGAGAACTGATATGGCTGCTTTTGCTGCATTCTGTAGACCTAGTCGAAAACATTATTGCTTGAAATATATTTTAAATTTACATTATTTCTCTGTTT
AGTTCCGGTTTTTGAGAACGTGCTTGAATTCTGGCCAAAATTTCAAAATCAAAGGAAACTTTGAAAACTATTTAGAGTTGCACTAACTTTTTGGAAACATTTTGAAAAGT
TTATTATAAAAGAAAGTTAATTGTTTGTCAAGGAGCATGATTTTGAAAACAGATGACCTTATGTTTGTCTTGTCCGCTTCTTGTTTTGAGACTTCTGATTCTTTTATTCT
AAAGTGCTGATTAATCTGAAAGAAATGTATGTTGTACTTGAAGTTTTCAAGACGTGATTTCTTTGTGAAAAGAATTAATGATTGAAAATTTAATTCTTTGAGAAAGATAG
GTTTTTACTAGTCTTGAGGGCCATTCGTTTTTTGATCGTAGATAAAACATTTGGTGATCAGGTTAGTGGCAGTTCTATACGAGGATCAATATAGTGGTTTCTATCAAAAT
TCAAGAATCTAAATGCCTCCTTTCTCTTATCTGGGCTCAAAGTTCTTATCTGGAGTTGCCCTTTAGTTCTTCCATTCTTTGTATCATAATATGAATTTGTTCATTTCCTC
TCATTTCTTCCCTTTTTATTGGTTGTGAAATCTGTATTTGATGTATAATATATACTAATACGTTTGTTTGTATTTGTATGATAAATGATCTTTCTTTGGCTGCCTTGTAA
GATTTGGGATAAGTTCGATATAGTCATATAGCATTAGTTAGCAAGTAATGTTTCTAAGTGAATTCCTTTTGCATAAAGCACCGGTGGAGAAAAATACATGGATGAAGTGC
TTTGTCCAAAAGTTACTTCAAAGAGTTTACATTTTCCTTTTCAAACAAGAAGTGAAAGTACAGAAATGAAAAGCATCCAAGTCCTAGTGGATAATACAAAGGTCTTCTAA
ACCTTTGGTGGCATAAGTTCAAATCCTTGTACCCAAGTTTGCTTCTATAGGCATGTATTTTCATAGTTTCGAACACGTATGGTGATGCCATTGTCAAGAAAGATATTTGG
AATGAAGTGAAAATACGAGAGGAAGGTGGAGAACGGTGGTAATATGATGAAACTATGGAGCTTGGAAATGAATGGTAAAAGGATAGGGAATGTGATAGATGGTTGGAGAG
ATTAAAGCATAATATAGGGAGTTTGATTGTTTTGTATTTGATTTTGTGCAATCCCTAAGGATTTGAATTGAAGGAGAGGCTCCCTATAATTGTAATTAGTATAGCATAGT
TCATTAGGTCAGAGGCATAGAGACAATCCTCCTCTAAAGAGGATTACATTTCTGTCATCTTCCCTATAATTGTAATTAGAGTTGTTTTTTTGGGTTAAGAATCCAACTTC
GAGAGTTGCAACAGGAGGGTTGCGCTCGTTGGACGGTTACAAAGGAAATCTCGAGCTCGTATTGGTCATAGATATGGGTTCAAGAAGCTTGTTCACAGATTGGTTATGGA
CATGCACAGAAGCTTAGAAAGCTCAGATAACATTTTTTGTGAAGAGTTCAAGGAAATGTTGTAAAGCAAAGCAGGTGGGTGATTTTGTACAGGGGTGGCTGCCAAACATT
TGTTCTTTTTACAAATTTTGTGGATTTTAATTTTCTTTATTTTGCATGCAATATTCTTAAATAAATAGCTTCATTATGGTCATTGGAACTACTAAGCGATCTAGTGAGGG
TAGAACAGGAAAAAAAAAATTGAAGAAATGAAAATTTATATAATGCACACAGGTTGTTAGTACATGGCCTATTAAATTGTTTCACCAAATAGGAATTAAATAGAAAATTC
CCAAAATGCACCCATCACTGAATTTTAAGTCTGCTTAGTAATGGAAAAGCCAGCCTGTGACCATTAAAGTGACGCGAAACTTTAATTCATTTCCTCCACTAAATCACTTT
CTCTGTCTTTATATTAAAATGATCATGCAAATTTAAGTAAAATTAATAACAGTACTAGTCATCTTTAATTCTGGCATTTTTTGTTATAAAGTTGCACAATTTATTTATGT
ACGTATTAAATAAAAAGATTGGAC
Protein sequenceShow/hide protein sequence
MSGRSNGCTVYVGNLDERVSDRVLYDILIQAGRVVDLHIPRDKESGKPRGYAFAEYENEEISKYAVKLFSGLVTLHNRTLKFAVSGQDKPSPSSNAITSSLNISHKSRSQ
IVPGYSNEISQYSNRLSNSCRFSAYPENHLEAPPYPGLVNQSNGYGSHLDYFNNQYSQRYSGTTLDSFNHPKSRRHDTSYPVYYPPYELLN