CuGenDBv2

Tan0010090 (gene) of Snake gourd v1 genome

Gene ID: Tan0010090
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Serine-rich protein-like protein
Genome location: LG09:66983163..66985888
RNA-Seq Expression: Tan0010090
Synteny: Tan0010090
Gene Ontology terms: NA
InterPro domains: NA
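
The genome location field can be split into its parts and turned into a span length. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end = parse_location("LG09:66983163..66985888")
print(seqid, start, end, span_length(start, end))
```

Under a 0-based, half-open convention the length would instead be `end - start`.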


Homology
GenBank top hits | e value | % identity | Alignment
KAG7021750.1 hypothetical protein SDJN02_15477, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-91 | 94.55
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR+NLQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

XP_022933728.1 uncharacterized protein LOC111441057 [Cucurbita moschata] | 1.1e-91 | 94.55
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR+NLQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

XP_022966156.1 uncharacterized protein LOC111465917 [Cucurbita maxima] | 1.2e-90 | 94.06
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR+NLQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPR SRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

XP_023530348.1 uncharacterized protein LOC111792947 [Cucurbita pepo subsp. pepo] | 4.0e-91 | 94.06
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR++LQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

XP_038879100.1 uncharacterized serine-rich protein C215.13-like [Benincasa hispida] | 4.0e-91 | 93.63
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSY-SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGN-
        MAVSSRKSSGPVLRSLSPSGRFYGSY SSSSSSSSSAFASS+SSFST+NPTSFFRRSVSPSRVNLQGSSSPSASSV FSLDRSISPNRPISVL+R+SGN 
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSY-SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGN-

Query:  QVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSK
        QVVK+QSNQKRTCMCSPTTHPGSFRCSLHKG+ SQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLV+RALASLIRPSSHSQRRRADFCPRPSRLS+MSK
Subjt:  QVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSK

Query:  ADDL
        ADDL
Subjt:  ADDL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LW36 Uncharacterized protein | 9.0e-89 | 91.13
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGN-Q
        MAVSSRKSSGPVLRSLSPSGRFYGS SS SSSSSSAFASSTSSFST+N TSFF RSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVL+R+SGN Q
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGN-Q

Query:  VVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKA
        VVK+QS QKRTCMCSPTTHPGSFRCSLHKG+ SQPSTPYSSNRLNARRSAMTNS+VRIGGVEGD+++RALASLIRPSSHSQRRR DFCPRPSRLS+MSKA
Subjt:  VVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKA

Query:  DDL
        DDL
Subjt:  DDL

A0A6J1C124 uncharacterized protein DDB_G0271670-like | 1.5e-88 | 90.78
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSY----SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSS
        MAVSSRKSSGPVLRSLSPSGRFY SY    SSSSSSSSSAFASSTS+FST+N TSFFRRS SP+RVNLQGSSSPSASSVRFSLDRSISPNRP+SVLTRSS
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSY----SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSS

Query:  GNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVM
        G+QVVKRQ NQKRTCMCSPTTHPGSFRCSLHKG PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADF PR SRLS+M
Subjt:  GNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVM

Query:  SKADDL
        SKA+DL
Subjt:  SKADDL

A0A6J1F5M8 uncharacterized protein LOC111441057 | 5.1e-92 | 94.55
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR+NLQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

A0A6J1HNK6 uncharacterized protein LOC111465917 | 5.7e-91 | 94.06
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV
        MAVSSRKSSGPVLRSLSPSGRFYGSYSSS SS SSAFASSTSSFST+NPT FFRRSVSPSR+NLQ SSSPSASSVRFSLDRSISPNRPISV TR SGNQV
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQV

Query:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD
        VKRQSNQKRTCMCSPTTHPGSFRCSLHKG+PSQPSTPYSSNRLNARRSAMTNS+VRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPR SRLS MSKAD
Subjt:  VKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKAD

Query:  DL
        DL
Subjt:  DL

A0A6J1JI11 uncharacterized serine-rich protein C215.13-like | 1.2e-88 | 89.37
Query:  MAVSSRKSSGPVLRSLSPSGRFYGSY-----SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRS
        MAVSSRKSSGP +RSLSPSGRF G Y     SSSSSSSSSAFASSTSSFST+NPTSFFRRS+SPSRV+LQGSSS SASSVRF+LDRSISPNR ISVLTR 
Subjt:  MAVSSRKSSGPVLRSLSPSGRFYGSY-----SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRS

Query:  SGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSV
        SGNQVVKRQSNQKRTC+CSPTTHPGSFRCSLHKG+PSQP+TPYSSNRLNARRSAMTNS+VRIGGVEGDLV+RALASLIRPSSHSQRRRADFCPRPSRLS+
Subjt:  SGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSV

Query:  MSKADDL
        MSKADDL
Subjt:  MSKADDL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G67910.1 unknown protein | 2.8e-05 | 50
Query:  GNQVVKRQSNQKRT-CMCSPTTHPGSFRCSLHKGIPSQ
        G++ + RQ++  +T C+CSPTTHPGSFRC +H+ +  Q
Subjt:  GNQVVKRQSNQKRT-CMCSPTTHPGSFRCSLHKGIPSQ
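
The % identity column can be related to the alignment text itself. In BLAST-style pairwise output the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below is a minimal illustration, assuming identity is the count of identical columns divided by the alignment length (gap columns included); the function name is hypothetical, and whether this database computes the column exactly this way is an assumption.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Identical columns over alignment length, as a percentage.

    Assumes BLAST-style output: letters in the midline mark identical
    residues; '+' and spaces mark non-identical columns.
    """
    alignment_length = len(query_line)  # gap characters count as columns
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Query and middle lines of the AT1G67910.1 alignment, without their labels.
query = "GNQVVKRQSNQKRT-CMCSPTTHPGSFRCSLHKGIPSQ"
middle = "G++ + RQ++  +T C+CSPTTHPGSFRC +H+ +  Q"

print(percent_identity(query, middle))
```

For this hit the midline contains 19 identical residues over 38 alignment columns, which agrees with the listed identity of 50.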

AT5G11090.1 serine-rich protein-related | 5.5e-46 | 57.41
Query:  SSRKSSGPVLRSLSPSGRFYGSYSSS-SSSSSSAFASSTSSFSTKNPTSFF------------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISP-NRP
        S  KS+GPVLRS SPSGRF G YS +  SSSSSAFASSTSS  +   ++FF             RS SP+RVNL  ++ P + S R+SLD RSISP N+ 
Subjt:  SSRKSSGPVLRSLSPSGRFYGSYSSS-SSSSSSAFASSTSSFSTKNPTSFF------------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISP-NRP

Query:  ISVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRAD
        ISV    S NQ         R CMCSPTTHPGSFRCSLHK + +   Q +  Y++N LN RRSAMTNS+VRIGGVEG+ V+RAL +LIRPSSH  +RRA 
Subjt:  ISVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRAD

Query:  FCPRPSRLSVMSKADD
        + PRPSRLS+M+KAD+
Subjt:  FCPRPSRLSVMSKADD

AT5G20370.1 serine-rich protein-related | 3.7e-10 | 35.52
Query:  SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSL
        +++  SS S     +SS    N      R  SPS  N          ++R       S  +P SV+T SS +Q        KR C+CSPTTHPGSFRCS 
Subjt:  SSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSL

Query:  HKGIPSQPSTPYSS------NR----------LNARRSAMTNSIVRIGGVEGDLVKRAL-ASLIRPSSHSQRRRADFCPRPSR
        H+ +  + S   +S      NR          LN R+ A+ NS+ +IG VE +  +R+L A+L +PSS    RR +F PR SR
Subjt:  HKGIPSQPSTPYSS------NR----------LNARRSAMTNSIVRIGGVEGDLVKRAL-ASLIRPSSHSQRRRADFCPRPSR

AT5G25280.1 serine-rich protein-related | 2.5e-43 | 54.17
Query:  SSRKSSGPVLRSLSPSGRFYGSYSS---SSSSSSSAFASSTSSFSTKNPTSFF----------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISPNRPI
        ++R +    LRS SPSGRF G YS+   SSS SSS FASSTSS  +   T+FF           RS SP+RVNL  +S+P   S R+S+D RSISPNR I
Subjt:  SSRKSSGPVLRSLSPSGRFYGSYSS---SSSSSSSAFASSTSSFSTKNPTSFF----------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISPNRPI

Query:  SVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADF
        +V +    N    +  + +R CMCSPTTHPGSFRCSLHK + +   Q +  Y++N LN RRSAMTNS+VRIGGVEG+ V+RAL +LIRPSSH  +RR+ +
Subjt:  SVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADF

Query:  CPRPSRLSVMSKADDL
         PR SRL+ MSKA+DL
Subjt:  CPRPSRLSVMSKADDL

AT5G25280.2 serine-rich protein-related | 2.5e-43 | 54.17
Query:  SSRKSSGPVLRSLSPSGRFYGSYSS---SSSSSSSAFASSTSSFSTKNPTSFF----------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISPNRPI
        ++R +    LRS SPSGRF G YS+   SSS SSS FASSTSS  +   T+FF           RS SP+RVNL  +S+P   S R+S+D RSISPNR I
Subjt:  SSRKSSGPVLRSLSPSGRFYGSYSS---SSSSSSSAFASSTSSFSTKNPTSFF----------RRSVSPSRVNLQGSSSPSASSVRFSLD-RSISPNRPI

Query:  SVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADF
        +V +    N    +  + +R CMCSPTTHPGSFRCSLHK + +   Q +  Y++N LN RRSAMTNS+VRIGGVEG+ V+RAL +LIRPSSH  +RR+ +
Subjt:  SVLTRSSGNQVVKRQSNQKRTCMCSPTTHPGSFRCSLHKGIPS---QPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADF

Query:  CPRPSRLSVMSKADDL
         PR SRL+ MSKA+DL
Subjt:  CPRPSRLSVMSKADDL


Sequences
CDS sequence
ATGGCGGTTTCTTCCAGAAAATCGAGCGGACCGGTTCTGAGGTCTCTCTCACCTTCTGGGAGATTCTATGGCTCCTATTCTTCCTCTTCTTCTTCTTCCTCATCGGCTTT
TGCGTCTTCTACTTCAAGCTTTTCCACCAAAAACCCTACTTCGTTTTTCCGTAGATCTGTGTCTCCTTCTCGTGTTAACCTGCAAGGTTCTTCTTCTCCGTCGGCGTCGT
CCGTTAGATTTTCACTGGACCGGTCTATTTCTCCGAATCGGCCTATCTCTGTTTTGACTCGTAGCAGTGGAAATCAAGTAGTGAAGAGGCAGAGCAACCAGAAGAGAACC
TGCATGTGCTCGCCGACCACGCATCCTGGTTCGTTCCGGTGTAGTCTCCACAAAGGCATTCCGTCGCAGCCTTCGACTCCTTACTCGTCTAATCGGCTGAACGCGCGGAG
ATCGGCGATGACGAACTCTATTGTCAGAATCGGAGGAGTTGAAGGCGATTTAGTGAAGCGAGCCTTGGCGTCTCTCATCCGGCCTTCGTCTCATAGTCAAAGGCGCCGAG
CGGATTTCTGTCCGAGACCGAGCCGGCTTTCAGTCATGTCGAAAGCCGATGATCTGTGA
mRNA sequence
CGATTTTTATTTCATTCGGAGAGATATATAAATCCTTCCAAGTGTAGGGGCATATTTGCAAAGCGTAATTTTCCGCATACAATTCCGTCAAAATAACGCGATAAATTGCG
GTCGGTCCTTCCTATAAATACATGGCTCGAGTGAAGCTAGGTTTGCCAATTTGAAGAGCCAAACGCGGCTTGTAAGAAAATCCTCTCTCCTGCTCTTTCTTCTTCTTCTC
GCTTTTTCTCTTGACTGCAACTTTAATGGCGGTTTCTTCCAGAAAATCGAGCGGACCGGTTCTGAGGTCTCTCTCACCTTCTGGGAGATTCTATGGCTCCTATTCTTCCT
CTTCTTCTTCTTCCTCATCGGCTTTTGCGTCTTCTACTTCAAGCTTTTCCACCAAAAACCCTACTTCGTTTTTCCGTAGATCTGTGTCTCCTTCTCGTGTTAACCTGCAA
GGTTCTTCTTCTCCGTCGGCGTCGTCCGTTAGATTTTCACTGGACCGGTCTATTTCTCCGAATCGGCCTATCTCTGTTTTGACTCGTAGCAGTGGAAATCAAGTAGTGAA
GAGGCAGAGCAACCAGAAGAGAACCTGCATGTGCTCGCCGACCACGCATCCTGGTTCGTTCCGGTGTAGTCTCCACAAAGGCATTCCGTCGCAGCCTTCGACTCCTTACT
CGTCTAATCGGCTGAACGCGCGGAGATCGGCGATGACGAACTCTATTGTCAGAATCGGAGGAGTTGAAGGCGATTTAGTGAAGCGAGCCTTGGCGTCTCTCATCCGGCCT
TCGTCTCATAGTCAAAGGCGCCGAGCGGATTTCTGTCCGAGACCGAGCCGGCTTTCAGTCATGTCGAAAGCCGATGATCTGTGATGGAGATTTTGGTACGTTGGACAATA
TCCTCTGGTTTATTAATCTTTTTACCTCTTCTCTGAGTTTCCGACGACGAAGACGGCGGATTCCGACCACAGGTTCGACGGCAGTGACGGATCTTAGCGCAATGATACGG
AATTGGAAGCGAAGCATGAGAATCAGATCTTGCTGCCTCTCCAGTAAACCTCGCTGCCCGGAGTTGGACTAGAGAACCTATATAACGTTGTGTATATACAGAATATTAAA
TATATGGAAGATATATACAGTTGCCGCGATATTGATAACAGTTCGTGGAGCTTCTGAACCCTAATTTTTCACCAATTTCAAAATTAACCATAAATCAAACTCTAATTGGT
CTTAATTTCGAACTAATTCCAAAGGAAATGATTCAAAATTATTTGTGTCCCGCTAATCGCTGAAATCAATTCACTGTTATTGTTTACGTTCCACTCGCTTCGTATTGTAT
ACAGAACGGCTGTGTTTGTTATTTGGTTCGGCGTCTGACCGTCTCCATTCATCGTCGGCGTGAAATTAAAGCCAGTAATTTACGTACTCGTGCTTGCCGTAATCCGTTCC
GTCAAGCGTGGTGATTTCTCAAAGATGCAATTGCAGGTTGAAACAGAAGAGCCTCTGAACTGTGAATTTTGTGTATCTGAAAGTAGGATTTATGATGGGAATTTAGTCGG
AGATGGTTCAAATTTGGATGTTGATATGAACAAGGACACTCGTGGAGATTTGACACGGACTGAACACATTGAATTCAAATTAAATGGGAAGAAATTCTGGATTGAATTTA
AAAAAGTAAATAAAAATTAAAAAAACAAGTTAGAATGTAATAGAATAGAATGAAAGAATTGGGGAGAAGAAAATTGAGAAATAGGAGATTGTCAGATTGATGATGGGGTT
GGACAGTGGATGTGGATGGTTGACCTGTCAAGTCCCAAATCGGTGGAAGGAACCGACTGCTATTTGCTATAAAAAATCAAATTCATTTCCCTATGGAATGTTCATAAAAT
TGGACATTGAGAAATTAAAAATAAAGCGGCTTTTCAATTGAGAATGAAATATGTATGCCCCTTTGAGAGAGGAAGGGTTGGTCTACATAGTAGGAGATAATATATTTTTC
TTAGGGGAGGAAACTATTAGGGAGAAATTAGAAAATGAAAAGACAGGCTTAGGCTTAAATTTATATTAGTGGATTTAAAACTCTGCTCAAATTATTAATTTGGTTCCTAT
GGGGCTTATGGTTTTAAAATTTCTAATTTGAATTCTGCCATTTTGGTGGCAGAGTAATTAGGATTTTGATTTGTTAAGGAGCTTTGGATTATGTGGAAATTGAATTACAA
AATTGCAGTATTTTTGGATGAATTTGAATTAAATGCAAACTCTATGATCGGATTCATAATCCAAAGATGGCTAAGGGAAGTAAAATCAGTTTAGTACAAGGTGAGAAATA
AAGGGGAAAAAAAAGAAAGAGGAGAAAGATGGGAGTCTAAAAATGTTGAAAAACGAAGGGCAATGAGGATAAGAAGATAATAGTGAGTGAAAATGAGTGGGCATCAGTAT
CGACAATGCCAATCTCTCACTTTTTTATTTATTTATTTATTTTGAATGATCTCTCGCCTTTTATGCTTCTCATATTTTCAAACCGATCAAATGTATGGTTACGCGAACTC
AAAAATATGAAATTAAAAAAAGAAATGTATAGTTAGGCAAATGTTGGTCATCTCTGAATTGTTCCACTCACCCCACCGTTGTTTGAATAGAGATCACCTTATTT
Protein sequence
MAVSSRKSSGPVLRSLSPSGRFYGSYSSSSSSSSSAFASSTSSFSTKNPTSFFRRSVSPSRVNLQGSSSPSASSVRFSLDRSISPNRPISVLTRSSGNQVVKRQSNQKRT
CMCSPTTHPGSFRCSLHKGIPSQPSTPYSSNRLNARRSAMTNSIVRIGGVEGDLVKRALASLIRPSSHSQRRRADFCPRPSRLSVMSKADDL
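
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and not part of any database tooling. Only the first 42 nt and last 15 nt of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCGGTTTCTTCCAGAAAATCGAGCGGACCGGTTCTGAGG"  # first 42 nt of the CDS
cds_end = "GCCGATGATCTGTGA"                               # last 15 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 14 residues of the protein sequence and to its final four residues followed by the stop codon. Applying `translate` to the full CDS and dropping the trailing `*` gives a sequence that can be compared directly with the protein sequence listed above.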