; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004495 (gene) of Snake gourd v1 genome

Gene IDTan0004495
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTPR_REGION domain-containing protein
Genome locationLG08:6277725..6285649
RNA-Seq ExpressionTan0004495
SyntenyTan0004495
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030139.1 hypothetical protein SDJN02_08486, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-13790.37Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLKPSISM RSFH+SFT PSLPNSIK LEN HSPSS PMRIKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAV A+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLD-----RFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGE
        LAEFDKAIELDPRQKAYLWQRGLSLYYLD     RFEEGA+QFRLDVAQNPNDTEESIWCF+CEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFKDGG 
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLD-----RFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGE

Query:  PEKLVAAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        PE LVAAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  PEKLVAAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

XP_022946000.1 uncharacterized protein LOC111450218 isoform X1 [Cucurbita moschata]2.6e-13892.08Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLKPSISM RSFHISFT PSLPNSIK LEN HSPSS PMRIKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAV A+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCF+CEAQLYGVDEARR+FLEVGRDPR VMREAYNMFKDGG PE LV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

XP_022999589.1 uncharacterized protein LOC111493913 isoform X3 [Cucurbita maxima]1.4e-13992.83Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLKPSISM RSFHISFT PSLPNS K LEN HSPSSLPMRIKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNP+DTEESIWCF+CEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFKDGG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

XP_023545315.1 uncharacterized protein LOC111804759 isoform X1 [Cucurbita pepo subsp. pepo]7.6e-13890.94Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AIS+LKPSISM RSFHISFT PSLPNS K LEN HSP+S PM+IKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAV A+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRF+EGA+QFRLDVAQNPNDTEESIWCF+CEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFKDGG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

XP_038889663.1 uncharacterized protein LOC120079523 [Benincasa hispida]1.1e-13389.14Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLK SIS+  S HI F  PS PNS KTLE  HSPSSLPMRI LNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFRQGDV GS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARR+FLEVGRD RPVMREAYNMFKDGG PEKL+
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAAL +VH LCRNWSFS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS

TrEMBL top hitse value%identityAlignment
A0A0A0LGH0 TPR_REGION domain-containing protein3.9e-13288.76Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M   SSLK SIS+  S H SFT PS  NS KT+E FHSPSSLPMRI LNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFRQGDV GS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFKDGG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQ IVAACQS Y QRSDDYMAALAKVH LCRNWSFS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS

A0A5D3BC02 TPR_REGION domain-containing protein3.3e-13188.01Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M   S LK SIS+  S H SFT PS  NS KT+ENFHSP+SLP+RI LNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFRQGDV GS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFK+GG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQS YGQRSDDYMAALAKVH L RNWSFS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS

A0A6J1DQ08 uncharacterized protein LOC1110232571.1e-13188.39Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M  ISSLKP+ISMPRS   SFT PSLPNSI T E F SP+SLPMRIKLN +P  SLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFRQGDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCFLCEAQLYGVDE+RR+FLEVGRD RPVMREAY+MFKDGG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS
        AAF SGRENEYFYASLYAGLY+E+EK +DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWSFS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS

A0A6J1G2K4 uncharacterized protein LOC111450218 isoform X11.3e-13892.08Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLKPSISM RSFHISFT PSLPNSIK LEN HSPSS PMRIKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAV A+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNPNDTEESIWCF+CEAQLYGVDEARR+FLEVGRDPR VMREAYNMFKDGG PE LV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

A0A6J1KHI1 uncharacterized protein LOC111493913 isoform X36.7e-14092.83Show/hide
Query:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS
        M AISSLKPSISM RSFHISFT PSLPNS K LEN HSPSSLPMRIKLNHVPPLSLSRRLF+PSVSGIWDA+TGGNNPRDAVAA+RRGMLLFR+GDVSGS
Subjt:  MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGS

Query:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV
        LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGA+QFRLDVAQNP+DTEESIWCF+CEAQLYGVDEARR+FLEVGRDPRPVMREAYNMFKDGG PEKLV
Subjt:  LAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLV

Query:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS
        AAF SGRENEYFYASLYAGLY+EAEKK+DAAKQHIVAACQSPYGQRSDDYMAALAKVH LCRNWS
Subjt:  AAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWS

SwissProt top hitse value%identityAlignment
Q1C3L9 Lipoprotein NlpI6.0e-0530.67Show/hide
Query:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL
        G+ L + G+   +   FD  +ELDP        RG++LYY  RF       +     +PND   S+W +L E ++
Subjt:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL

Q1CM50 Lipoprotein NlpI6.0e-0530.67Show/hide
Query:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL
        G+ L + G+   +   FD  +ELDP        RG++LYY  RF       +     +PND   S+W +L E ++
Subjt:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL

Q7CKI5 Lipoprotein NlpI6.0e-0530.67Show/hide
Query:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL
        G+ L + G+   +   FD  +ELDP        RG++LYY  RF       +     +PND   S+W +L E ++
Subjt:  GMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQL

Arabidopsis top hitse value%identityAlignment
AT3G05625.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-10080.56Show/hide
Query:  PPLSLSRRLFLPSVSGIWDAITGG--NNPRDAVAAVRRGMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDT
        PP +LSRRLFLPSVS IWDAITGG  +NPR+A+AAVRRGM LFRQGDV+GS+AEFD+AI LDPRQKAYLWQRGLSLYY+DRFEEGA+QFR+DVAQNPNDT
Subjt:  PPLSLSRRLFLPSVSGIWDAITGG--NNPRDAVAAVRRGMLLFRQGDVSGSLAEFDKAIELDPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDT

Query:  EESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLVAAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDD
        EESIWCF+CEA+L+GVD AR QFLEVGRD RPVMREAYN+FK+GG+PEKLV  F SG+ +EYFYASLYAGLY EAE K + AK H+ AAC SPYGQRSDD
Subjt:  EESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLVAAFLSGRENEYFYASLYAGLYNEAEKKMDAAKQHIVAACQSPYGQRSDD

Query:  YMAALAKVHSLCRNWS
        YMA+LAKVH LCRNWS
Subjt:  YMAALAKVHSLCRNWS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGCAATTTCCAGTCTCAAACCCTCCATTTCCATGCCCCGTTCGTTTCACATATCTTTCACCCCGCCTTCTTTGCCGAATTCAATCAAAACCCTCGAAAATTTCCA
CTCTCCGAGCTCGCTTCCGATGAGAATCAAGCTCAATCACGTCCCGCCATTGTCGTTGTCTCGAAGATTGTTCCTTCCTTCAGTCTCTGGAATCTGGGACGCCATAACAG
GGGGAAACAACCCTCGCGACGCTGTTGCTGCTGTTCGACGTGGAATGCTTCTCTTCAGACAGGGCGATGTTTCGGGATCTTTAGCAGAATTTGATAAGGCGATTGAGTTG
GATCCTCGTCAAAAGGCATATCTTTGGCAAAGAGGGCTTTCACTTTACTACCTTGATAGATTTGAAGAGGGAGCTCAGCAGTTCCGACTAGATGTTGCACAAAATCCGAA
TGATACAGAGGAGTCTATATGGTGCTTTCTTTGTGAAGCCCAGTTGTATGGAGTTGATGAAGCAAGAAGGCAATTTCTTGAGGTAGGTAGAGATCCAAGACCAGTCATGC
GGGAAGCTTACAACATGTTTAAAGATGGTGGCGAACCAGAGAAACTTGTTGCTGCCTTCTTAAGTGGCCGTGAGAATGAATATTTTTATGCTTCTCTATATGCTGGGCTT
TATAACGAAGCAGAGAAAAAAATGGATGCAGCTAAACAACATATAGTTGCAGCTTGCCAGTCTCCTTATGGACAGAGGTCCGATGATTACATGGCTGCTCTTGCCAAAGT
TCACTCCCTCTGTAGAAACTGGAGTTTCAGTTGA
mRNA sequenceShow/hide mRNA sequence
CATTTTGGCGCCTTATCTTCATCAATTCACTTTATCTGAAAATGGGCAGTACAGAGCCATCGTCACCAAATTCTCACTAACACTCTGCTACTGAGCTCAATCTTCTCCAA
TTTCATCGAATTCATCATCAATGCCTGCAATTTCCAGTCTCAAACCCTCCATTTCCATGCCCCGTTCGTTTCACATATCTTTCACCCCGCCTTCTTTGCCGAATTCAATC
AAAACCCTCGAAAATTTCCACTCTCCGAGCTCGCTTCCGATGAGAATCAAGCTCAATCACGTCCCGCCATTGTCGTTGTCTCGAAGATTGTTCCTTCCTTCAGTCTCTGG
AATCTGGGACGCCATAACAGGGGGAAACAACCCTCGCGACGCTGTTGCTGCTGTTCGACGTGGAATGCTTCTCTTCAGACAGGGCGATGTTTCGGGATCTTTAGCAGAAT
TTGATAAGGCGATTGAGTTGGATCCTCGTCAAAAGGCATATCTTTGGCAAAGAGGGCTTTCACTTTACTACCTTGATAGATTTGAAGAGGGAGCTCAGCAGTTCCGACTA
GATGTTGCACAAAATCCGAATGATACAGAGGAGTCTATATGGTGCTTTCTTTGTGAAGCCCAGTTGTATGGAGTTGATGAAGCAAGAAGGCAATTTCTTGAGGTAGGTAG
AGATCCAAGACCAGTCATGCGGGAAGCTTACAACATGTTTAAAGATGGTGGCGAACCAGAGAAACTTGTTGCTGCCTTCTTAAGTGGCCGTGAGAATGAATATTTTTATG
CTTCTCTATATGCTGGGCTTTATAACGAAGCAGAGAAAAAAATGGATGCAGCTAAACAACATATAGTTGCAGCTTGCCAGTCTCCTTATGGACAGAGGTCCGATGATTAC
ATGGCTGCTCTTGCCAAAGTTCACTCCCTCTGTAGAAACTGGAGTTTCAGTTGAAGAAGTCTTGATTAGTTGCTTAGATCTCGTTTGATAACTATTTGGTTTTTGATTTT
TTGTTTTTAAAATTTAAGCTTAAAAATACTATTTCTACTCATAAGTTTCTATGTTTTCTTATCTATTTTGTACTTATATTTTCAAAACTCAAGTTAAGTTTTGAAAATTA
AAAAAATGAGTTTCAAAAACTTGTTTTTGTTTTTGAAATTTGGCTAGAAACTTGGATGATACCTCAAGAAATATGGAAATTATTGAGAAAAAGAGAAATTTTGAAAAAAT
AAGCAATTTTCAAAAGCCAAACCAAAAGTTAAATGATTATGAAGCGGACTTAGATACTATATTTTACCATTTCTTTTTCTTTTCCTTCTTGTTTTCAATCTAAATTTTAG
GACGACAAAGTTGTTATCAGGGTAGAGCATGAGCATTGTATGTTCACAAGTTCTATAACGACATTATGACGATGTCACTACCAAGTTGGCG
Protein sequenceShow/hide protein sequence
MPAISSLKPSISMPRSFHISFTPPSLPNSIKTLENFHSPSSLPMRIKLNHVPPLSLSRRLFLPSVSGIWDAITGGNNPRDAVAAVRRGMLLFRQGDVSGSLAEFDKAIEL
DPRQKAYLWQRGLSLYYLDRFEEGAQQFRLDVAQNPNDTEESIWCFLCEAQLYGVDEARRQFLEVGRDPRPVMREAYNMFKDGGEPEKLVAAFLSGRENEYFYASLYAGL
YNEAEKKMDAAKQHIVAACQSPYGQRSDDYMAALAKVHSLCRNWSFS