CuGenDBv2

Tan0009302 (gene) of Snake gourd v1 genome

Gene ID: Tan0009302
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Genome location: LG04:80686754..80690719
RNA-Seq Expression: Tan0009302
Synteny: Tan0009302
Gene Ontology terms: NA
InterPro domains: NA
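The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQ:START..END' string into (sequence name, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("LG04:80686754..80690719")
print(name, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 3966 bases on LG04.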


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.5e-133; identity 92.25%
Query:  MGVESNSAPPPP----SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD
        MGVESNS PPPP    SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSAPPPP----SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD

Query:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
        SSMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL ECSSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Subjt:  SSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG

XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]; e value 2.4e-136; identity 94.07%
Query:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRP SGV PSTGTNTSLGCS  SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSSM  GVSE EYNEQK+CKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]; e value 1.3e-134; identity 92.96%
Query:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSSM  GVSE EYNEQK+CKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]; e value 1.9e-133; identity 92.59%
Query:  MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS PPPP   SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
        SMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL ECSSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Subjt:  SMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]; e value 7.1e-136; identity 94.01%
Query:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPP  SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFP+QSDSSVPTSPVSPYRYQRP SGVTPSTGTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEERP  CIKSLVDER +QLEECSSMGVSEPEYNE+K+CKDLNRDMKDSESGG
Subjt:  LPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWN0 Uncharacterized protein; e value 1.2e-136; identity 94.07%
Query:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRP SGV PSTGTNTSLGCS  SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCS-ISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSSM  GVSE EYNEQK+CKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X1; e value 6.5e-135; identity 92.96%
Query:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSSM  GVSE EYNEQK+CKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG

A0A5A7TRC2 Uncharacterized protein; e value 6.5e-135; identity 92.96%
Query:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPP+SSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-PPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+ PS GTNTSLGCS SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP  CIKSLVDERVYQLEECSSM  GVSE EYNEQK+CKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSM--GVSEPEYNEQKTCKDLNRDMKDSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X1; e value 9.3e-134; identity 92.59%
Query:  MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS PPPP   SSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPP---SSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRP SG+TPSTGTNTSLGC+  P+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
        SMELPYCSMPEPGPNIEAEER    IKSLVDERVYQL ECSSMGVSEPEYNEQK+CKDLNR+MKDSESGG
Subjt:  SMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X1; e value 6.1e-133; identity 92.13%
Query:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSAPPPP SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVGD LSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPSQSD+SVPTSPVSPYRYQRP SG+TPST TNTSLGC+ SP+TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEER    IKSLVDERVYQL ECS+MGVSEPEYNEQK+CKDLNR+MKD ESGG
Subjt:  LPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2); e value 9.7e-59; identity 55.81%
Query:  ESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE--
        E+N  P P S S    SP GKR RDPEDEVYLDN  S KRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  
Subjt:  ESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE--

Query:  TSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSL----GCSISPI------TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRA
        T+T    S    S PTSPVSPYRYQRPL+       + T L     C  S I      T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQMR 
Subjt:  TSTNLFPSQSDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSL----GCSISPI------TSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRA

Query:  QPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQL-EECSSMGVSEPEYNEQKTCKDLN
        QP G SS   P         NI+ EER   C KS+ ++R Y   E+     VS    ++ K+CK L+
Subjt:  QPPGPSSMELPYCSMPEPGPNIEAEERPYPCIKSLVDERVYQL-EECSSMGVSEPEYNEQKTCKDLN
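The % identity column can be related to the alignment rows above. In the usual BLAST-style layout, the unlabeled middle row repeats a residue letter where query and subject match, shows `+` for a conservative substitution, and leaves a space otherwise. The sketch below counts the letters in that row; the function name `percent_identity` is hypothetical, and the denominator (full alignment length including gap columns) is an assumption, so the result may not reproduce the database's reported values exactly.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns marked identical in a BLAST-style midline.

    query_row supplies the alignment length, because trailing spaces in the
    midline are often lost when a page is copied as text.
    """
    if not query_row:
        raise ValueError("empty alignment row")
    # A letter in the midline marks an identical residue pair;
    # '+' (conservative substitution) and ' ' (mismatch or gap) are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_row)


# First ten columns of the Arabidopsis alignment shown above.
query_row = "ESNSAPPPPS"
midline = "E+N  P P S"
print(percent_identity(query_row, midline))
```

For a whole hit, the rows of every alignment block would be concatenated before calling the function.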


Sequences
CDS sequence
ATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCTCCGAGCGGGAAGCGGGCCAGAGATCCCGAGGATGAAGTTTATCTCGACAA
TTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCGG
AGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCGATGTCGGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCGACAAACTTATTTCCCTCGCAA
TCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCAGAGGCCATTGAGCGGGGTGACTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTATTAG
TCCCATCACTAGCTTGCAGCCCCATCAGCGCGGATCAGATTCCGAAGGTCGTTTCCCATCATCTCCCAGCGATATATGCCACTCAGCAGACTTGAGAAGAGCTGCGCTCC
TGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCCGAAGAACGACCTTAT
CCTTGCATAAAATCGTTGGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAGCCTGAATATAATGAACAAAAAACATGCAAGGACTTGAA
CAGAGATATGAAAGACAGTGAGTCTGGAGGTTAG
mRNA sequence
AAAATCCACAACAAATTATGAAAACAAAACAGAGCCCCACAAATCACCTCTTTTCCCCTGCCCAGAAAGGAAAGGAAAAACCAGAGGAAACTACTTTTGTTGTTCTTCTT
CTACAATCCAAAACTGGTTACAGAGATTGATGATTCCAATTTCCAAAAATTGCCCACATTACCCTCTTTTCCAAACCAAACGATCGTCGAATTTGAACTTTTGTTCTTGA
ATTTCTTGCTCGGAAAACTGTTGAAGCTGCCCTGATGGGCGTCGAATCGAACTCAGCGCCGCCGCCGCCATCATCATCGTCTTCTACGCCATCTCCGAGCGGGAAGCGGG
CCAGAGATCCCGAGGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTT
TCAGAGAATCTCATGGATTCCCCTGCAAGGTCGGAGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCGATGTCGGAAGATTCAGATGACTGCCGGTT
TTGTGAGACATCGACAAACTTATTTCCCTCGCAATCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCAGAGGCCATTGAGCGGGGTGACTCCTTCAA
CAGGTACTAATACTTCACTTGGATGTTCTATTAGTCCCATCACTAGCTTGCAGCCCCATCAGCGCGGATCAGATTCCGAAGGTCGTTTCCCATCATCTCCCAGCGATATA
TGCCACTCAGCAGACTTGAGAAGAGCTGCGCTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCC
TGGACCTAATATAGAAGCCGAAGAACGACCTTATCCTTGCATAAAATCGTTGGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAGCCTG
AATATAATGAACAAAAAACATGCAAGGACTTGAACAGAGATATGAAAGACAGTGAGTCTGGAGGTTAGTAAATACTAAAAAAATGTACAGAGGACAATGTATTCTGACAT
CACACACGGGACACATTGCTGCTGTGTGACCGAAGTGTTTGCCTTTTGGTTCAATCACCTTGGAAAATTCTCCTTTTGCATATAGCTTCCAAAATTTACTCGATTTCGCT
TGTTCGAGGATGGTTTGGTCTTCTGCTAGGTTCTTTCTCGAATTCGTGGTGGCTTGTGCTATCTACATTGTTGAGCATGTGTGGAAGTTCTGCCTATCTATTTGATTCTT
CCGAAGCACGAGTTGTGAAGATGTTCATTGTTGAGCATGTGTGGAAGCTCTGTTTATCTATCCGATTCATGCCGAAGCCCGAGTTGAGAGAAGATCTTCGTTGTGGAGCA
TGTGTGGAAGCTCTGTCTATCTATTTGATTCTACACGAGCTCGAGTCGAGACAAAATGCTGGGACCTTAATGGGTCATAAAGTTGAAGTTGCAAGTGTTCCATTTGTTTC
ATATTTGTCAGGCTGTTATTATTGAATTCTTTGTTCTAATTTGCCTTGGGAATTTGCAG
Protein sequence
MGVESNSAPPPPSSSSSTPSPSGKRARDPEDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQ
SDSSVPTSPVSPYRYQRPLSGVTPSTGTNTSLGCSISPITSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPY
PCIKSLVDERVYQLEECSSMGVSEPEYNEQKTCKDLNRDMKDSESGG
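The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch assuming the standard genetic code; the `translate` helper is hypothetical and written in plain Python rather than taken from any library. It is shown on the first 21 bases and the last 9 bases of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGCGTCGAATCGAACTCA"))  # start of the CDS
print(translate("GGAGGTTAG"))              # end of the CDS, including the stop codon
```

The two fragments translate to `MGVESNS` and `GG`, matching the first and last residues of the protein sequence.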