; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031040 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031040
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationchr11:4089764..4095149
RNA-Seq ExpressionLag0031040
SyntenyLag0031040
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]1.2e-13292.25Show/hide
Query:  MGVESNSAAAPP----SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD
        MGVESNS   PP    SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSAAAPP----SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD

Query:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  TSLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG
        +S+ELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEY EQKSCKDLNR+MKDSESGG
Subjt:  TSLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG

XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]1.0e-13694.81Show/hide
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP+S+
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EY EQKSCKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]3.6e-13492.96Show/hide
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+S+
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EY EQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]8.8e-13392.59Show/hide
Query:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS   PP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPT
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPT

Query:  SLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG
        S+ELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEY EQKSCKDLNR+MKDSESGG
Subjt:  SLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]5.0e-13694.38Show/hide
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNS  APP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSLE
        ETSTNLFP+QSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+S+E
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSLE

Query:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEERP SCIKSLVDER +QLEECSSMGVSEPEY E+KSCKDLNRDMKDSESGG
Subjt:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein4.9e-13794.81Show/hide
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP+S+
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EY EQKSCKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X11.7e-13492.96Show/hide
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+S+
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EY EQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG

A0A5A7TRC2 Uncharacterized protein1.7e-13492.96Show/hide
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+S+
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSL

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EY EQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYIEQKSCKDLNRDMKDSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X14.3e-13392.59Show/hide
Query:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS   PP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPT
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPT

Query:  SLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG
        S+ELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEY EQKSCKDLNR+MKDSESGG
Subjt:  SLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X12.8e-13292.13Show/hide
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD LSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSLE
        ETSTNLFPSQSD+SVPTSPVSPYRYQRPFSG+ PST TNTSLGC+TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP+S+E
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPTSLE

Query:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECS+MGVSEPEY EQKSCKDLNR+MKD ESGG
Subjt:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)4.9e-5752.79Show/hide
Query:  GVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE
        G      A  P    S  SP GKR RDP+DEVYLDN  S KRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE
Subjt:  GVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE

Query:  --TSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM
          T+T    S    S PTSPVSPYRYQRP +       + T          S+  + +  T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQM
Subjt:  --TSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM

Query:  RAQPPGPTSLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQL-EECSSMGVSEPEYIEQKSCKDLN
        R QP G +S   P         NI+ EER   C KS+ ++R Y   E+     VS     + KSCK L+
Subjt:  RAQPPGPTSLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQL-EECSSMGVSEPEYIEQKSCKDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTTGCCACGTCATCGCCACGTCACCAAAATTTGACCGTTGTGGAGCCACGTCATCCGCCACATCAGCTGCCACATCATCATTTTTGGTCGATTGTTGTGCCACA
TCAGCGCCACGTCGGATGCCACATCAGCGACTGCCACGTGTCACTGCCACAGGCAGTAGGGACCAAATGGAGCAGGAAGAACTCGGCTCGCGCGGTCGGCCGAGGCCGAG
CACGGGGTCGGGCCAAAAGCCCAACCCCTTCGGTCTTGGCCCGTCCTACTTGTCGGTCTCGCCTCTGGGGTCCATCTCTCAGTCCTATTTCTGTCATCTATCCTCGTCAG
CTTCTTGTACATCGGAATGGTCCAAAATTACCCATAACAGCCAACATTTTCCTCTTTCCCAATCCAAATGATCCTCGATTTCATACTTCTGTGCTTGGATTTCTTGCTCG
GAAAACTCTCGAAGCTGCCCTAATGGGTGTCGAATCGAACTCGGCGGCGGCGCCGCCATCGTCGTCGTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCCGAGATCCTG
ACGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTC
ATGGATTCCCCAGCAAGGTCGGAGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGGTTTTGTGAGACATC
CACAAACTTATTTCCCTCGCAGTCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATACCGATATCAGAGGCCATTCAGCGGGGTGGCTCCGTCAACAGGTACTAATA
CTTCACTTGGATGTTCTACTAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGACTCCGAGGGTCGTTTCCCATCGTCTCCTAGTGATATATGCCACTCAGCA
GACTTGAGAAGGGCTGCACTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCAACATCTCTGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATAT
AGAAGCTGAAGAGCGGCCATATTCTTGCATCAAATCGTTAGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAACCTGAATATATTGAAC
AGAAATCATGCAAGGACTTGAACAGGGATATGAAAGACAGTGAGTCTGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACTTGCCACGTCATCGCCACGTCACCAAAATTTGACCGTTGTGGAGCCACGTCATCCGCCACATCAGCTGCCACATCATCATTTTTGGTCGATTGTTGTGCCACA
TCAGCGCCACGTCGGATGCCACATCAGCGACTGCCACGTGTCACTGCCACAGGCAGTAGGGACCAAATGGAGCAGGAAGAACTCGGCTCGCGCGGTCGGCCGAGGCCGAG
CACGGGGTCGGGCCAAAAGCCCAACCCCTTCGGTCTTGGCCCGTCCTACTTGTCGGTCTCGCCTCTGGGGTCCATCTCTCAGTCCTATTTCTGTCATCTATCCTCGTCAG
CTTCTTGTACATCGGAATGGTCCAAAATTACCCATAACAGCCAACATTTTCCTCTTTCCCAATCCAAATGATCCTCGATTTCATACTTCTGTGCTTGGATTTCTTGCTCG
GAAAACTCTCGAAGCTGCCCTAATGGGTGTCGAATCGAACTCGGCGGCGGCGCCGCCATCGTCGTCGTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCCGAGATCCTG
ACGATGAAGTTTATCTCGACAATTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTC
ATGGATTCCCCAGCAAGGTCGGAGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGGTTTTGTGAGACATC
CACAAACTTATTTCCCTCGCAGTCTGATAGTAGTGTACCTACCAGTCCAGTCTCTCCATACCGATATCAGAGGCCATTCAGCGGGGTGGCTCCGTCAACAGGTACTAATA
CTTCACTTGGATGTTCTACTAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGACTCCGAGGGTCGTTTCCCATCGTCTCCTAGTGATATATGCCACTCAGCA
GACTTGAGAAGGGCTGCACTCCTGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCAACATCTCTGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATAT
AGAAGCTGAAGAGCGGCCATATTCTTGCATCAAATCGTTAGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAACCTGAATATATTGAAC
AGAAATCATGCAAGGACTTGAACAGGGATATGAAAGACAGTGAGTCTGGAGGGTAG
Protein sequenceShow/hide protein sequence
MSLATSSPRHQNLTVVEPRHPPHQLPHHHFWSIVVPHQRHVGCHISDCHVSLPQAVGTKWSRKNSARAVGRGRARGRAKSPTPSVLARPTCRSRLWGPSLSPISVIYPRQ
LLVHRNGPKLPITANIFLFPNPNDPRFHTSVLGFLARKTLEAALMGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENL
MDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSA
DLRRAALLRSVQMRAQPPGPTSLELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYIEQKSCKDLNRDMKDSESGG