; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000466 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000466
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionzf-RVT domain-containing protein
Genome locationscaffold8:48228096..48244039
RNA-Seq ExpressionSpg000466
SyntenySpg000466
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149633.1 uncharacterized protein LOC101215314 isoform X1 [Cucumis sativus]6.8e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

XP_004149633.1 uncharacterized protein LOC101215314 isoform X1 [Cucumis sativus]2.2e-9590.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

XP_008449914.1 PREDICTED: uncharacterized protein LOC103491645 isoform X2 [Cucumis melo]2.5e-9490.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

XP_008449914.1 PREDICTED: uncharacterized protein LOC103491645 isoform X2 [Cucumis melo]6.8e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

XP_011653507.1 uncharacterized protein LOC101215314 isoform X2 [Cucumis sativus]6.8e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

XP_011653507.1 uncharacterized protein LOC101215314 isoform X2 [Cucumis sativus]2.9e-9592.12Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSG
        LSG
Subjt:  LSG

XP_016900772.1 PREDICTED: uncharacterized protein LOC103491645 isoform X3 [Cucumis melo]2.6e-9694.9Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDISWNISSEQYGPLFTF
        MLCNSLRDRLRP LRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+ WNISSEQYGPLFTF
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDISWNISSEQYGPLFTF

Query:  SVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKCLSGVT
        SVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKCLSG++
Subjt:  SVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKCLSGVT

XP_016900772.1 PREDICTED: uncharacterized protein LOC103491645 isoform X3 [Cucumis melo]6.8e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

XP_016900772.1 PREDICTED: uncharacterized protein LOC103491645 isoform X3 [Cucumis melo]2.2e-9590.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

TrEMBL top hitse value%identityAlignment
A0A0A0KWY7 Uncharacterized protein3.3e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

A0A0A0KWY7 Uncharacterized protein1.4e-9592.12Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSG
        LSG
Subjt:  LSG

A0A1S3BP27 uncharacterized protein LOC103491645 isoform X21.2e-9490.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

A0A1S3BP27 uncharacterized protein LOC103491645 isoform X23.3e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

A0A1S3BP27 uncharacterized protein LOC103491645 isoform X21.2e-9490.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRP LRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        EQYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

A0A1S4DXR3 uncharacterized protein LOC103491645 isoform X13.3e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

A0A1S4DXR6 uncharacterized protein LOC103491645 isoform X31.3e-9694.9Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDISWNISSEQYGPLFTF
        MLCNSLRDRLRP LRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+ WNISSEQYGPLFTF
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDISWNISSEQYGPLFTF

Query:  SVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKCLSGVT
        SVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKCLSG++
Subjt:  SVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKCLSGVT

A0A1S4DXR6 uncharacterized protein LOC103491645 isoform X33.3e-0490Show/hide
Query:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD
        ISHFG GDNGSTSGPDVS+SKLSRH QVAD
Subjt:  ISHFGIGDNGSTSGPDVSQSKLSRHSQVAD

A0A1S4DXR6 uncharacterized protein LOC103491645 isoform X31.1e-9590.73Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS
        MLCNSLRDRLRPWLRDYDRLQSFAVILIY+QIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLD+         +WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDI---------SWNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC
        +QYGPLFTFSVKLTLAMQIIGFSVRL SSLLWIQIYRLGISYMET VPREADYDLRNSFLSPATP+VVRQPSGSDD+IGGSIYDPTYYSSLFEDGQDSKC
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQDSKC

Query:  LSGVT
        LSG++
Subjt:  LSGVT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55535.1 unknown protein5.7e-7371.57Show/hide
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNIS
        MMLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYAVLLF A+ LDIS         W+IS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQ
        +E YG  F FSVKLT+AM++IGF VRL SSLLW QIYRLG + ++T +PRE D DLRNSFL+P TP + RQ SG+++I+GGSIYDP YY+SLFE+ Q
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQ

AT1G55535.2 unknown protein5.7e-7371.57Show/hide
Query:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNIS
        MMLC SLRDR+ PWLRDY +LQS AV LIY QIGCALIGSLGALYNGVLLINLAIALFALVAIES+SQSLGRTYAVLLF A+ LDIS         W+IS
Subjt:  MMLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNIS

Query:  SEQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQ
        +E YG  F FSVKLT+AM++IGF VRL SSLLW QIYRLG + ++T +PRE D DLRNSFL+P TP + RQ SG+++I+GGSIYDP YY+SLFE+ Q
Subjt:  SEQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPSGSDDIIGGSIYDPTYYSSLFEDGQ

AT3G13420.1 unknown protein4.7e-5958.48Show/hide
Query:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNISS
        MLC SLR+R+  WLRDY RLQS  +ILIY QIGCALIGSLGALYNGV+LINLAIALF LVAIES+SQSLGRTYAVLLF AI LD+S         WNISS
Subjt:  MLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLAIALFALVAIESSSQSLGRTYAVLLFSAIFLDIS---------WNISS

Query:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSP------------------ATPIVVRQPSGSDDIIGGSI
        + Y   + FSVKLTLAM+I GF VRL SSLLW QIYRLG S +++P PR++D DLRNSFL P                    P + +Q S SD+I+  SI
Subjt:  EQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSP------------------ATPIVVRQPSGSDDIIGGSI

Query:  YDPTYYSSLFEDGQDSKCLSGVTL
         +P  Y+ L + G     LS +TL
Subjt:  YDPTYYSSLFEDGQDSKCLSGVTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCTTCGCCTCGGCCTCCGTTCTGTTCCCCTTTTCCACCGAATTCGCTGTTTCCTATCGGAGCAAACGCACGGATTCCTGGAATTCCTCAACCAGATTCTCCACTTCGCTC
TGGCTTTCAAGTAATTCCTTCTCCACATCTTCCTCTTTGTGGATTTTTCAAATGGTATATGATGCTTTGCAATTCATTGAGAGATCGACTTCGGCCATGGCTTCGTGATT
ATGATAGGCTTCAGTCTTTCGCAGTCATTCTCATTTATGTCCAGATCGGGTGCGCATTGATTGGATCCTTAGGGGCATTGTACAATGGTGTATTGCTTATAAATTTGGCA
ATCGCATTATTCGCTTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGCCGAACGTATGCTGTTCTCCTGTTCTCTGCAATTTTCCTCGACATCTCCTGGAACATCTC
ATCTGAGCAATATGGACCCCTTTTTACATTTTCAGTGAAGCTTACTCTGGCTATGCAGATTATTGGATTTTCTGTTAGACTATGGTCTTCACTACTGTGGATTCAAATAT
ACAGATTGGGGATTTCATACATGGAAACTCCAGTTCCCCGAGAGGCAGATTATGATTTGAGAAATAGTTTTCTTAGCCCAGCTACTCCTATTGTAGTTAGACAACCCTCA
GGTTCTGATGATATAATAGGGGGCTCTATCTACGATCCAACCTATTACTCGTCCCTCTTTGAAGATGGTCAAGATAGTAAATGTTTGTCTGGGGTAACATTGGATTGTTG
CTTAGCCAAAGTAGAAGTTAAAAGAAACCTTTGTGGTTTTGTTCCACACTTAATTGAAATTAAAGATGGAATGTTAGGAAGTATTTTCGTTCGAACTATGGTCTGTGATA
TATCTCCATCCCAATCTTCACCTTTTGAATGCTTTATTTTGGATTCTAGCAAGCTTGATAACTCTCTGGATTTTTTTCACTTCCAAGTGGTCTTGGAAGATGAACCGTCT
TTTGTACCACCATTTATTGAAAATACGAATTTGGAAGAGGCCTTGGTTATTGATGTATTAGATAAAGTTGATCAACAAATTAATGAGTCTTTGGTTGACGAAACCTCATG
TATTGGTGTCTGCTTTCATGATTCAAATTATCGGTTGAGCATTAATGATGGAGTTGGTAAGACTGTATCTTTGAATTCTTCAGTTTTTGATAAGGGTGTTGATAAGACCA
TTAATGAGACCTTGGTAGAAGAAAACTCATTATTGATGCTCCTTTTTGTGGATATTGAAACACCTTTTGAAGCTTCCTTTAAGGCTTATGAGCCTACAAAGCCTCTGAGT
GTTAATAAGGATACTCTTCCTATTCCTCCATCCTTAGTACCTGCCAAATTTGCTTCTCTTATTGAAGCCTATGGTATCGAGATGCGTGAAATTCCCCTTTGGGTTAAGTA
TGAAACGGTCTTCGGTTCAAGATTCTCTTTGGCTGGTTTTGTCTTTAGAGGTTCCCGGGTGTTTCGGCAGGCTTTCTTTAGTGTTTTCTTTCAGGGGTTCTCTCTTGCAA
GCTTGTCTCTATCGAGAGTTTGTTCTTTGAGGCTTTTGGAGTGCAGGATCTCCCATTTTGGTATTGGTGATAATGGTTCTACTTCTGGACCAGATGTGTCTCAATCAAAG
CTATCCAGACATTCCCAAGTAGCTGATGTGAGTGCCCTTATCTCAGATAATGAAGTCTGGACTTTAATGTCACCATATGGTCTTGTGAGAAATGCACAGAAGACCAACAA
CCCGGAGTATGAGGATTTTCAATTGGTTTATCACCTCCCACTGGCTTCCTTGATATGCCCCTACAAAATGGCAAATACATATGGTCCAGTTTTAGAGAAAGACCAGCTAT
GTCTCTCTTTGACAGATTTTTTACCACCAATGTATGCCTCTGCAAATCAGAGACTTGCTCGAACAAAAATCAAAGAAAACCTCCTGGAAATGGATGCTATGGAAAAAGCA
TCTCTTGGAGACAAAAATGTAAGTCTGGCCAAGGATGGAAAAGGCATGCACAATGTCAGATGGGATATTACCGAGAAGCCCCTTCACCTTGGAGGTTTGGGTATTGGCGC
AACCAAAACAACGAAGCAGCAGATCCCCTTTTCAAAAGCATCTGGGAAGGAAAATACCTTAGGAAAGCAAAGATTTTCCCTTGGGAGCTCAGCCTCAAGGGAATCAACAC
CAATGACAGATCGTAAAAGAAGCTACCGCATCAGATCTCATCTCCATCAGTCTGTGTTTTCTGCTGGAAGGAATCAGAGACGCAAAGCCATCTTTTCGTCACTTGCTCTT
TTTGTGTGCATTTCTGGCCCTTGGTTTTGGAATCCTTTGGCTGGTTTACTGTGCTGCCTACCAACATTCACAATTTCCTCTCCTATATCCTTATGGGACATCCCTTTACT
GGCCAAAAGATCATTTGGAGCTACCTCATCAAAGCTTTCTTTTGGACCCTATGGAAATTAA
mRNA sequenceShow/hide mRNA sequence
GCTTCGCCTCGGCCTCCGTTCTGTTCCCCTTTTCCACCGAATTCGCTGTTTCCTATCGGAGCAAACGCACGGATTCCTGGAATTCCTCAACCAGATTCTCCACTTCGCTC
TGGCTTTCAAGTAATTCCTTCTCCACATCTTCCTCTTTGTGGATTTTTCAAATGGTATATGATGCTTTGCAATTCATTGAGAGATCGACTTCGGCCATGGCTTCGTGATT
ATGATAGGCTTCAGTCTTTCGCAGTCATTCTCATTTATGTCCAGATCGGGTGCGCATTGATTGGATCCTTAGGGGCATTGTACAATGGTGTATTGCTTATAAATTTGGCA
ATCGCATTATTCGCTTTGGTAGCCATAGAGAGCAGCAGTCAGAGTCTTGGCCGAACGTATGCTGTTCTCCTGTTCTCTGCAATTTTCCTCGACATCTCCTGGAACATCTC
ATCTGAGCAATATGGACCCCTTTTTACATTTTCAGTGAAGCTTACTCTGGCTATGCAGATTATTGGATTTTCTGTTAGACTATGGTCTTCACTACTGTGGATTCAAATAT
ACAGATTGGGGATTTCATACATGGAAACTCCAGTTCCCCGAGAGGCAGATTATGATTTGAGAAATAGTTTTCTTAGCCCAGCTACTCCTATTGTAGTTAGACAACCCTCA
GGTTCTGATGATATAATAGGGGGCTCTATCTACGATCCAACCTATTACTCGTCCCTCTTTGAAGATGGTCAAGATAGTAAATGTTTGTCTGGGGTAACATTGGATTGTTG
CTTAGCCAAAGTAGAAGTTAAAAGAAACCTTTGTGGTTTTGTTCCACACTTAATTGAAATTAAAGATGGAATGTTAGGAAGTATTTTCGTTCGAACTATGGTCTGTGATA
TATCTCCATCCCAATCTTCACCTTTTGAATGCTTTATTTTGGATTCTAGCAAGCTTGATAACTCTCTGGATTTTTTTCACTTCCAAGTGGTCTTGGAAGATGAACCGTCT
TTTGTACCACCATTTATTGAAAATACGAATTTGGAAGAGGCCTTGGTTATTGATGTATTAGATAAAGTTGATCAACAAATTAATGAGTCTTTGGTTGACGAAACCTCATG
TATTGGTGTCTGCTTTCATGATTCAAATTATCGGTTGAGCATTAATGATGGAGTTGGTAAGACTGTATCTTTGAATTCTTCAGTTTTTGATAAGGGTGTTGATAAGACCA
TTAATGAGACCTTGGTAGAAGAAAACTCATTATTGATGCTCCTTTTTGTGGATATTGAAACACCTTTTGAAGCTTCCTTTAAGGCTTATGAGCCTACAAAGCCTCTGAGT
GTTAATAAGGATACTCTTCCTATTCCTCCATCCTTAGTACCTGCCAAATTTGCTTCTCTTATTGAAGCCTATGGTATCGAGATGCGTGAAATTCCCCTTTGGGTTAAGTA
TGAAACGGTCTTCGGTTCAAGATTCTCTTTGGCTGGTTTTGTCTTTAGAGGTTCCCGGGTGTTTCGGCAGGCTTTCTTTAGTGTTTTCTTTCAGGGGTTCTCTCTTGCAA
GCTTGTCTCTATCGAGAGTTTGTTCTTTGAGGCTTTTGGAGTGCAGGATCTCCCATTTTGGTATTGGTGATAATGGTTCTACTTCTGGACCAGATGTGTCTCAATCAAAG
CTATCCAGACATTCCCAAGTAGCTGATGTGAGTGCCCTTATCTCAGATAATGAAGTCTGGACTTTAATGTCACCATATGGTCTTGTGAGAAATGCACAGAAGACCAACAA
CCCGGAGTATGAGGATTTTCAATTGGTTTATCACCTCCCACTGGCTTCCTTGATATGCCCCTACAAAATGGCAAATACATATGGTCCAGTTTTAGAGAAAGACCAGCTAT
GTCTCTCTTTGACAGATTTTTTACCACCAATGTATGCCTCTGCAAATCAGAGACTTGCTCGAACAAAAATCAAAGAAAACCTCCTGGAAATGGATGCTATGGAAAAAGCA
TCTCTTGGAGACAAAAATGTAAGTCTGGCCAAGGATGGAAAAGGCATGCACAATGTCAGATGGGATATTACCGAGAAGCCCCTTCACCTTGGAGGTTTGGGTATTGGCGC
AACCAAAACAACGAAGCAGCAGATCCCCTTTTCAAAAGCATCTGGGAAGGAAAATACCTTAGGAAAGCAAAGATTTTCCCTTGGGAGCTCAGCCTCAAGGGAATCAACAC
CAATGACAGATCGTAAAAGAAGCTACCGCATCAGATCTCATCTCCATCAGTCTGTGTTTTCTGCTGGAAGGAATCAGAGACGCAAAGCCATCTTTTCGTCACTTGCTCTT
TTTGTGTGCATTTCTGGCCCTTGGTTTTGGAATCCTTTGGCTGGTTTACTGTGCTGCCTACCAACATTCACAATTTCCTCTCCTATATCCTTATGGGACATCCCTTTACT
GGCCAAAAGATCATTTGGAGCTACCTCATCAAAGCTTTCTTTTGGACCCTATGGAAATTAA
Protein sequenceShow/hide protein sequence
ASPRPPFCSPFPPNSLFPIGANARIPGIPQPDSPLRSGFQVIPSPHLPLCGFFKWYMMLCNSLRDRLRPWLRDYDRLQSFAVILIYVQIGCALIGSLGALYNGVLLINLA
IALFALVAIESSSQSLGRTYAVLLFSAIFLDISWNISSEQYGPLFTFSVKLTLAMQIIGFSVRLWSSLLWIQIYRLGISYMETPVPREADYDLRNSFLSPATPIVVRQPS
GSDDIIGGSIYDPTYYSSLFEDGQDSKCLSGVTLDCCLAKVEVKRNLCGFVPHLIEIKDGMLGSIFVRTMVCDISPSQSSPFECFILDSSKLDNSLDFFHFQVVLEDEPS
FVPPFIENTNLEEALVIDVLDKVDQQINESLVDETSCIGVCFHDSNYRLSINDGVGKTVSLNSSVFDKGVDKTINETLVEENSLLMLLFVDIETPFEASFKAYEPTKPLS
VNKDTLPIPPSLVPAKFASLIEAYGIEMREIPLWVKYETVFGSRFSLAGFVFRGSRVFRQAFFSVFFQGFSLASLSLSRVCSLRLLECRISHFGIGDNGSTSGPDVSQSK
LSRHSQVADVSALISDNEVWTLMSPYGLVRNAQKTNNPEYEDFQLVYHLPLASLICPYKMANTYGPVLEKDQLCLSLTDFLPPMYASANQRLARTKIKENLLEMDAMEKA
SLGDKNVSLAKDGKGMHNVRWDITEKPLHLGGLGIGATKTTKQQIPFSKASGKENTLGKQRFSLGSSASRESTPMTDRKRSYRIRSHLHQSVFSAGRNQRRKAIFSSLAL
FVCISGPWFWNPLAGLLCCLPTFTISSPISLWDIPLLAKRSFGATSSKLSFGPYGN