CuGenDBv2

Tan0002534 (gene) of Snake gourd v1 genome

Gene ID: Tan0002534
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LEA_2 domain-containing protein
Genome location: LG06:62136242..62137131
RNA-Seq Expression: Tan0002534
Synteny: Tan0002534
Gene Ontology terms:
  GO:0032259 - methylation (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
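The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split 'SEQID:START..END' into its parts and compute the span length.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    # Inclusive coordinates: both end points count toward the length.
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("LG06:62136242..62137131")
print(seqid, length)  # LG06 890
```

Under that assumption the gene spans 890 bp on LG06.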


Homology
GenBank top hits | e value | % identity
KAG6575200.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-78 | 75.8
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLAPT   RPSSDD +  LHLKRI+RRR IK   F++ LL+IL+VIV+ ILMFT+F+VKDP IQMN ISIT++ELING+IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPN+ASFKYSNTTTTLYINET IGEARGPPGQAKARRT +MN+TINIV D+LL NL  D++SGKL LRSFSR+PGRVKLL+I+RR IVVKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.7e-79 | 76.26
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLAPTT  RPSSDD +  LHLKRI+RRR IK   F++ LL+IL+VIV+ ILMFT+F+VKDP IQMN ISIT++ELING+IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPN+ASFKYSNTTTTLYINET IGEARGPPGQAKARRT +MN+TINIV D+LL NL  D++SGKL LRSFSR+PGRVKLL+I+RR IVVKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

XP_022959336.1 uncharacterized protein LOC111460339 [Cucurbita moschata] | 4.9e-79 | 74.89
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLAP TD RPSSDD +  LHLKRI+RRR IK   F++ LL+IL+V +++IL+FT+F+VKDP IQMN ISIT++ELING+IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPN+ASFKYSNTTTTLYINET IGEARGPPGQAKARRT RMN+TINIV D+LL NL  D++SGKL LRSFSR+PGRVK+L+I+RR IVVKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

XP_023548342.1 uncharacterized protein LOC111807010 [Cucurbita pepo subsp. pepo] | 5.4e-78 | 76.26
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLAP TD RPS+DD +  LHLK  R+RR IK   F++ LLVIL+V+V+ IL+FT+F+VKDP IQMN ISIT++ELINGIIPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPNMASFKYSNTTTTLYINET IGEARGPPGQAKARRT RMN+TINIV DQLL NL  D++SGKL LRSFSR+PGRVKLL+IIRR I+VKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

XP_038875202.1 uncharacterized protein LOC120067718 [Benincasa hispida] | 2.9e-79 | 73.52
Query:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M++K+QA+PLAP T  R SSD+ ET LHLKRI+RRR IKCCGF+V  L+I  + I++ILMFT+F++KDP I+MN +SIT++ELING IPKPGSNMSLTAD
Subjt:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPNMASFKYSNTTTTL+INET IGEARGPPG+AKARRT RMN+TI+IV D++L+NL  DV+ GK+ LRSFSRIPGRVKLL++I R +VVKMNCTF+
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI NRSIEDQ+CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

TrEMBL top hits | e value | % identity
A0A0A0KD33 LEA_2 domain-containing protein | 1.6e-75 | 70.32
Query:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVI-LAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M++K+QA+PL P T +R SSD+ ET LHLKRI+R+R IKCC F+VALL+I   VI++ILMFT+F++KDP IQMN +SIT++ELIN +IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVI-LAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPNMASFKYSNTTTTL+INET IGE RGP G+AKAR+T RMN+TI+IV D++L+NL  DV+ GK+ LRSFSRIPG+VKLL+ I R +VVKMNCTF+
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI ++SIEDQKCKRK+K+
Subjt:  INIINRSIEDQKCKRKVKL

A0A1S3C8G8 uncharacterized protein LOC103497685 | 4.2e-76 | 70.32
Query:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVI-LAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M+ K+QA+PL P T DR SSD+ ET LHLKRI+R+R IKCC F+ ALL+I   VI++ILMFT+F++KDP I+MN +SIT++ELIN +IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTT-DRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVI-LAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPNMASFKYSNTTTTL+INET IGE RGPPG+AKAR+T RMN+TI+IV D++L+NL  DV+ GK+ LRSFSRIPG+VKLL++I R +VVKMNCTF+
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI ++SIEDQKCKRK+K+
Subjt:  INIINRSIEDQKCKRKVKL

A0A2I4GMG7 uncharacterized protein LOC109009171 | 3.0e-74 | 65.75
Query:  MMEKEQARPLAPTTDRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVS
        M+E++QARPLAP+TDRPSSD++E  LH++++RR+R +K CG + ALL+I AV+++IL+FTVF VKDP I+MNGI++T++ELING  PKPG+NMSLTADVS
Subjt:  MMEKEQARPLAPTTDRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVS

Query:  VKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLAN----LDVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VKNPN+ASFKY NTTTTL+ N T +GEARGPPGQAK RRT RMNIT++I+TDQLL+N     DV S  LS+ S+SRIPGRVK++ II++ +VVKMNCTF 
Subjt:  VKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLAN----LDVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        +NI +++I+ QKCKRKV L
Subjt:  INIINRSIEDQKCKRKVKL

A0A6J1H4K3 uncharacterized protein LOC111460339 | 2.4e-79 | 74.89
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLAP TD RPSSDD +  LHLKRI+RRR IK   F++ LL+IL+V +++IL+FT+F+VKDP IQMN ISIT++ELING+IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAV-IVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPN+ASFKYSNTTTTLYINET IGEARGPPGQAKARRT RMN+TINIV D+LL NL  D++SGKL LRSFSR+PGRVK+L+I+RR IVVKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

A0A6J1L0R6 uncharacterized protein LOC111499318 | 4.5e-78 | 75.8
Query:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD
        M +K+QARPLA  TD RPSSDD +  LHLK+I+R R IK   F++ LLVIL+V+V+ ILMFT+F+VKDP IQMN ISIT++ELING+IPKPGSN+SLTAD
Subjt:  MMEKEQARPLAPTTD-RPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVV-ILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI
        VSVKNPN+ASFKYSNTTTTLYINET IGEARGPPGQAKARRT RMN+TINIV D+LL NL  D++SGKL LRSFSR+PGRVKLL+IIRR IVVKMNCT  
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL--DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFI

Query:  INIINRSIEDQKCKRKVKL
        INI N+SIEDQ CKRKVK+
Subjt:  INIINRSIEDQKCKRKVKL

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G64450.1 Glycine-rich protein family | 3.5e-06 | 27.27
Query:  KRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVSVKNPNMASFKYSNTTTTLYINETAIGEA
        +R   R  +  C      L+IL V+++++ FTVF+ KDP+I +N + +    + N       +N S +  V+V+NPN A F + +++  L  +   +G  
Subjt:  KRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVSVKNPNMASFKYSNTTTTLYINETAIGEA

Query:  RGPPGQAKARRTSRMNITINI
          P G+  + R   M  T  +
Subjt:  RGPPGQAKARRTSRMNITINI

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 3.2e-44 | 43.69
Query:  MMEKEQARPLAPTTDRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGI--IPKPGSNMSLTAD
        M + E  RPLAP T  P SD+  + +     R R  IKC   + A  +IL  IV+ L+FTVF VKDP I+MNG+ +  ++ + G   +   G+N+S+  D
Subjt:  MMEKEQARPLAPTTDRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGI--IPKPGSNMSLTAD

Query:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANLDV-----NSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNC
        VSVKNPN ASFKYSNTTT +Y   T +GEA G PG+A+  RTSRMN+T++I+ D++L++  +      SG +++ S++R+ G+VK++ I+++ + VKMNC
Subjt:  VSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANLDV-----NSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNC

Query:  TFIINIINRSIEDQKCKRKVKL
        T  +NI  ++I+D  CK+K+ L
Subjt:  TFIINIINRSIEDQKCKRKVKL

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 8.0e-11 | 25.65
Query:  RRRLIKCC---GFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVSVKNPNMASFKYSNTTTTLYINETAIGEAR
        +RR+  CC   G +  L VI   +  +++  VF+ K P +Q    ++  +     +  +   N +LT ++ +KNPN+A F+Y      +Y  +T +G   
Subjt:  RRRLIKCC---GFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVSVKNPNMASFKYSNTTTTLYINETAIGEAR

Query:  GPPGQAKARRTSRMNITINIVTDQLLANL-----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFIINIINRSIEDQKCKRKVKL
         P     A+ +  +   + +  D+ +ANL     DV  GK+ + + +++PG++ LL I +  +    +C  ++   +  +EDQ C  K KL
Subjt:  GPPGQAKARRTSRMNITINIVTDQLLANL-----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFIINIINRSIEDQKCKRKVKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.4e-23 | 31.11
Query:  EKEQARP----LAPTTDRPSSDDEET--TLHLKRIRRRRLIK-CCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVEL-INGIIPKPGSNMS
        +KE+ +P    L P     SS + ++  T   K++RR+R  K C  F + L++++A+++VIL FT+F+ K P   ++ +++ R++  +N ++ K   N++
Subjt:  EKEQARP----LAPTTDRPSSDDEET--TLHLKRIRRRRLIK-CCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVEL-INGIIPKPGSNMS

Query:  LTADVSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVK
        L  D+S+KNPN   F Y +++  L      IGEA  P  +  AR+T  +NIT+ ++ D+LL+      DV +G + L +F ++ G+V +L I + ++   
Subjt:  LTADVSVKNPNMASFKYSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVK

Query:  MNCTFIINIINRSIEDQKCKRKVKL
         +C   I++ +R++  Q CK   KL
Subjt:  MNCTFIINIINRSIEDQKCKRKVKL

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 2.5e-20 | 32.68
Query:  MEKEQARPLAP--TTDRPSSDDEETTLHLKRIR----RRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISIT-RVELINGIIPKPGSNMS
        + ++QA+PLAP   T R    DEE   H  R +    + +LI CCGF+ +L +++AV  ++L  TVF +  P + ++ IS   R + +NG +     N +
Subjt:  MEKEQARPLAP--TTDRPSSDDEETTLHLKRIR----RRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISIT-RVELINGIIPKPGSNMS

Query:  LTADVSVKNPNMASFKYSNTTTTLYINE-TAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL-----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIV
        ++ ++S+ NPN A F   N   + Y  E   +GE+        A+RT +MN+T  IV  +LLA+L     D+N   + L+S   + GRVK + I R+ + 
Subjt:  LTADVSVKNPNMASFKYSNTTTTLYINE-TAIGEARGPPGQAKARRTSRMNITINIVTDQLLANL-----DVNSGKLSLRSFSRIPGRVKLLNIIRRRIV

Query:  VKMNC
        ++ +C
Subjt:  VKMNC


Sequences
CDS sequence
ATGATGGAAAAGGAGCAAGCGCGACCACTCGCCCCAACTACCGACCGTCCGAGCAGCGACGACGAGGAGACAACATTACACTTGAAGAGAATTCGACGAAGAAGACTCAT
AAAATGTTGTGGATTTATGGTTGCCCTTCTTGTAATACTAGCAGTAATAGTTGTCATCTTGATGTTCACTGTGTTTGAAGTTAAGGATCCTAGAATCCAAATGAACGGAA
TATCAATCACAAGAGTTGAGTTGATCAATGGTATCATTCCGAAGCCAGGGTCGAACATGTCGCTCACCGCAGACGTGTCTGTGAAAAACCCGAACATGGCGTCGTTCAAG
TATAGTAACACAACGACAACTCTATACATTAACGAGACCGCGATAGGGGAGGCCAGAGGGCCACCCGGGCAAGCGAAGGCACGACGAACATCGCGAATGAACATCACCAT
CAACATAGTCACCGATCAGCTCCTAGCGAATCTCGACGTCAACTCGGGAAAGCTGAGTTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGCTGTTGAATATTATAA
GAAGACGTATTGTTGTGAAAATGAACTGTACGTTCATTATCAATATCATAAACAGATCGATCGAGGACCAGAAATGCAAGAGGAAGGTGAAGCTCTAG
mRNA sequence
CAAATTTTTGGGCATTTCCCAAACAACTGCCTTACAAACCCAAGCACTTGAACAAAATCCCACAATCCAAATTTTCCAATTTTTCTTCTCCAACAATGATGGAAAAGGAG
CAAGCGCGACCACTCGCCCCAACTACCGACCGTCCGAGCAGCGACGACGAGGAGACAACATTACACTTGAAGAGAATTCGACGAAGAAGACTCATAAAATGTTGTGGATT
TATGGTTGCCCTTCTTGTAATACTAGCAGTAATAGTTGTCATCTTGATGTTCACTGTGTTTGAAGTTAAGGATCCTAGAATCCAAATGAACGGAATATCAATCACAAGAG
TTGAGTTGATCAATGGTATCATTCCGAAGCCAGGGTCGAACATGTCGCTCACCGCAGACGTGTCTGTGAAAAACCCGAACATGGCGTCGTTCAAGTATAGTAACACAACG
ACAACTCTATACATTAACGAGACCGCGATAGGGGAGGCCAGAGGGCCACCCGGGCAAGCGAAGGCACGACGAACATCGCGAATGAACATCACCATCAACATAGTCACCGA
TCAGCTCCTAGCGAATCTCGACGTCAACTCGGGAAAGCTGAGTTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGCTGTTGAATATTATAAGAAGACGTATTGTTG
TGAAAATGAACTGTACGTTCATTATCAATATCATAAACAGATCGATCGAGGACCAGAAATGCAAGAGGAAGGTGAAGCTCTAGACTTTCAGTGGCATGTATGGCAATGCT
TTTGATCAATGGTGATGGTTTGAGGGTTTATTTCTTTACAGTAGAATCGAGGGTACGTGATTATGTAAACCTTTATTATCAATCTCTGTCATACTTCAATAAAAACGTTT
TTATCATTAA
Protein sequence
MMEKEQARPLAPTTDRPSSDDEETTLHLKRIRRRRLIKCCGFMVALLVILAVIVVILMFTVFEVKDPRIQMNGISITRVELINGIIPKPGSNMSLTADVSVKNPNMASFK
YSNTTTTLYINETAIGEARGPPGQAKARRTSRMNITINIVTDQLLANLDVNSGKLSLRSFSRIPGRVKLLNIIRRRIVVKMNCTFIINIINRSIEDQKCKRKVKL
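
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption; `translate` and `CODON_TABLE` are illustrative names, not part of CuGenDB, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a coding sequence; optionally stop at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last six codons of the CDS listed above.
print(translate("ATGATGGAAAAGGAGCAAGCG"))  # MMEKEQA
print(translate("AGGAAGGTGAAGCTCTAG"))     # RKVKL
```

Both fragments agree with the start (MMEKEQA...) and end (...RKVKL) of the protein sequence listed above; passing the full CDS to `translate` would check the whole sequence the same way.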