; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006796 (gene) of Snake gourd v1 genome

Gene IDTan0006796
OrganismTrichosanthes anguina (Snake gourd v1)
Description60S ribosomal protein L44
Genome locationLG06:4898467..4901332
RNA-Seq ExpressionTan0006796
SyntenyTan0006796
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR040411 - Uncharacterized protein At5g23160-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152887.1 uncharacterized protein LOC111020508 [Momordica charantia]7.8e-8060.07Show/hide
Query:  KSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQDGEERPE
        KSKPQ  LCCFGFSGKLRR KP+KPAA H KRPFSW++FH KPP  V S +LA+  +S  DSDRLA  S+++ T YK  +++   V P A NQ GE RPE
Subjt:  KSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQDGEERPE

Query:  KVANDIVAEKVIFESE-HSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST----AISHSLSFPPPNAARTCRFNEPSASMRVGVTNDE
        KV N+ V E++I E+  HS++PKKP  ESQ+RFSLTR+L SFRSGRF +P SPT +K+  +T    AIS SLSFP  N  R  +   P  SMRVG+  DE
Subjt:  KVANDIVAEKVIFESE-HSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST----AISHSLSFPPPNAARTCRFNEPSASMRVGVTNDE

Query:  TSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
         SK YRSAA +SI I+TL+IM+LWG++CAIL TA WI IV SL SIVEEDD+  DFI SDSY +G K KLV+LKGFL R+QRE + KK
Subjt:  TSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

XP_022954035.1 uncharacterized protein LOC111456417 [Cucurbita moschata]2.3e-7960.61Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KS P PFLCCFGFSGK RR KP+KPAA H+K P SW++FH K PP   S  + + N S  DSDRL+ +S+S   A+ S+ D F + +PVATN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR
         G E   RPE V NDI+ EK+I ES E  NSPKKP+  S+SRFSLTRKL SFRSGRFAQPASPT KKNLKST++ S ++S  PP   R  +  EP ASMR
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR

Query:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
        VG+   N ET + Y+S AG+SIL+ITL IMV WG+LCAIL TA WI +VTSLRSI+ ED D + F ESDSY +GFK KLVVLKGF+ RNQ +   KK
Subjt:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

XP_022991489.1 uncharacterized protein LOC111488090 [Cucurbita maxima]1.1e-7357.91Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KS P PFLCCFGFSGK RR KP+KPAA H+K P SW++FH K PP   S    + N S  DSDRL+ +S+S   A+ S+ D F + +PVATN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR
         G E   RPE V NDI+ EK+I E+ E  NSPKK +  S+SRFS TRKL SFRSGRFAQP SPT KKNLKST + S ++S    N  R  +  E  ASM+
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR

Query:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
        V +   N E  + YRS AG+SIL+ITL IM++WG+LCAIL TA WI +VTSLRSIV ED D + F +SDSY +GFK KLVVLKGFL RNQ +   KK
Subjt:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

XP_023549176.1 uncharacterized protein LOC111807611 [Cucurbita pepo subsp. pepo]2.1e-8061.28Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KS P PFLCCFGFSGK RR KP+KPAA H+K P SW++FH K PP   S      N S  DSDRL+ +S++ A A+ S+ D F + +PVATN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR
         GEE   RPE V NDI+ EK+I ES E  NSPKKP+  S+SRFSLTRKL SFRSGRFAQPASPT KKNLKST + S ++S  PP   R  +  EP ASMR
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR

Query:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
        +G+   N ET + Y+S AG+SIL+ITL IMV+WG+LCAIL TA WI +VTSLRSIV ED D + F ESDSY +GFK KLVVLKGFL RNQ +   KK
Subjt:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

XP_038897219.1 uncharacterized protein LOC120085350 isoform X1 [Benincasa hispida]6.2e-8562.46Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KSKP  FL CFGFSGKLRR KP+K +AG +KRPFSW+ FH KPPP VHS++  + N+S  DSDRL+  S++     KSS ++F + + V TN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST-----AISHSLSFPPPNAARTCRFNEPS
         G+E     ++V NDI+AEK+I ES E SNS KKPA +SQSRFSLT+KL SFRS RF QPASP  KKNLKST      ISHSLSFPPPN AR  R NE  
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST-----AISHSLSFPPPNAARTCRFNEPS

Query:  ASMRVGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFK
         S R G+   N ETS+ YRSAA +S+L++TLA+MVLWG++CAIL TA WIFIVTSLR+IVEE  D IDF+ESDSY +GFK KLVVLKGFL RN RE L K
Subjt:  ASMRVGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFK

Query:  K
        +
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0K6H3 Uncharacterized protein2.8e-6754.76Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVL-AEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATN
        MAKS N KSKP  FLCCFG+S KL R KP+K  AGH+KR FSW + + KPP  +HS+   +  N+S  +SDRL+  S++      SS ++  L +PVATN
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVL-AEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATN

Query:  QDGEE---RPEKVANDIVAEKVIF-ESEHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASP---TVKKNLKSTAISHSLSFPPPNAARTCRFNEPSA
        + GEE    P +V ND VA K I   SEHSNSP KP  +SQSRFSLT++L SFRS RF Q A P   T   N+++  ISHSLSFPPP    + R +E   
Subjt:  QDGEE---RPEKVANDIVAEKVIF-ESEHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASP---TVKKNLKSTAISHSLSFPPPNAARTCRFNEPSA

Query:  SMRV--GVTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQR
        S RV     N ++S+ YRS A +S+L++TLA+MV+WG++CAIL TA WIFIVTSLRSIVEE  + IDF+ESDSY +GFK KLVVLKGF+ RN +
Subjt:  SMRV--GVTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQR

A0A1S3CFR9 uncharacterized protein At5g231601.0e-7257.63Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KSKP  FL CFGFS KLRR KP K  AGH+KRPFSW + + KPP  VHS+  +  N S  +SDRL+  S++     KSS ++  + +PVAT +
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST-----AISHSLSFPPPNAARTCRFNEPS
         G+E    P +V NDIVAEK I ES EHSN P K   +SQSRFSLT+KL SFR  RF Q ASP  KKN KST      ISHSLSFPPP  A   + +EPS
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST-----AISHSLSFPPPNAARTCRFNEPS

Query:  ASMRVG--VTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQR
         S + G    N ++S+ YRSA  +S+L++TLA+MVLWG++CAIL TA WIF+VTSLRSIVEE  D IDF+ESDSY +GFK KLVVLKGF+ RN +
Subjt:  ASMRVG--VTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQR

A0A6J1DHG8 uncharacterized protein LOC1110205083.8e-8060.07Show/hide
Query:  KSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQDGEERPE
        KSKPQ  LCCFGFSGKLRR KP+KPAA H KRPFSW++FH KPP  V S +LA+  +S  DSDRLA  S+++ T YK  +++   V P A NQ GE RPE
Subjt:  KSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQDGEERPE

Query:  KVANDIVAEKVIFESE-HSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST----AISHSLSFPPPNAARTCRFNEPSASMRVGVTNDE
        KV N+ V E++I E+  HS++PKKP  ESQ+RFSLTR+L SFRSGRF +P SPT +K+  +T    AIS SLSFP  N  R  +   P  SMRVG+  DE
Subjt:  KVANDIVAEKVIFESE-HSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKST----AISHSLSFPPPNAARTCRFNEPSASMRVGVTNDE

Query:  TSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
         SK YRSAA +SI I+TL+IM+LWG++CAIL TA WI IV SL SIVEEDD+  DFI SDSY +G K KLV+LKGFL R+QRE + KK
Subjt:  TSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

A0A6J1GRB9 uncharacterized protein LOC1114564171.1e-7960.61Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KS P PFLCCFGFSGK RR KP+KPAA H+K P SW++FH K PP   S  + + N S  DSDRL+ +S+S   A+ S+ D F + +PVATN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR
         G E   RPE V NDI+ EK+I ES E  NSPKKP+  S+SRFSLTRKL SFRSGRFAQPASPT KKNLKST++ S ++S  PP   R  +  EP ASMR
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR

Query:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
        VG+   N ET + Y+S AG+SIL+ITL IMV WG+LCAIL TA WI +VTSLRSI+ ED D + F ESDSY +GFK KLVVLKGF+ RNQ +   KK
Subjt:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

A0A6J1JV00 uncharacterized protein LOC1114880905.3e-7457.91Show/hide
Query:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ
        MAKS N KS P PFLCCFGFSGK RR KP+KPAA H+K P SW++FH K PP   S    + N S  DSDRL+ +S+S   A+ S+ D F + +PVATN+
Subjt:  MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQ

Query:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR
         G E   RPE V NDI+ EK+I E+ E  NSPKK +  S+SRFS TRKL SFRSGRFAQP SPT KKNLKST + S ++S    N  R  +  E  ASM+
Subjt:  DGEE---RPEKVANDIVAEKVIFES-EHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAI-SHSLSFPPPNAARTCRFNEPSASMR

Query:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK
        V +   N E  + YRS AG+SIL+ITL IM++WG+LCAIL TA WI +VTSLRSIV ED D + F +SDSY +GFK KLVVLKGFL RNQ +   KK
Subjt:  VGV--TNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G08240.1 unknown protein1.9e-0725.5Show/hide
Query:  FLCCFGFSGKLRRLKPI--KPAAGHKK--------RPFSWIKF-----HFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPV--
        FL CFGFS K+   KP+  K   G KK        R F   KF       KP P   +     + +   D  +  L  +   T  K+        +PV  
Subjt:  FLCCFGFSGKLRRLKPI--KPAAGHKK--------RPFSWIKF-----HFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPV--

Query:  -ATNQDGEERPEKVANDIVAEKVIFESEHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAISHSLSFPPPNAARTCRFNEPSASMR
         A NQ+ +E   K   DI  ++        + P +P             LGSF+          T  + + S +  +      P  +R            
Subjt:  -ATNQDGEERPEKVANDIVAEKVIFESEHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAISHSLSFPPPNAARTCRFNEPSASMR

Query:  VGVTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIV------EEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETL
            N    K +    G+SI+I+TL IM++WG+LCAIL T+ W +++  +R              ++  + S+SY      + VVL GFL R  R +L
Subjt:  VGVTNDETSKNYRSAAGLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLRSIV------EEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETL

AT5G23160.1 unknown protein4.4e-0437.78Show/hide
Query:  GLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLR------SIVEEDDDAIDFIESDSYFQG--------FKNKLVVLKGFLNRNQRETL
        G+SI+++TL IM+ WG+LCAIL T+ W +I   L+      ++V          E  S FQG        +K K VVL+GFL R  R ++
Subjt:  GLSILIITLAIMVLWGKLCAILGTAMWIFIVTSLR------SIVEEDDDAIDFIESDSYFQG--------FKNKLVVLKGFLNRNQRETL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAATCTCTGAATCCCAAATCCAAACCTCAACCCTTTCTCTGCTGCTTCGGATTTTCCGGCAAGCTTCGCCGCTTAAAGCCTATCAAACCCGCCGCCGGTCACAA
AAAACGTCCGTTTTCTTGGATAAAGTTTCACTTCAAGCCGCCGCCGGCGGTTCACTCTACTGTTCTGGCCGAGCAGAACCAGTCCAGGCTGGATTCCGACCGGCTCGCCT
TGAGATCTGTTTCTACGGCCACCGCTTATAAGTCGTCGACTGATGAATTTCCGCTCGTGTTGCCGGTGGCGACGAATCAAGACGGCGAGGAGAGGCCGGAAAAGGTGGCA
AATGACATCGTTGCAGAAAAAGTCATCTTTGAAAGTGAACACTCGAACTCGCCAAAGAAACCCGCCCATGAAAGTCAATCAAGGTTCTCACTAACCCGAAAACTCGGATC
GTTTCGATCGGGCCGGTTCGCTCAACCCGCCTCACCGACGGTGAAAAAGAATCTGAAGTCCACTGCCATATCACACTCTCTCTCATTCCCTCCACCGAACGCCGCTCGCA
CGTGCCGGTTCAATGAACCGTCGGCGAGCATGCGAGTCGGGGTTACCAACGACGAGACGAGCAAAAATTACCGGTCGGCGGCGGGCTTGTCGATTTTGATCATCACACTG
GCGATCATGGTACTATGGGGAAAATTGTGTGCCATTTTAGGTACGGCAATGTGGATTTTTATTGTTACGAGCTTAAGGTCAATTGTTGAAGAAGATGACGATGCAATTGA
TTTTATTGAATCGGATTCATATTTTCAAGGGTTTAAGAACAAATTAGTGGTTTTAAAAGGGTTTCTTAATAGAAATCAGAGAGAGACTCTGTTCAAGAAAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAATCTCTGAATCCCAAATCCAAACCTCAACCCTTTCTCTGCTGCTTCGGATTTTCCGGCAAGCTTCGCCGCTTAAAGCCTATCAAACCCGCCGCCGGTCACAA
AAAACGTCCGTTTTCTTGGATAAAGTTTCACTTCAAGCCGCCGCCGGCGGTTCACTCTACTGTTCTGGCCGAGCAGAACCAGTCCAGGCTGGATTCCGACCGGCTCGCCT
TGAGATCTGTTTCTACGGCCACCGCTTATAAGTCGTCGACTGATGAATTTCCGCTCGTGTTGCCGGTGGCGACGAATCAAGACGGCGAGGAGAGGCCGGAAAAGGTGGCA
AATGACATCGTTGCAGAAAAAGTCATCTTTGAAAGTGAACACTCGAACTCGCCAAAGAAACCCGCCCATGAAAGTCAATCAAGGTTCTCACTAACCCGAAAACTCGGATC
GTTTCGATCGGGCCGGTTCGCTCAACCCGCCTCACCGACGGTGAAAAAGAATCTGAAGTCCACTGCCATATCACACTCTCTCTCATTCCCTCCACCGAACGCCGCTCGCA
CGTGCCGGTTCAATGAACCGTCGGCGAGCATGCGAGTCGGGGTTACCAACGACGAGACGAGCAAAAATTACCGGTCGGCGGCGGGCTTGTCGATTTTGATCATCACACTG
GCGATCATGGTACTATGGGGAAAATTGTGTGCCATTTTAGGTACGGCAATGTGGATTTTTATTGTTACGAGCTTAAGGTCAATTGTTGAAGAAGATGACGATGCAATTGA
TTTTATTGAATCGGATTCATATTTTCAAGGGTTTAAGAACAAATTAGTGGTTTTAAAAGGGTTTCTTAATAGAAATCAGAGAGAGACTCTGTTCAAGAAAGGATAG
Protein sequenceShow/hide protein sequence
MAKSLNPKSKPQPFLCCFGFSGKLRRLKPIKPAAGHKKRPFSWIKFHFKPPPAVHSTVLAEQNQSRLDSDRLALRSVSTATAYKSSTDEFPLVLPVATNQDGEERPEKVA
NDIVAEKVIFESEHSNSPKKPAHESQSRFSLTRKLGSFRSGRFAQPASPTVKKNLKSTAISHSLSFPPPNAARTCRFNEPSASMRVGVTNDETSKNYRSAAGLSILIITL
AIMVLWGKLCAILGTAMWIFIVTSLRSIVEEDDDAIDFIESDSYFQGFKNKLVVLKGFLNRNQRETLFKKG