CuGenDBv2

Tan0001428 (gene) of Snake gourd v1 genome

Gene ID: Tan0001428
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UPF0114 domain-containing protein
Genome location: LG02:87698192..87699749
RNA-Seq Expression: Tan0001428
Synteny: Tan0001428
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005134 - Uncharacterised protein family UPF0114
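The genome location field packs a sequence ID and a coordinate range into one string. A minimal sketch of how it could be split and measured follows; the helper name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end, span)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so both end points count.
    return seqid, start, end, end - start + 1


print(parse_location("LG02:87698192..87699749"))
```

Under that assumption the gene spans 1558 bp on LG02.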


Homology
GenBank top hits | e value | % identity
XP_011649594.1 uncharacterized protein LOC101218655 isoform X1 [Cucumis sativus] | 7.3e-110 | 78.31
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVA-AAPETVEIKTRELDLGSLLANLFVQLKTAVVK
        MA+TR  + V+P+AA VS+SSSSSS SS   VR L KTGLN NNGE  ITSG  ERRQ+V +KAA A AAP+TVE KT ELDLGSL+ANL +QLK  + K
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVA-AAPETVEIKTRELDLGSLLANLFVQLKTAVVK

Query:  TKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMK
        TKI++ +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGSFIV ESYLQYF+GLSQR+DQTHTVELLIEALDMFLVGTAL+VFGIGLFAMFVGSEKMK
Subjt:  TKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMK

Query:  EKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK
        +KN++  S SNLFGLFYMKKIPTWV MES+S AKSKIGHAVMMILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLS+LNV GGG  G+K
Subjt:  EKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata] | 3.0e-119 | 83.45
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV
        MA+TR  R V+PSA VV SSSSSSS S+AATVRCL KTGLNS NGE  +TSGD ERRQ+V LK  AA AAAPETVE +TRELDLGSLLANL VQLK   V
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV

Query:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM
        KTKIRR QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC++EGSFIVAESYLQYFNG+S+RSD++H VELLIE+LDMFLVGTALVVFG+GLFAMFVGSEKM
Subjt:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM

Query:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK
         EKN R +SGSNLFGLFYMK IPTWV MESVSEAKSKIGHAVMMILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLSRLN+ GG  GGYK
Subjt:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK

XP_023002007.1 uncharacterized protein LOC111496020 [Cucurbita maxima] | 1.2e-115 | 81.82
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK---AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAV
        MA+TR  R V+PSA VV  SSSSSS S+AA VRCL KTGLNS NGE  ITSGD ERRQ+V LK   AA AAAPETVE KTRELDLGSLLANL VQLK   
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK---AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAV

Query:  VKTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEK
        VK KIRR QIQKFIEKIII+CRFFTLFAVAGSLLGSILC++EGSFIVAESYLQYFN +S+RSD++H VELLIE+LDMFLVGTALVVFG+GLFAMFVGSEK
Subjt:  VKTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEK

Query:  MKEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK
        M EKNRR +SGSNLFGLFYMK IPTWV MESVSEAKSKIGHAVMMILQVGVLEK KSIPL+SAADLACFAAA+LI SASIFFLSRLN+ GG   GYK
Subjt:  MKEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo] | 4.6e-120 | 84.12
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV
        MA+TR FR V+PSA VV SSSSSSS S+AATVRCL KTGLNS NGE  ITSGD ER+  V LK  AA AAAPETVE KTRELDLGSLLANL VQLK  VV
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV

Query:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM
        KTKIRR QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC++EGSFIVAESYLQYFNG+S+RSD++H VELLIE+LDMFLVGTALVVFG+GLFAMFVGSEKM
Subjt:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM

Query:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK
         EKN+R +SGSNLFGLFYMK IPTWV MESVSEAKSKIGHAVMMILQVGVLEKFKSIPL+SAADLACFA A+LISSASIFFLSRLN+ GG  GGYK
Subjt:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida] | 2.1e-112 | 79.59
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT
        M +TR  + ++P++A VSSSSSSSS SSA TVRCL KTGLN NNGE  ITSGD ER+Q+VA+KA  AAAP+TVE +T EL+LGSLLANL VQLKT V KT
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT

Query:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE
        KI+R QIQKFIEKIIIDCRFFTL AVAGSLLGSILCY+EGSFIVAESYLQYF+GLSQ S+Q HTVELLIEALDMFLVGTALVVFG+GLFAMF+GS KMKE
Subjt:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE

Query:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK
        KNR +ISGSN FGLF MKKIPTWV MES+S+AKSKIGHAVMMILQVGVLEKFK+IPL+SA DLACFAAAV++SSASIFFLS+LN+ GG  GG+K
Subjt:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK

TrEMBL top hits | e value | % identity
A0A0A0LLC9 Uncharacterized protein | 3.5e-110 | 78.31
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVA-AAPETVEIKTRELDLGSLLANLFVQLKTAVVK
        MA+TR  + V+P+AA VS+SSSSSS SS   VR L KTGLN NNGE  ITSG  ERRQ+V +KAA A AAP+TVE KT ELDLGSL+ANL +QLK  + K
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVA-AAPETVEIKTRELDLGSLLANLFVQLKTAVVK

Query:  TKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMK
        TKI++ +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGSFIV ESYLQYF+GLSQR+DQTHTVELLIEALDMFLVGTAL+VFGIGLFAMFVGSEKMK
Subjt:  TKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMK

Query:  EKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK
        +KN++  S SNLFGLFYMKKIPTWV MES+S AKSKIGHAVMMILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLS+LNV GGG  G+K
Subjt:  EKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK

A0A5A7VG09 UPF0114 domain-containing protein | 1.3e-109 | 78.91
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT
        MA+TR  + V+P+AA VSSSSSSSS SS   VR L KTGLN NNGE  ITSG  E RQ+VA+KAA   AP+TVE KT ELDLGSL+++L VQLKT + KT
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT

Query:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE
        KI++ +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGSFIVAESYLQYF+ LSQR++QTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE
Subjt:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE

Query:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNV--SGGGGYK
        KNR+ IS SNLFGLFYMKKIPTWV MES+S AKSKIGHAVMMILQVGVLEKFK+IPL+SA DLACFAAAVLISSASIFFLS+LNV   G GG+K
Subjt:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNV--SGGGGYK

A0A6J1CTB3 uncharacterized protein LOC111014021 isoform X1 | 2.3e-109 | 78.97
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT
        MA+TR FRP++PSA+VVSSS+S S A+   TVRC+ +T    NNGE  +TSGD ERR+MV +KAAV AAPETV+ KTRELDLGSLLANL V+LKTAV KT
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKT

Query:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE
        K     IQ FIEK IIDCRFFTLFAVAGSLLGSILCY+EGSFIVAESYLQYF+GLSQ+SDQ HTVELLI+A+DMFLVGTAL VFG+GLFAMFVG EKMKE
Subjt:  KIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKE

Query:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGGG
        +NR   SGSNLFGLFYMKK+PTWV MESVS  KSKIGHAV+MILQVGVLEKFKSIPL SAADLACFAAAVLISSASIFFLS+LN  GGGG
Subjt:  KNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGGG

A0A6J1GGV4 uncharacterized protein LOC111454079 | 1.4e-119 | 83.45
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV
        MA+TR  R V+PSA VV SSSSSSS S+AATVRCL KTGLNS NGE  +TSGD ERRQ+V LK  AA AAAPETVE +TRELDLGSLLANL VQLK   V
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK--AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVV

Query:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM
        KTKIRR QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC++EGSFIVAESYLQYFNG+S+RSD++H VELLIE+LDMFLVGTALVVFG+GLFAMFVGSEKM
Subjt:  KTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKM

Query:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK
         EKN R +SGSNLFGLFYMK IPTWV MESVSEAKSKIGHAVMMILQVGVLEKFKSIPL+SA DLACFAAA+LISSASIFFLSRLN+ GG  GGYK
Subjt:  KEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGG--GGYK

A0A6J1KI88 uncharacterized protein LOC111496020 | 5.6e-116 | 81.82
Query:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK---AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAV
        MA+TR  R V+PSA VV  SSSSSS S+AA VRCL KTGLNS NGE  ITSGD ERRQ+V LK   AA AAAPETVE KTRELDLGSLLANL VQLK   
Subjt:  MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALK---AAVAAAPETVEIKTRELDLGSLLANLFVQLKTAV

Query:  VKTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEK
        VK KIRR QIQKFIEKIII+CRFFTLFAVAGSLLGSILC++EGSFIVAESYLQYFN +S+RSD++H VELLIE+LDMFLVGTALVVFG+GLFAMFVGSEK
Subjt:  VKTKIRRLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEK

Query:  MKEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK
        M EKNRR +SGSNLFGLFYMK IPTWV MESVSEAKSKIGHAVMMILQVGVLEK KSIPL+SAADLACFAAA+LI SASIFFLSRLN+ GG   GYK
Subjt:  MKEKNRRLISGSNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGG--GYK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT4G19390.1 Uncharacterised protein family (UPF0114) | 9.7e-36 | 39.18
Query:  AVVKTKIRRLQ-IQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVG
        AV      R + +++ IEK+I  CRF T     GSLLGS+LC+++G   V +S+LQY        ++   + LL+EA+D++L+GT ++VFG+GL+ +F+ 
Subjt:  AVVKTKIRRLQ-IQKFIEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVG

Query:  S-EKMKEKNRRLISG-SNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVS
        + +  + +   ++S  S+LFG+F +K+ P W+ ++SVSE K+K+GH ++M+L +G+ +K K + +TS  DL C + ++  SSA +F LSRLN S
Subjt:  S-EKMKEKNRRLISG-SNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVS

AT5G13720.1 Uncharacterised protein family (UPF0114) | 6.1e-30 | 35.8
Query:  IEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFV--GSEKMKEKNRRLISG
        +E+II D RF  L AV GSL GS+LC++ G   + E+Y  Y+   S+       V  L+EA+D++L GT +++F +GL+ +F+      +  ++ R +  
Subjt:  IEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFV--GSEKMKEKNRRLISG

Query:  SNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLN
        S+LFG+F MK+ P W+ + S+ E K+K+GH ++MIL V + E+ K + + +  DL  ++  + +SSAS++ L  L+
Subjt:  SNLFGLFYMKKIPTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLN
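Each alignment block above carries a midline between the Query and Subjt rows: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below shows one way to count these from a query row and its midline. The function name `midline_stats` is hypothetical, and the % identity values reported on this page may have been computed over a different alignment length, so the counts are illustrative only.

```python
def midline_stats(query, midline):
    """Return (identities, positives, columns) for one aligned block."""
    # Pad the midline: trailing spaces are often lost when text is copied.
    midline = midline.ljust(len(query))
    # An identity is a column where the midline repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    # Positives are identities plus '+' columns, i.e. every non-space mark.
    positives = sum(1 for m in midline[: len(query)] if m != " ")
    return identities, positives, len(query)


# First ten columns of the first GenBank alignment on this page.
ident, pos, cols = midline_stats("MASTRSFRPV", "MA+TR  + V")
print(f"{ident}/{cols} identical, {pos}/{cols} positive")
```

Summing the three counts over all blocks of a hit and dividing identities by columns gives a percent identity for the aligned region.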


Sequences
CDS sequence
ATGGCATCCACTAGATCGTTCCGGCCGGTAAAGCCTTCAGCCGCCGTCGTGTCTTCCTCTTCTTCTTCTTCATCCGCGTCGTCGGCGGCGACTGTGAGGTGTTTGAGCAA
AACGGGGTTAAATTCAAACAACGGCGAATGGTCAATAACTTCCGGCGACGTCGAGAGAAGGCAGATGGTCGCCCTGAAGGCGGCGGTGGCGGCGGCTCCCGAGACTGTGG
AAATTAAAACCAGAGAACTCGATTTGGGTTCTTTGCTGGCGAATCTGTTCGTTCAATTGAAGACCGCTGTGGTGAAAACGAAGATTCGGAGGCTACAGATTCAGAAGTTC
ATCGAAAAGATCATAATCGACTGCCGATTCTTCACATTGTTCGCCGTCGCCGGATCTTTATTGGGTTCGATCCTCTGTTACGTGGAGGGGAGCTTTATCGTTGCAGAGTC
ATATCTGCAGTATTTCAATGGTCTTTCGCAGAGGTCGGATCAAACTCATACGGTGGAGCTTTTAATTGAAGCGTTAGATATGTTCCTCGTCGGAACTGCTCTGGTTGTTT
TTGGGATCGGATTGTTCGCAATGTTCGTCGGATCGGAGAAGATGAAGGAAAAAAACCGGCGTTTGATTTCTGGGTCGAATTTGTTTGGTCTGTTCTACATGAAGAAAATT
CCGACGTGGGTGGCAATGGAATCGGTGTCGGAGGCGAAGTCGAAGATCGGACATGCGGTGATGATGATACTGCAAGTGGGTGTGTTAGAGAAGTTCAAGAGTATTCCTTT
GACCTCTGCCGCCGATCTCGCGTGTTTTGCCGCCGCCGTTCTGATTTCCTCCGCTTCCATCTTCTTCCTCTCCAGACTTAACGTGAGCGGCGGAGGCGGTTACAAGTGA
mRNA sequence
GTAAATCTTCACGTCCGGACAAATTCCGGCCAAAATATGCTGCGAATCTTCCTTCTCCTTTAGATGGTATAAAAAAACACACCAAATTTTGGCCATAATTAAAAAACAAA
AACAATGGCATCCACTAGATCGTTCCGGCCGGTAAAGCCTTCAGCCGCCGTCGTGTCTTCCTCTTCTTCTTCTTCATCCGCGTCGTCGGCGGCGACTGTGAGGTGTTTGA
GCAAAACGGGGTTAAATTCAAACAACGGCGAATGGTCAATAACTTCCGGCGACGTCGAGAGAAGGCAGATGGTCGCCCTGAAGGCGGCGGTGGCGGCGGCTCCCGAGACT
GTGGAAATTAAAACCAGAGAACTCGATTTGGGTTCTTTGCTGGCGAATCTGTTCGTTCAATTGAAGACCGCTGTGGTGAAAACGAAGATTCGGAGGCTACAGATTCAGAA
GTTCATCGAAAAGATCATAATCGACTGCCGATTCTTCACATTGTTCGCCGTCGCCGGATCTTTATTGGGTTCGATCCTCTGTTACGTGGAGGGGAGCTTTATCGTTGCAG
AGTCATATCTGCAGTATTTCAATGGTCTTTCGCAGAGGTCGGATCAAACTCATACGGTGGAGCTTTTAATTGAAGCGTTAGATATGTTCCTCGTCGGAACTGCTCTGGTT
GTTTTTGGGATCGGATTGTTCGCAATGTTCGTCGGATCGGAGAAGATGAAGGAAAAAAACCGGCGTTTGATTTCTGGGTCGAATTTGTTTGGTCTGTTCTACATGAAGAA
AATTCCGACGTGGGTGGCAATGGAATCGGTGTCGGAGGCGAAGTCGAAGATCGGACATGCGGTGATGATGATACTGCAAGTGGGTGTGTTAGAGAAGTTCAAGAGTATTC
CTTTGACCTCTGCCGCCGATCTCGCGTGTTTTGCCGCCGCCGTTCTGATTTCCTCCGCTTCCATCTTCTTCCTCTCCAGACTTAACGTGAGCGGCGGAGGCGGTTACAAG
TGAACTGCCCCCAGTGGCGGCGCGTTGGTCTAGGTTGGCCTCCACAAATATATGTAATTTTTTTTAGAAGTTTTGGCTGAGGGAAAGGAGAGGTTACCCATTCTTTTATT
ATTATTAATTATTAGTTTCCTAAAAGACCTATTTGGTAAGCGATCTAAACAAAAAAATGTATTTGGGAGTAGATTTAGAAATTTGATTCTATT
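The mRNA sequence contains the CDS plus flanking untranslated regions. A minimal sketch of how the UTR lengths could be measured by locating the CDS inside the mRNA follows. The function name `utr_lengths` is hypothetical, the two strings are short fragments taken around the start codon (assuming the wrapped lines join without gaps), and the printed numbers therefore describe the fragments, not the full transcript.

```python
def utr_lengths(mrna, cds):
    """Return (five_prime_len, three_prime_len) of the UTRs around cds."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    # Everything before the CDS is 5' UTR, everything after is 3' UTR.
    return start, len(mrna) - start - len(cds)


# Truncated fragments for illustration; paste the full sequences to
# measure the real UTRs.
mrna_fragment = "ACAAAAACAATGGCATCCACTAGATCG"
cds_fragment = "ATGGCATCCACTAGATCG"
print(utr_lengths(mrna_fragment, cds_fragment))
```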
Protein sequence
MASTRSFRPVKPSAAVVSSSSSSSSASSAATVRCLSKTGLNSNNGEWSITSGDVERRQMVALKAAVAAAPETVEIKTRELDLGSLLANLFVQLKTAVVKTKIRRLQIQKF
IEKIIIDCRFFTLFAVAGSLLGSILCYVEGSFIVAESYLQYFNGLSQRSDQTHTVELLIEALDMFLVGTALVVFGIGLFAMFVGSEKMKEKNRRLISGSNLFGLFYMKKI
PTWVAMESVSEAKSKIGHAVMMILQVGVLEKFKSIPLTSAADLACFAAAVLISSASIFFLSRLNVSGGGGYK
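The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and uses a hypothetical helper, `translate`; it is shown on the first 30 and last 12 nucleotides of the CDS above rather than the whole sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0; stop codons are returned as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGCATCCACTAGATCGTTCCGGCCGGTA"))  # start of the CDS
print(translate("GGTTACAAGTGA"))                    # end of the CDS
```

The two fragments translate to `MASTRSFRPV` and `GYK*`, which match the first and last residues of the protein sequence, with `*` marking the stop codon.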