CuGenDBv2

Moc07g24530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g24530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:18295843..18297995
RNA-Seq Expression: Moc07g24530
Synteny: Moc07g24530
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
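The genome location string above packs a chromosome name and two coordinates into one field. A minimal sketch of splitting it apart and computing the span length follows. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive (not stated on this page).
    return end - start + 1


chrom, start, end = parse_location("chr7:18295843..18297995")
print(chrom, start, end, span_length(start, end))  # chr7 18295843 18297995 2153
```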


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 2.6e-107 | % identity: 84.8
Query:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP
        KGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASF+TLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN AVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP

Query:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHLKRKRKKKK
        IESSRPNSELAMVCGFASNVK KSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKRPR QTEAV+V  LGEEVREE  LKR+RKKKK
Subjt:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHLKRKRKKKK

Query:  ITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQ
         TSPLEVGARG LPASFAD VDDPEARM GT DVT RFRVEPSSSGVRDQ
Subjt:  ITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 8.2e-114 | % identity: 81.34
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS

Query:  NVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEV
         VK KSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESSGGPSREKRPR QTEAV+  +   +V
Subjt:  NVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEV

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 1.2e-101 | % identity: 74.04
Query:  GTSDVTARFRVEPSSSGVRDQVAR------SNCL-----------------LGVFPQAFVASIQSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKD
        G   + A+ R+EPSSSGVRDQV+R        CL                 +    +AFVASIQS LAVKAELD RE LAAREKEEFSA LE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVAR------SNCL-----------------LGVFPQAFVASIQSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKD

Query:  ELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKHVTAELETTKERLSNGALLEESFRQHPDFD
        ELLKAHSEV+ LKAEVE++AELLKKEEDRR+AQLRA HAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+H TAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKHVTAELETTKERLSNGALLEESFRQHPDFD

Query:  GFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGTPQTGS
        GF KD SDAGFKFLMKGIASDM DLQIDL GLK+RY E+WASGP GTPGPQALVD+YVRDL+SDYSD EEDQVG+TQEG   TGS
Subjt:  GFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGTPQTGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 1.1e-163 | % identity: 85.35
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPESILLRIPEEGERADNPPEGWITLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPE+ILLR+PEEGERADNPPEGW+TLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPESILLRIPEEGERADNPPEGWITLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFASNVKCK
        S EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSELAMVCGFAS VK K
Subjt:  SEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFASNVKCK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVN
        SKGRAHALEAAQSS+PATPAV GPASEDPA VIELESSGGPSREKRPR QTEAV+
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVN

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 7.8e-173 | % identity: 65.21
Query:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP
        KG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+F+TLK+YK+HFPR RK+ TLVTDKLLLESGLLDYN  VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP

Query:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHL
        IE+SRPNSELAMVCGF  +VK KSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ SGG S EKR R ++EA++V  L  EVR E+ L
Subjt:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHL

Query:  KRKRKKKKITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQVAR--SNCL---------------------LGVFPQAFVASI
        +R+RKKKK +S  E GARG LP S ADLVDDPEARM GTS+V  RF +EPSSSGV+DQV+R  + CL                     +    +AF+ASI
Subjt:  KRKRKKKKITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQVAR--SNCL---------------------LGVFPQAFVASI

Query:  QSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKDELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALE
           + VKAELD REALAA+E+E   A LEAA +T+K ELLKA  EVDIL+AEV+AK +LLKKE ++ KA LRA HAITKGLEKEKFQLLKEKDD+ Q LE
Subjt:  QSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKDELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALE

Query:  AKDEELKHVTAELETTKERLSNGALLEESFRQHPDFDGFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNS
         KD  +  +T EL+  KERL+NG LLEESFRQHPDFDGF KD SDAGFKFLMKGIA+DM  LQIDL GLKK+Y E+WASGP+GTP PQ+LVDKYVR+L+S
Subjt:  AKDEELKHVTAELETTKERLSNGALLEESFRQHPDFDGFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNS

Query:  DYSDLEED--------QVGTTQEGTP
        DYSD+EE+        +VGTTQE  P
Subjt:  DYSDLEED--------QVGTTQEGTP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 1.2e-107 | % identity: 84.8
Query:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP
        KGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASF+TLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN AVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP

Query:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHLKRKRKKKK
        IESSRPNSELAMVCGFASNVK KSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKRPR QTEAV+V  LGEEVREE  LKR+RKKKK
Subjt:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHLKRKRKKKK

Query:  ITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQ
         TSPLEVGARG LPASFAD VDDPEARM GT DVT RFRVEPSSSGVRDQ
Subjt:  ITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 4.0e-114 | % identity: 81.34
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS

Query:  NVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEV
         VK KSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESSGGPSREKRPR QTEAV+  +   +V
Subjt:  NVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEV

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 6.0e-102 | % identity: 74.04
Query:  GTSDVTARFRVEPSSSGVRDQVAR------SNCL-----------------LGVFPQAFVASIQSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKD
        G   + A+ R+EPSSSGVRDQV+R        CL                 +    +AFVASIQS LAVKAELD RE LAAREKEEFSA LE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVAR------SNCL-----------------LGVFPQAFVASIQSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKD

Query:  ELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKHVTAELETTKERLSNGALLEESFRQHPDFD
        ELLKAHSEV+ LKAEVE++AELLKKEEDRR+AQLRA HAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+H TAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKHVTAELETTKERLSNGALLEESFRQHPDFD

Query:  GFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGTPQTGS
        GF KD SDAGFKFLMKGIASDM DLQIDL GLK+RY E+WASGP GTPGPQALVD+YVRDL+SDYSD EEDQVG+TQEG   TGS
Subjt:  GFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVGTTQEGTPQTGS

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 5.5e-164 | % identity: 85.35
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPESILLRIPEEGERADNPPEGWITLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPE+ILLR+PEEGERADNPPEGW+TLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPESILLRIPEEGERADNPPEGWITLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI+FWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFASNVKCK
        S EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSELAMVCGFAS VK K
Subjt:  SEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFASNVKCK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVN
        SKGRAHALEAAQSS+PATPAV GPASEDPA VIELESSGGPSREKRPR QTEAV+
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPSREKRPRGQTEAVN

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 3.8e-173 | % identity: 65.21
Query:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP
        KG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+F+TLK+YK+HFPR RK+ TLVTDKLLLESGLLDYN  VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRP

Query:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHL
        IE+SRPNSELAMVCGF  +VK KSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ SGG S EKR R ++EA++V  L  EVR E+ L
Subjt:  IESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRGQTEAVNVLSLGEEVREEAHL

Query:  KRKRKKKKITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQVAR--SNCL---------------------LGVFPQAFVASI
        +R+RKKKK +S  E GARG LP S ADLVDDPEARM GTS+V  RF +EPSSSGV+DQV+R  + CL                     +    +AF+ASI
Subjt:  KRKRKKKKITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQVAR--SNCL---------------------LGVFPQAFVASI

Query:  QSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKDELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALE
           + VKAELD REALAA+E+E   A LEAA +T+K ELLKA  EVDIL+AEV+AK +LLKKE ++ KA LRA HAITKGLEKEKFQLLKEKDD+ Q LE
Subjt:  QSTLAVKAELDEREALAAREKEEFSAGLEAASSTMKDELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALE

Query:  AKDEELKHVTAELETTKERLSNGALLEESFRQHPDFDGFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNS
         KD  +  +T EL+  KERL+NG LLEESFRQHPDFDGF KD SDAGFKFLMKGIA+DM  LQIDL GLKK+Y E+WASGP+GTP PQ+LVDKYVR+L+S
Subjt:  AKDEELKHVTAELETTKERLSNGALLEESFRQHPDFDGFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNS

Query:  DYSDLEED--------QVGTTQEGTP
        DYSD+EE+        +VGTTQE  P
Subjt:  DYSDLEED--------QVGTTQEGTP

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
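The % identity values in the hit lists are conventionally the share of alignment columns holding identical residues. In BLAST-style output the unlabeled middle row shows a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. The sketch below assumes that convention applies to the alignments on this page. `midline_identity` is a hypothetical helper, and the example uses only the first ten columns of the first GenBank alignment, so its result is not expected to match the full-alignment value reported by the database.

```python
def midline_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    # Trailing blanks in the midline are often lost in copied text,
    # so pad (or trim) it to the width of the query row.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# First ten alignment columns of the XP_022138041.1 hit.
query = "KGAGGIVKGP"
midline = "KGA GIVKGP"
print(midline_identity(query, midline))  # 90.0
```

The midline is used rather than a direct Query/Subjt comparison because the Subjt rows shown on this page repeat the Query rows character for character.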

Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATCTAGCTCGTAGGTTAGAATCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGG
GAGGATAGTGATGCTTCCACTTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAAAGC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAAGGATGGATCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAATTTCTCTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCATTTTTTGGTTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGAAAGGCGCAGGAGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGAG
GAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCGGTCCCCGAGCTTACTCAAGCCTCG
TTCAACACACTGAAGTATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAAC
TCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGATTTGCGAGTAACGTGAAATGCAAGTCCAAGGGCCGAGCCCATGCT
CTCGAGGCCGCCCAGAGTTCGGAACCCGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTAATCGAGCTGGAGTCTTCTGGGGGTCCTTCG
CGGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGAACGTCTTGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCATCTGAAGCGGAAGAGGAAGAAGAAGAAG
ATCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCTGGTGGACGATCCTGAAGCCAGGATGAGCGGGACGTCCGACGTTACA
GCACGGTTCAGAGTCGAACCGTCAAGCTCCGGGGTGAGGGACCAGGTAGCTCGGTCTAATTGTCTTCTTGGTGTTTTTCCCCAGGCGTTTGTTGCTTCCATTCAA
TCGACTCTGGCTGTAAAGGCCGAGCTGGATGAGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGGCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCTGAAGGCTCACTCTGAGGTGGACATTTTGAAGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCTCAGCTC
CGAGCTACCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTG
AAGCATGTGACTGCCGAGCTAGAGACGACGAAGGAGCGTCTTAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTTGATGGATTTGTCAAA
GACTTATCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTTTGACCTTCAGATCGACCTCGGTGGTCTGAAGAAGAGGTATGTCGAGCAG
TGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAATATGTCAGAGATCTAAACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGC
ACCACTCAGGAGGGCACTCCTCAAACAGGCTCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATCTAGCTCGTAGGTTAGAATCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGG
GAGGATAGTGATGCTTCCACTTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAAAGC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAAGGATGGATCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAATTTCTCTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCATTTTTTGGTTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGAAAGGCGCAGGAGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGAG
GAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCGGTCCCCGAGCTTACTCAAGCCTCG
TTCAACACACTGAAGTATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAAC
TCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGATTTGCGAGTAACGTGAAATGCAAGTCCAAGGGCCGAGCCCATGCT
CTCGAGGCCGCCCAGAGTTCGGAACCCGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTAATCGAGCTGGAGTCTTCTGGGGGTCCTTCG
CGGGAGAAGCGCCCAAGGGGTCAGACCGAGGCGGTGAACGTCTTGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCATCTGAAGCGGAAGAGGAAGAAGAAGAAG
ATCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCTGGTGGACGATCCTGAAGCCAGGATGAGCGGGACGTCCGACGTTACA
GCACGGTTCAGAGTCGAACCGTCAAGCTCCGGGGTGAGGGACCAGGTAGCTCGGTCTAATTGTCTTCTTGGTGTTTTTCCCCAGGCGTTTGTTGCTTCCATTCAA
TCGACTCTGGCTGTAAAGGCCGAGCTGGATGAGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGGCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCTGAAGGCTCACTCTGAGGTGGACATTTTGAAGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCTCAGCTC
CGAGCTACCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTG
AAGCATGTGACTGCCGAGCTAGAGACGACGAAGGAGCGTCTTAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTTGATGGATTTGTCAAA
GACTTATCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTTTGACCTTCAGATCGACCTCGGTGGTCTGAAGAAGAGGTATGTCGAGCAG
TGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAATATGTCAGAGATCTAAACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGC
ACCACTCAGGAGGGCACTCCTCAAACAGGCTCTTAG
Protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPESILLRIPEEGERADNPPEGWITLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVAPNGWGVIFALAIIFWLRARDSEEAELKGAGGIVKGPTSIKGWVRKWFYASEEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAS
FNTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELAMVCGFASNVKCKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSGGPS
REKRPRGQTEAVNVLSLGEEVREEAHLKRKRKKKKITSPLEVGARGALPASFADLVDDPEARMSGTSDVTARFRVEPSSSGVRDQVARSNCLLGVFPQAFVASIQ
STLAVKAELDEREALAAREKEEFSAGLEAASSTMKDELLKAHSEVDILKAEVEAKAELLKKEEDRRKAQLRATHAITKGLEKEKFQLLKEKDDMLQALEAKDEEL
KHVTAELETTKERLSNGALLEESFRQHPDFDGFVKDLSDAGFKFLMKGIASDMFDLQIDLGGLKKRYVEQWASGPSGTPGPQALVDKYVRDLNSDYSDLEEDQVG
TTQEGTPQTGS
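
The CDS and protein sequences above should correspond through codon-by-codon translation. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, and the two inputs are the first 33 and last 36 bases of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Raises KeyError for codons with symbols other than A, C, G, T.
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCC"))     # MSSSFSSDLGS
print(translate("ACCACTCAGGAGGGCACTCCTCAAACAGGCTCTTAG"))  # TTQEGTPQTGS
```

Both outputs match the start and end of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole sequence.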