; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g14070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g14070
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:10376251..10382222
RNA-Seq ExpressionMoc02g14070
SyntenyMoc02g14070
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.1e-10580.08Show/hide
Query:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS
        MCARKGA GIVKGPTS+ GWVRKWFYASG+WLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS

Query:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEAREE
        AVRPIESSRPNSELAMVCGFAS VK KSKG+AHALEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEAREE

Query:  APLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQ
         PLKRRRKKKK  SP EVGA  VLPA F DRVDDP ARMGGT DV  RFR+EPSS GVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.9e-14594.91Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGAGGIVKGPTS+ GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVR

Query:  KWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYN AVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS

Query:  GVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEA
        GVK KSKGRAHALEAAQ+SKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEA D PPLGE A
Subjt:  GVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.2e-11081.05Show/hide
Query:  GTSDVMARFRIEPSSFGVRDQV------------------------------------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   ++A+ RIEPSS GVRDQV                                    FVASIQSAL VKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRIEPSSFGVRDQV------------------------------------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQH DFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.1e-19095.75Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGAGGIVKGPTS+ GWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVRKWFYASG

Query:  DWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFASGVKHKSK
        +WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYN AVRPIESSRPNSELAMVCGFASGVK KSK
Subjt:  DWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFASGVKHKSK

Query:  GRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQ+SKP TPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-16461.82Show/hide
Query:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS
        MCARKG GGIVKGPTS+ GWV KWF+ASG+WLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS

Query:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPP
         VR IE+SRPNSELAMVCGF   VK KSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPP

Query:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQV------------------------------
        L  E R E+PL+RRRKKKK  S SE GA   LP    D VDDP ARM GTS+V  RF +EPSS GV+DQV                              
Subjt:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQV------------------------------

Query:  ------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
              F+ASI  A++VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  ------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSDYSDPEED--------QVDSTQEGAP
        LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  LVDQYVRDLDSDYSDPEED--------QVDSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.0e-10580.08Show/hide
Query:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS
        MCARKGA GIVKGPTS+ GWVRKWFYASG+WLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS

Query:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEAREE
        AVRPIESSRPNSELAMVCGFAS VK KSKG+AHALEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEAREE

Query:  APLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQ
         PLKRRRKKKK  SP EVGA  VLPA F DRVDDP ARMGGT DV  RFR+EPSS GVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138263.4e-14594.91Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGAGGIVKGPTS+ GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVR

Query:  KWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYN AVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFAS

Query:  GVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEA
        GVK KSKGRAHALEAAQ+SKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEA D PPLGE A
Subjt:  GVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPPLGEEA

A0A6J1D971 uncharacterized protein LOC1110185382.1e-11081.05Show/hide
Query:  GTSDVMARFRIEPSSFGVRDQV------------------------------------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   ++A+ RIEPSS GVRDQV                                    FVASIQSAL VKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRIEPSSFGVRDQV------------------------------------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQH DFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

A0A6J1DXS5 uncharacterized protein LOC1110255023.4e-19095.75Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGAGGIVKGPTS+ GWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVRKWFYASG

Query:  DWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFASGVKHKSK
        +WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYN AVRPIESSRPNSELAMVCGFASGVK KSK
Subjt:  DWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFASGVKHKSK

Query:  GRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQ+SKP TPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256656.5e-16561.82Show/hide
Query:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS
        MCARKG GGIVKGPTS+ GWV KWF+ASG+WLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNS

Query:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPP
         VR IE+SRPNSELAMVCGF   VK KSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEATDAPP

Query:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQV------------------------------
        L  E R E+PL+RRRKKKK  S SE GA   LP    D VDDP ARM GTS+V  RF +EPSS GV+DQV                              
Subjt:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQV------------------------------

Query:  ------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
              F+ASI  A++VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  ------FVASIQSALVVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSDYSDPEED--------QVDSTQEGAP
        LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  LVDQYVRDLDSDYSDPEED--------QVDSTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCTGAATTTAGGAAGGGTAAGATTGTATTGAATGTCGATGGTGGTGGCATTAGAGGCATTACTCCCGGAGCCAACAGTCTAAGTTTCAGACACTGTTTA
TGCAAAGATGTGCACAACAGTGTGTTCCTGGTTGTAGCTCGAACCCGGCCTCCGGACCGACCTGAACACTTGGACGGACCTGCACAAAAAGGTGAGCACTCCGAC
GATCAAGTCAGTATAGGTGGTGGTCCCGCTAATCTAGCGACTGTTACATTCGGTAATCTTGGGACCGTCGGTTACATCCGAGTATTCCCTTCCCCAAACATTGGC
CCCCTCTATGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAG
ATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCC
CTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGTAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTT
AGAATCTCCGATGACGGGGAGGATAGCGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTCGTAGGGGG
TTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTAC
GGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTG
GCCATCCCCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAAAATAGCTAAGAAGCCT
GGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATGATGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGACTGG
CTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGAT
ACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCA
GTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCACAAGTCTAAGGGCCGAGCCCATGCTCTTGAG
GCTGCCCAGAATTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAG
AAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGACGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGA
AGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTTCGGTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGG
ACGTCCGACGTGATGGCGCGGTTCAGAATTGAGCCGTCAAGTTTCGGGGTGAGGGACCAGGTGTTCGTTGCTTCCATTCAATCGGCTCTGGTTGTAAAGGCCGAG
CTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCC
GAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCTATCACCAGA
GGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGTTGGAG
ACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCTTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAG
TTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATTTCAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACC
CCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCA
GCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTGCTGAATTTAGGAAGGGTAAGATTGTATTGAATGTCGATGGTGGTGGCATTAGAGGCATTACTCCCGGAGCCAACAGTCTAAGTTTCAGACACTGTTTA
TGCAAAGATGTGCACAACAGTGTGTTCCTGGTTGTAGCTCGAACCCGGCCTCCGGACCGACCTGAACACTTGGACGGACCTGCACAAAAAGGTGAGCACTCCGAC
GATCAAGTCAGTATAGGTGGTGGTCCCGCTAATCTAGCGACTGTTACATTCGGTAATCTTGGGACCGTCGGTTACATCCGAGTATTCCCTTCCCCAAACATTGGC
CCCCTCTATGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAG
ATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCC
CTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGTAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTT
AGAATCTCCGATGACGGGGAGGATAGCGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTCGTAGGGGG
TTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTAC
GGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTG
GCCATCCCCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAAAATAGCTAAGAAGCCT
GGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATGATGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGACTGG
CTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGAT
ACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCA
GTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCACAAGTCTAAGGGCCGAGCCCATGCTCTTGAG
GCTGCCCAGAATTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAG
AAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGACGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGA
AGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTTCGGTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGG
ACGTCCGACGTGATGGCGCGGTTCAGAATTGAGCCGTCAAGTTTCGGGGTGAGGGACCAGGTGTTCGTTGCTTCCATTCAATCGGCTCTGGTTGTAAAGGCCGAG
CTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCC
GAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCTATCACCAGA
GGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGTTGGAG
ACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTTAGGCAACATCTTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAG
TTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATTTCAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACC
CCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCA
GCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MTAEFRKGKIVLNVDGGGIRGITPGANSLSFRHCLCKDVHNSVFLVVARTRPPDRPEHLDGPAQKGEHSDDQVSIGGGPANLATVTFGNLGTVGYIRVFPSPNIG
PLYVWSDLDLAEKFIRLALDTWRLPIRGKIQPSRKIYRRNIQIFRRFGSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSSISSNLGSDLARRLESELEEIENF
RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFAL
AIPFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGAGGIVKGPTSMMGWVRKWFYASGDWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFD
TLKYYKERFPRGRKVGTLVTDELLLESGLLDYNSAVRPIESSRPNSELAMVCGFASGVKHKSKGRAHALEAAQNSKPPTPAVVGPASEDPAPVIELESSGGPSRE
KRPRDQTEAVDAQTEATDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGFGDRVDDPAARMGGTSDVMARFRIEPSSFGVRDQVFVASIQSALVVKAE
LDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELE
TAKERLSNGVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPP
AGS