CuGenDBv2

Moc11g18530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g18530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr11:14151354..14154064
RNA-Seq Expression: Moc11g18530
Synteny: Moc11g18530
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
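
The genome location string above (chr11:14151354..14154064) can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and that the format is always `name:start..end`. The function names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(loc: str) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1


print(parse_location("chr11:14151354..14154064"))
print(span_length("chr11:14151354..14154064"))
```

Under the inclusive-coordinate assumption, the span length is simply `end - start + 1`; with 0-based half-open coordinates it would be `end - start` instead.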


Homology
GenBank top hits | e value | % identity | Alignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | 1.9e-104 | 80.84
Query:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVK PTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEAREE
        AVRPIESSRPNSELAMV GFAS VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQ EAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEAREE

Query:  APLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQ
         PLKRRRKKKK  SP EVGA  VLPAS+A+RVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  APLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 2.7e-143 | 95.64
Query:  MFEYGLRLPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWL ARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVK PTSIKGWVR
Subjt:  MFEYGLRLPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMV  FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAV GPASEDPAPVIELESSGGPSREKRPRDQ EAVDAQTEAA+ PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | 8.9e-115 | 82.46
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAA--------------------------EKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRC+RRASKFVSAPGSVLQRTIDYAA                          EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAA--------------------------EKEEFSAALEAASSTMKD

Query:  KLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        +LLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLK+KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  KLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQALVDRYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        GFAKD SDAGFKFLMK IASDMPDLQIDLSGLKRRYAEKWASGP GT GPQALVD+YVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  GFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQALVDRYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 1.5e-186 | 95.18
Query:  MSSSISSNLGSDLARRLEAELEEIEHFRISDDGEDNDASTSGQGSEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLE++LEEIE+ RISDDGED+DASTSGQG EYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLEAELEEIEHFRISDDGEDNDASTSGQGSEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVRKWFYASG
        LPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWL ARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVK PTSIKGWVRKWFYASG
Subjt:  LPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMV GFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD
        GRAHALEAAQSSKP TPAV GPASEDPA VIELESSGGPSREKRPRDQ EAVD
Subjt:  GRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 1.1e-168 | 63.31
Query:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVK PTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPP
         VR IE+SRPNSELAMV GF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPP

Query:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI
        L  E R E+PL+RRRKKKK  S SE GA   LP S A+ VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTI
Subjt:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI

Query:  DYAA--------------------------EKEEFSAALEAASSTMKDKLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
        D  A                          E+E   AALEAA +T+K +LLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  DYAA--------------------------EKEEFSAALEAASSTMKDKLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQA
        K+KDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKD SDAGFKFLMK IA+DMP LQIDL+GLK++Y+EKWASGP+GT  PQ+
Subjt:  KKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQA

Query:  LVDRYVRDLDSDYSDPEED--------QVDSTQEGAP
        LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  LVDRYVRDLDSDYSDPEED--------QVDSTQEGAP

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C8K9 uncharacterized protein LOC111009298 | 9.0e-105 | 80.84
Query:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVK PTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEAREE
        AVRPIESSRPNSELAMV GFAS VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQ EAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEAREE

Query:  APLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQ
         PLKRRRKKKK  SP EVGA  VLPAS+A+RVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  APLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQ

A0A6J1CR42 uncharacterized protein LOC111013826 | 1.3e-143 | 95.64
Query:  MFEYGLRLPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVR
        MFEYGLRLPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWL ARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVK PTSIKGWVR
Subjt:  MFEYGLRLPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMV  FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAV GPASEDPAPVIELESSGGPSREKRPRDQ EAVDAQTEAA+ PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEA

A0A6J1D971 uncharacterized protein LOC111018538 | 4.3e-115 | 82.46
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAA--------------------------EKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRC+RRASKFVSAPGSVLQRTIDYAA                          EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAA--------------------------EKEEFSAALEAASSTMKD

Query:  KLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        +LLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLK+KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  KLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQALVDRYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        GFAKD SDAGFKFLMK IASDMPDLQIDLSGLKRRYAEKWASGP GT GPQALVD+YVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  GFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQALVDRYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

A0A6J1DXS5 uncharacterized protein LOC111025502 | 7.2e-187 | 95.18
Query:  MSSSISSNLGSDLARRLEAELEEIEHFRISDDGEDNDASTSGQGSEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLE++LEEIE+ RISDDGED+DASTSGQG EYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLEAELEEIEHFRISDDGEDNDASTSGQGSEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVRKWFYASG
        LPLH FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWL ARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVK PTSIKGWVRKWFYASG
Subjt:  LPLHHFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMV GFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD
        GRAHALEAAQSSKP TPAV GPASEDPA VIELESSGGPSREKRPRDQ EAVD
Subjt:  GRAHALEAAQSSKPPTPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | 5.2e-169 | 63.31
Query:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVK PTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPP
         VR IE+SRPNSELAMV GF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D         P
Subjt:  AVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPP

Query:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI
        L  E R E+PL+RRRKKKK  S SE GA   LP S A+ VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTI
Subjt:  LGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI

Query:  DYAA--------------------------EKEEFSAALEAASSTMKDKLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
        D  A                          E+E   AALEAA +T+K +LLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  DYAA--------------------------EKEEFSAALEAASSTMKDKLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQA
        K+KDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKD SDAGFKFLMK IA+DMP LQIDL+GLK++Y+EKWASGP+GT  PQ+
Subjt:  KKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSGPQA

Query:  LVDRYVRDLDSDYSDPEED--------QVDSTQEGAP
        LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  LVDRYVRDLDSDYSDPEED--------QVDSTQEGAP

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGTCATCTTGGGGCACCAATAGGGGTCTTCCACGTGTCCCGAGTATACCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAG
AAGTTCATTCCACCTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGGTCTCAGAGAGGATCCCAGCCACTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCCCTTTCGAACATAATTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTTAGAGGCCGAGCTCGAGGAGATAGAACACTTTAGAATCTCCGATGATGGGGAGGATAACGATGCCTCC
ACTTCAGGTCAGGGTTCGGAATATCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCG
GAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCATTTTGTCCAAGAA
TTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTTCTAGCTAGGGATAGTGAA
GAGGCCGAGCTACTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGT
GGTATAGTTAAGAGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTCCCCACTAGGTTTGGGAACCTAGTCTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGT
AGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCTAGGCCGAACTCTGAA
CTCGCCATGGTTTTCGGATTTGCAAGCGGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCGCCCACCCCTGCCGTG
GCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCCGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGGCCGAGGCGGTGGACGCC
CAGACCGAGGCGGCGAACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTC
GGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTAATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCG
TCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCTCCTGGGTCCGTTCTG
CAGAGGACCATTGACTACGCTGCCGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATAAGCTGCTGAAGGCTCACTCCGAGGTG
GAGACTTTGAAGGCCGAAGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTG
GAGAGGGAGAAGTTCCAGCTCCTGAAGAAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCG
AAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACCTTTCTGACGCGGGCTTCAAGTTCCTC
ATGAAGGACATTGCTTCCGACATGCCCGACCTTCAGATCGATTTAAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCTCTGGC
CCCCAAGCGTTGGTGGATCGGTATGTCAGGGATCTGGATTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGC
TCTTAG
mRNA sequence
ATGAGTCATCTTGGGGCACCAATAGGGGTCTTCCACGTGTCCCGAGTATACCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAG
AAGTTCATTCCACCTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTC
GGGTCTCAGAGAGGATCCCAGCCACTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCCCTTTCGAACATAATTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTTAGAGGCCGAGCTCGAGGAGATAGAACACTTTAGAATCTCCGATGATGGGGAGGATAACGATGCCTCC
ACTTCAGGTCAGGGTTCGGAATATCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCG
GAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCATTTTGTCCAAGAA
TTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTTCTAGCTAGGGATAGTGAA
GAGGCCGAGCTACTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGT
GGTATAGTTAAGAGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTCCCCACTAGGTTTGGGAACCTAGTCTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGT
AGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCTAGGCCGAACTCTGAA
CTCGCCATGGTTTTCGGATTTGCAAGCGGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCGCCCACCCCTGCCGTG
GCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCCGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGGCCGAGGCGGTGGACGCC
CAGACCGAGGCGGCGAACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTC
GGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTAATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCG
TCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCTCCTGGGTCCGTTCTG
CAGAGGACCATTGACTACGCTGCCGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATAAGCTGCTGAAGGCTCACTCCGAGGTG
GAGACTTTGAAGGCCGAAGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTG
GAGAGGGAGAAGTTCCAGCTCCTGAAGAAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCG
AAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACCTTTCTGACGCGGGCTTCAAGTTCCTC
ATGAAGGACATTGCTTCCGACATGCCCGACCTTCAGATCGATTTAAGTGGTCTGAAAAGAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCTCTGGC
CCCCAAGCGTTGGTGGATCGGTATGTCAGGGATCTGGATTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGC
TCTTAG
Protein sequence
MSHLGAPIGVFHVSRVYPSPNIGPLSVWSDLDLAEKFIPPALDTWRLPIRGKIQPSRKIYRRNIQIFRRFGSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSS
ISSNLGSDLARRLEAELEEIEHFRISDDGEDNDASTSGQGSEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHHFVQE
FLFRTGLAPAQVAPNGWGVIFALAILFWLLARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKRPTSIKGWVRKWFYASGEWLAKDESGRSFFD
VPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVFGFASGVKRKSKGRAHALEAAQSSKPPTPAV
AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAANAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWANRVDDPAARMGGTSDVTARFRIEP
SSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEKEEFSAALEAASSTMKDKLLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGL
EREKFQLLKKKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDLSDAGFKFLMKDIASDMPDLQIDLSGLKRRYAEKWASGPSGTSG
PQALVDRYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
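
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (the page does not say which translation table was used). The function name `translate` is a hypothetical helper, and only the first 21 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons ordered by base T, C, A, G
# at the first, second, and third positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGAGTCATCTTGGGGCACCA"))  # MSHLGAP
# Last four codons of the CDS, including the TAG stop codon.
print(translate("GCAGGCTCTTAG"))  # AGS
```

Both fragments agree with the start (MSHLGAP) and end (AGS) of the listed protein sequence. To check the full record, the wrapped CDS lines can be joined into a single string and passed to the same function.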