; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g27660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g27660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:19949126..19952067
RNA-Seq ExpressionMoc08g27660
SyntenyMoc08g27660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.3e-11386.22Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVC FAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQ
        KKKKT SPLEVGA GVLPA F DRVDDPEARMGGT DVT RFRV+PSS+GVRDQ
Subjt:  KKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.0e-13993.77Show/hide
Query:  MFEYGLRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRL LHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFE KRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG KVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.0e-11182.46Show/hide
Query:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE+QAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------RDLDSDYPDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+                        RDLDSDY D EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------RDLDSDYPDLEEDQVGTTQEGAPQADS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.8e-15984.51Show/hide
Query:  MSSSFSNNLGSDEDLASRLESELEEIENFRLSDDGEDSDASTS-----------------------------------GERVDNPPEGWVTLYFKMFEYG
        MSSS S+NL  + DLA RLES+LEEIEN R+SDDGEDSDASTS                                   GER DNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSNNLGSDEDLASRLESELEEIENFRLSDDGEDSDASTS-----------------------------------GERVDNPPEGWVTLYFKMFEYG

Query:  LRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRL LHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSE+AEL DVDQLLACFE KRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASSVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG KVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-17867.36Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR  K+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVC F  SVKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT S  E GA G LP    D VDDPEARM GTS+V  RF ++PSS+GV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------R
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK                        R
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------R

Query:  DLDSDYPDLEED--------QVGTTQEGAP
        +LDSDY D+EE+        +VGTTQE  P
Subjt:  DLDSDYPDLEED--------QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.1e-11386.22Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVC FAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQ
        KKKKT SPLEVGA GVLPA F DRVDDPEARMGGT DVT RFRV+PSS+GVRDQ
Subjt:  KKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138269.9e-14093.77Show/hide
Query:  MFEYGLRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRL LHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFE KRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG KVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC1110185381.9e-11182.46Show/hide
Query:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ R++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE+QAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------RDLDSDYPDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+                        RDLDSDY D EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------RDLDSDYPDLEEDQVGTTQEGAPQADS

A0A6J1DXS5 uncharacterized protein LOC1110255028.6e-16084.51Show/hide
Query:  MSSSFSNNLGSDEDLASRLESELEEIENFRLSDDGEDSDASTS-----------------------------------GERVDNPPEGWVTLYFKMFEYG
        MSSS S+NL  + DLA RLES+LEEIEN R+SDDGEDSDASTS                                   GER DNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSNNLGSDEDLASRLESELEEIENFRLSDDGEDSDASTS-----------------------------------GERVDNPPEGWVTLYFKMFEYG

Query:  LRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRL LHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSE+AEL DVDQLLACFE KRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASSVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG KVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256656.3e-17967.36Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR  K+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVC F  SVKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT S  E GA G LP    D VDDPEARM GTS+V  RF ++PSS+GV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------R
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK                        R
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK------------------------R

Query:  DLDSDYPDLEED--------QVGTTQEGAP
        +LDSDY D+EE+        +VGTTQE  P
Subjt:  DLDSDYPDLEED--------QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTAGTCCTGCTAATCTCGCAACGGTTACACCCGGTAATCTCGGGATCGTCGATTACACCCGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTCC
TCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAACAACTTAGGATCCGATGAGGACTTAGCTAGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAC
TCTCCGATGACGGGGAGGATAGTGACGCCTCCACCTCAGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGA
CTTTCCCTTCACCCTTTTGTCCAAGAATTTCTCTTTCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTG
GCTACGAGCTCGGGATAGTGAGAAGGCCGAGCTGTTGGACGTGGACCAGCTCCTCGCGTGCTTCGAAGTGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCG
CAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACTTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTTTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCC
GAGGGGTTGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCTGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCCG
AACTTGCCATGGTTTGCAGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCA
GGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTT
GGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGACGATCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGATTTTCACAG
ATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGTGAGGGACCAGGTGTCCCGCATCTCG
GCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCAT
TCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGG
ATGAACTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCTGAGGTAGAGACCCAGGCTGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCT
GCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGAC
TGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACTTCGATGGATTTGCCAAGGACTTCTCTGACGCGG
GCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGAGATCTGGACTCTGACTACCCTGATCTCGAAGAG
GACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAGGCGGACTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGTAGTCCTGCTAATCTCGCAACGGTTACACCCGGTAATCTCGGGATCGTCGATTACACCCGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTCC
TCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAACAACTTAGGATCCGATGAGGACTTAGCTAGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAC
TCTCCGATGACGGGGAGGATAGTGACGCCTCCACCTCAGGGGAGAGAGTTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGA
CTTTCCCTTCACCCTTTTGTCCAAGAATTTCTCTTTCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTG
GCTACGAGCTCGGGATAGTGAGAAGGCCGAGCTGTTGGACGTGGACCAGCTCCTCGCGTGCTTCGAAGTGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCG
CAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACTTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTTTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCC
GAGGGGTTGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCTGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCCG
AACTTGCCATGGTTTGCAGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCA
GGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTT
GGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGACGATCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGATTTTCACAG
ATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGTGAGGGACCAGGTGTCCCGCATCTCG
GCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCAT
TCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGG
ATGAACTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCTGAGGTAGAGACCCAGGCTGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCT
GCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGAC
TGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACTTCGATGGATTTGCCAAGGACTTCTCTGACGCGG
GCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGAGATCTGGACTCTGACTACCCTGATCTCGAAGAG
GACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAGGCGGACTCTTAG
Protein sequenceShow/hide protein sequence
MGGSPANLATVTPGNLGIVDYTRQLEPLVGRSLPSLPLSNVVAMSSSFSNNLGSDEDLASRLESELEEIENFRLSDDGEDSDASTSGERVDNPPEGWVTLYFKMFEYGLR
LSLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEKAELLDVDQLLACFEVKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGR
SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGWKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASSVKRKSKGRAHALEAAQSSKPATPAVA
GPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTISPLEVGACGVLPAIFTDRVDDPEARMGGTSDVTARFRVQPSSAGVRDQVSRIS
AASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETQAELLKKEEDRRKAQLRA
AHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRDLDSDYPDLEE
DQVGTTQEGAPQADS