; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g22650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g22650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:16484205..16486583
RNA-Seq ExpressionMoc04g22650
SyntenyMoc04g22650
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]5.1e-10682.68Show/hide
Query:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP
        +CARK A GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLV D+LLLESGLLDYNP
Subjt:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPPKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEA QSSKP TP + GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE P KRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPPKRRR

Query:  KKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  V PASFADRVDDP ARMG T DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.9e-13892.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGV+FALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFY+CARK AGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLV DELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEA QSSKP TP + GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.1e-10881.45Show/hide
Query:  VTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLK
        + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALE ASSTMKDELLK
Subjt:  VTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLK

Query:  AHSEVETLKAE--------------------------------------EKDDMLQTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAK
        AHSEVETLKAE                                      EKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAK
Subjt:  AHSEVETLKAE--------------------------------------EKDDMLQTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAK

Query:  DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAQEG
        DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+ QEG
Subjt:  DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAQEG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.6e-18794.33Show/hide
Query:  MSSSFSSNLGSDLARRLDSELEEIENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRLLEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRL+S+LEEIEN R+SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIP+NILLRL EEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLDSELEEIENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRLLEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGV+FALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARK AGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLV DELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEA QSSKPATP + GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.1e-17264.34Show/hide
Query:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP
        +CARK  GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLV D+LLLESGLLDYNP
Subjt:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVL--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP +        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVL--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPPKRRRKKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+P +RRRKKKK  S SE GA    P S AD VDDP ARM  TS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------EKDDML
        +ASI  A+ VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                      EKDD+ 
Subjt:  VASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------EKDDML

Query:  QTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDLEED--------QVGAAQEGTP
        +LDSDYSD+EE+        +VG  QE  P
Subjt:  DLDSDYSDLEED--------QVGAAQEGTP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.5e-10682.68Show/hide
Query:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP
        +CARK A GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLV D+LLLESGLLDYNP
Subjt:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPPKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEA QSSKP TP + GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE P KRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPPKRRR

Query:  KKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  V PASFADRVDDP ARMG T DVT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.9e-13892.31Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGV+FALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFY+CARK AGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLV DELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEA QSSKP TP + GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

A0A6J1D971 uncharacterized protein LOC1110185385.3e-10981.45Show/hide
Query:  VTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLK
        + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALE ASSTMKDELLK
Subjt:  VTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLK

Query:  AHSEVETLKAE--------------------------------------EKDDMLQTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAK
        AHSEVETLKAE                                      EKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAK
Subjt:  AHSEVETLKAE--------------------------------------EKDDMLQTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAK

Query:  DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAQEG
        DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+ QEG
Subjt:  DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAQEG

A0A6J1DXS5 uncharacterized protein LOC1110255027.5e-18894.33Show/hide
Query:  MSSSFSSNLGSDLARRLDSELEEIENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRLLEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRL+S+LEEIEN R+SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIP+NILLRL EEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLDSELEEIENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRLLEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGV+FALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFY+CARK AGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLV DELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEA QSSKPATP + GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEATQSSKPATPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-17264.34Show/hide
Query:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP
        +CARK  GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLV D+LLLESGLLDYNP
Subjt:  ICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVL--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP +        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPATPVL--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPPKRRRKKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+P +RRRKKKK  S SE GA    P S AD VDDP ARM  TS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------EKDDML
        +ASI  A+ VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                      EKDD+ 
Subjt:  VASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------EKDDML

Query:  QTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDLEED--------QVGAAQEGTP
        +LDSDYSD+EE+        +VG  QE  P
Subjt:  DLDSDYSDLEED--------QVGAAQEGTP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGATTCCGAGCTCGAGGAGATAGAAAACTTTAGACTCTCCGATGACGGGGAGGAT
AGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTAAGAACATCCTA
CTCAGGCTTCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTTAAAATGTTTGAGTATGGCCTCAGACTTCCCCTTCACCCT
TTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCGTTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCT
CGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATATGCGCAAGG
AAAGCCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGC
TTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGATCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGG
CCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCACCCAGAGTTCGAAACCTGCC
ACCCCGGTCCTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAG
GCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAGAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGC
AGGGTCTGGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCAGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCG
GGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACC
ATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGACCGAGCTGGATGGAAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAG
TTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGAGAAGGACGACATGCTC
CAGACGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACAGCGAAGGAGCGGCTCAGCAATGGAGTTCTATTGGAGGAATCGTTTAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTC
AGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGTACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGAC
TACTCCGATCTCGAAGAGGACCAGGTCGGCGCTGCACAGGAGGGCACTCCTCAGGCGGACCCTTGGGCGACCATCCTTCATGAGGCTTTTCTCTGTCTCTCTTCT
CTTCCTTTTTTGTTTATAAGTGTCAGGGCAGAGCTGCAAGGTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGTTAGATTCCGAGCTCGAGGAGATAGAAAACTTTAGACTCTCCGATGACGGGGAGGAT
AGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTAAGAACATCCTA
CTCAGGCTTCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTTAAAATGTTTGAGTATGGCCTCAGACTTCCCCTTCACCCT
TTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCGTTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCT
CGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATATGCGCAAGG
AAAGCCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGC
TTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGATCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGG
CCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCACCCAGAGTTCGAAACCTGCC
ACCCCGGTCCTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAG
GCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAGAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGC
AGGGTCTGGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCAGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCG
GGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACC
ATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGACCGAGCTGGATGGAAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAG
TTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGAGAAGGACGACATGCTC
CAGACGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACAGCGAAGGAGCGGCTCAGCAATGGAGTTCTATTGGAGGAATCGTTTAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTC
AGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGTACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGAC
TACTCCGATCTCGAAGAGGACCAGGTCGGCGCTGCACAGGAGGGCACTCCTCAGGCGGACCCTTGGGCGACCATCCTTCATGAGGCTTTTCTCTGTCTCTCTTCT
CTTCCTTTTTTGTTTATAAGTGTCAGGGCAGAGCTGCAAGGTCTATAA
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDLARRLDSELEEIENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPKNILLRLLEEGERADNPPEGWVTLYFKMFEYGLRLPLHP
FVQEFLFRTGLAPAQVAPNGWGVVFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYICARKAAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGR
SFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVIDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEATQSSKPA
TPVLAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPPKRRRKKKKAISPSEVGACRVWPASFADRVDDPAARMGRTSDVTARFRVEPSSS
GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKTELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEEKDDML
QTLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
YSDLEEDQVGAAQEGTPQADPWATILHEAFLCLSSLPFLFISVRAELQGL