CuGenDBv2

Moc09g07130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g07130
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:5622572..5624988
RNA-Seq Expression: Moc09g07130
Synteny: Moc09g07130
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
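
The genome location above uses a seqid:start..end notation. A minimal sketch of reading it programmatically follows, assuming the coordinates are 1-based and inclusive (the page does not state this). The names Locus and parse_locus are hypothetical helpers and not part of any CuGenDB API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval such as chr9:5622572..5624988."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the span.
        return self.end - self.start + 1


# seqid, a colon, then two integers separated by ".."
_LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse 'seqid:start..end' into a Locus, rejecting malformed input."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["seqid"], start, end)


locus = parse_locus("chr9:5622572..5624988")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 2417 bp on chr9.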


Homology
GenBank top hits (accession and description; e value; % identity; alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e value 1.3e-109; identity 84.65%
Query:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP
        MCA KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNP
Subjt:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP

Query:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRP+ESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQS KP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value 1.9e-137; identity 92.31%
Query:  MFEYSLRLPLHPFVQEFLFRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVR
        MFEY LRLPLHPFVQEFLFR GLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLD+DQLLACFEAKRIAKKPGRFYMCA KGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPLHPFVQEFLFRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTR GNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLES LLDYNPAVRP+E SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEAAQS KP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value 8.7e-122; identity 85.26%
Query:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE             AELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDMLQALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFN
        ELLKAHSEVE LKAEVESQ ELLKKEE+RR+AQLRAAHAITRGLE+EKFQLLKEK+DMLQAL+AKDKELEHATAELETAK+RLSNGVLLEE+FRQHPDF+
Subjt:  ELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDMLQALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFN

Query:  GFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVRDLDSDYSDLEEDQVGTTHEGAPQADS
        GFAKDFSD GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQ LVDQYVRDLDSDYSD EEDQVG+T EGA    S
Subjt:  GFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVRDLDSDYSDLEEDQVGTTHEGAPQADS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value 2.7e-184; identity 95.32%
Query:  DLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPLHPFVQEFL
        DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEY LRLPLHPFVQEFL
Subjt:  DLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPLHPFVQEFL

Query:  FRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS
        FR GLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL D+DQLLACFEAKRIAKKPGRFYMCA KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS
Subjt:  FRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS

Query:  FFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQS
        FFDVPTR GNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLES LLDYNPAVRP+ESSRPNSELAMVCGFAS VKRKSKGRAHALEAAQS
Subjt:  FFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQS

Query:  LKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
         KP TPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  LKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value 2.2e-186; identity 68.11%
Query:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP
        MCA KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR GNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLES LLDYNP
Subjt:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP

Query:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR +E+SRPNSELAMVCGF  SVKRKSKGRAHAL+     +P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE  
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDML
                   AELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEK+D+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDML

Query:  QALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFNGFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVR
        Q L+ KD  +   T EL+  K+RL+NG LLEESFRQHPDF+GFAKDFSD GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ LVD+YVR
Subjt:  QALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFNGFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVR

Query:  DLDSDYSDLEED--------QVGTTHEGAP
        +LDSDYSD+EE+        +VGTT E  P
Subjt:  DLDSDYSDLEED--------QVGTTHEGAP

TrEMBL top hits (accession and description; e value; % identity; alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e value 6.3e-110; identity 84.65%
Query:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP
        MCA KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNP
Subjt:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP

Query:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR
        AVRP+ESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQS KP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKRRR
Subjt:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826; e value 9.3e-138; identity 92.31%
Query:  MFEYSLRLPLHPFVQEFLFRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVR
        MFEY LRLPLHPFVQEFLFR GLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLD+DQLLACFEAKRIAKKPGRFYMCA KGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPLHPFVQEFLFRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTR GNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLES LLDYNPAVRP+E SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
         VKRKSKGRAHALEAAQS KP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  SVKRKSKGRAHALEAAQSLKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

A0A6J1D971 uncharacterized protein LOC111018538; e value 4.2e-122; identity 85.26%
Query:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE             AELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDMLQALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFN
        ELLKAHSEVE LKAEVESQ ELLKKEE+RR+AQLRAAHAITRGLE+EKFQLLKEK+DMLQAL+AKDKELEHATAELETAK+RLSNGVLLEE+FRQHPDF+
Subjt:  ELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDMLQALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFN

Query:  GFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVRDLDSDYSDLEEDQVGTTHEGAPQADS
        GFAKDFSD GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQ LVDQYVRDLDSDYSD EEDQVG+T EGA    S
Subjt:  GFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVRDLDSDYSDLEEDQVGTTHEGAPQADS

A0A6J1DXS5 uncharacterized protein LOC111025502; e value 1.3e-184; identity 95.32%
Query:  DLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPLHPFVQEFL
        DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEY LRLPLHPFVQEFL
Subjt:  DLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPLHPFVQEFL

Query:  FRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS
        FR GLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL D+DQLLACFEAKRIAKKPGRFYMCA KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS
Subjt:  FRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRS

Query:  FFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQS
        FFDVPTR GNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLES LLDYNPAVRP+ESSRPNSELAMVCGFAS VKRKSKGRAHALEAAQS
Subjt:  FFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQS

Query:  LKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
         KP TPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  LKPTTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665; e value 1.1e-186; identity 68.11%
Query:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP
        MCA KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR GNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLES LLDYNP
Subjt:  MCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNP

Query:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR +E+SRPNSELAMVCGF  SVKRKSKGRAHAL+     +P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKPTTPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE  
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDML
                   AELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEK+D+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQTELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKNDML

Query:  QALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFNGFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVR
        Q L+ KD  +   T EL+  K+RL+NG LLEESFRQHPDF+GFAKDFSD GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ LVD+YVR
Subjt:  QALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFNGFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQVLVDQYVR

Query:  DLDSDYSDLEED--------QVGTTHEGAP
        +LDSDYSD+EE+        +VGTT E  P
Subjt:  DLDSDYSDLEED--------QVGTTHEGAP

SwissProt top hits (accession and description; e value; % identity; alignment)
No hits found
Arabidopsis top hits (accession and description; e value; % identity; alignment)
No hits found

Sequences
CDS sequence
ATGCCCCCAATCCTTCAGCACTTAGGATCCAATGAGGACCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAG
GATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATC
CTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACAGTCTCAGACTTCCCCTTCAC
CCTTTCGTCCAAGAATTTCTCTTCCGGATGGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGA
GCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACATAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCA
AGTAAAGGTGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGGT
CGTTCCTTCTTCGACGTCCCCACTAGGATTGGGAACCTGGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTAAAATACTACAAGGAG
CGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGAGCTGCTAGATTACAACCCTGCAGTTCGTCCCGTCGAATCCTCA
AGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTTGAAACCT
ACCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCCTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACC
GAGGCTGTGGACGCCCCGCCTTTGGGCGAAGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCT
TGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGT
TCCGGAGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGG
ACCATCGACTACGCCGCCGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCAAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATG
AAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAAGCCGAGGTGGAGTCCCAGACCGAGCTGCTGAAAAAGGAAGAGAACAGGCGCAAGGCCCAA
CTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGAACGACATGCTCCAGGCGCTTGATGCGAAGGATAAGGAG
CTGGAGCATGCGACTGCAGAGCTGGAAACGGCGAAGAAGCGCCTCAGCAATGGAGTCCTATTAGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCC
AAAGACTTTTCTGACACGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATTGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAG
AAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGTGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTC
GGCACCACACATGAGGGCGCTCCTCAGGCGGACTCTTAG
mRNA sequence
ATGCCCCCAATCCTTCAGCACTTAGGATCCAATGAGGACCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAG
GATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATC
CTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACAGTCTCAGACTTCCCCTTCAC
CCTTTCGTCCAAGAATTTCTCTTCCGGATGGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGA
GCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACATAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCA
AGTAAAGGTGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGGT
CGTTCCTTCTTCGACGTCCCCACTAGGATTGGGAACCTGGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTAAAATACTACAAGGAG
CGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGAGCTGCTAGATTACAACCCTGCAGTTCGTCCCGTCGAATCCTCA
AGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTTGAAACCT
ACCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCCTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACC
GAGGCTGTGGACGCCCCGCCTTTGGGCGAAGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCT
TGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGT
TCCGGAGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGG
ACCATCGACTACGCCGCCGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCAAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATG
AAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAAGCCGAGGTGGAGTCCCAGACCGAGCTGCTGAAAAAGGAAGAGAACAGGCGCAAGGCCCAA
CTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGAACGACATGCTCCAGGCGCTTGATGCGAAGGATAAGGAG
CTGGAGCATGCGACTGCAGAGCTGGAAACGGCGAAGAAGCGCCTCAGCAATGGAGTCCTATTAGAGGAATCGTTTAGGCAACATCCTGACTTCAATGGATTTGCC
AAAGACTTTTCTGACACGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATTGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAG
AAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGTGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTC
GGCACCACACATGAGGGCGCTCCTCAGGCGGACTCTTAG
Protein sequence
MPPILQHLGSNEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPLH
PFVQEFLFRMGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDIDQLLACFEAKRIAKKPGRFYMCASKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESG
RSFFDVPTRIGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESELLDYNPAVRPVESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSLKP
TTPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRVEPSS
SGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQTELLKKEENRRKAQ
LRAAHAITRGLEKEKFQLLKEKNDMLQALDAKDKELEHATAELETAKKRLSNGVLLEESFRQHPDFNGFAKDFSDTGFKFLMKGIASDMPDLQIDLSGLKRRYAE
KWASGPGGTPGPQVLVDQYVRDLDSDYSDLEEDQVGTTHEGAPQADS