CuGenDBv2

Moc04g23380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g23380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:16902334..16905620
RNA-Seq Expression: Moc04g23380
Synteny: Moc04g23380
Gene Ontology terms: NA
InterPro domains: NA
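
The genome location above is written as `chromosome:start..end`. The following is a minimal sketch of how such a string could be split into its parts and a span length derived. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr4:16902334..16905620'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr4:16902334..16905620")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before the number is relied on.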


Homology
GenBank top hits | e value | % identity | Alignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 6.1e-100 | % identity: 80.8
Query:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP
        KGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V +RPVPELTQASFDTLKYYKE FPRGRKV TLVT+KLLLESGLL YNPAVRP
Subjt:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP

Query:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK
        IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV                       KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK
Subjt:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK

Query:  TTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        TTSPLEVGARGVLPASF DRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  TTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 1.7e-81 | % identity: 65.57
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG                                                  KGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRF NLV +RPVPELTQASFDTLKYYKERFPRGRKV TLVT++LLLESGLL YNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TPAV                       KRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 2.1e-92 | % identity: 81.74
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAMKAELDGREVLAAREKEEFSAALEAASSAMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQSALA+KAELDGREVLAAREKEEFSAALE ASS MKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAMKAELDGREVLAAREKEEFSAALEAASSAMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WA
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 3.9e-131 | % identity: 73.52
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSHASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDS ASTSGQGLEYPS+IPEHYLGSLRRGFAIPENILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSHASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNG                                                  KGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRF NLV +RPVPELTQASFDTLKYYKERFPRGRKV TLVT++LLLESGLL YNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV                       KRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 7.3e-154 | % identity: 65.06
Query:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP
        KG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLV ++ +PEL QA+FDTLK+YK+ FPR RK+ TLVT+KLLLESGLL YNP VR 
Subjt:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP

Query:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-------------------------------KRPRDQTEAVDVSPLGEEVREEVPL
        IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V                               KR R+++EA+DVSPL  EVR E PL
Subjt:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-------------------------------KRPRDQTEAVDVSPLGEEVREEVPL

Query:  KRRRKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASI
        +RRRKKKKT+S  E GARG LP S  D VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKF                 AF+ASI
Subjt:  KRRRKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASI

Query:  QSALAMKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALE
          A+ +KAELDGRE LAA+E+E   AALEAA++ +K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE
Subjt:  QSALAMKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALE

Query:  AKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA
         K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WA
Subjt:  AKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 3.0e-100 | % identity: 80.8
Query:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP
        KGACGIVKGPTSIKGWVRKWFYASGEWLAKDES              V +RPVPELTQASFDTLKYYKE FPRGRKV TLVT+KLLLESGLL YNPAVRP
Subjt:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP

Query:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK
        IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV                       KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK
Subjt:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKK

Query:  TTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        TTSPLEVGARGVLPASF DRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  TTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 8.1e-82 | % identity: 65.57
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG                                                  KGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRF NLV +RPVPELTQASFDTLKYYKERFPRGRKV TLVT++LLLESGLL YNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TPAV                       KRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 1.0e-92 | % identity: 81.74
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAMKAELDGREVLAAREKEEFSAALEAASSAMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQSALA+KAELDGREVLAAREKEEFSAALE ASS MKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAMKAELDGREVLAAREKEEFSAALEAASSAMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WA
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 1.9e-131 | % identity: 73.52
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSHASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDS ASTSGQGLEYPS+IPEHYLGSLRRGFAIPENILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSHASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNG                                                  KGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNG--------------------------------------------------KGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRF NLV +RPVPELTQASFDTLKYYKERFPRGRKV TLVT++LLLESGLL YNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV                       KRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAV-----------------------KRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 3.6e-154 | % identity: 65.06
Query:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP
        KG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLV ++ +PEL QA+FDTLK+YK+ FPR RK+ TLVT+KLLLESGLL YNP VR 
Subjt:  KGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAVRP

Query:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-------------------------------KRPRDQTEAVDVSPLGEEVREEVPL
        IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V                               KR R+++EA+DVSPL  EVR E PL
Subjt:  IESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV-------------------------------KRPRDQTEAVDVSPLGEEVREEVPL

Query:  KRRRKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASI
        +RRRKKKKT+S  E GARG LP S  D VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKF                 AF+ASI
Subjt:  KRRRKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASI

Query:  QSALAMKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALE
          A+ +KAELDGRE LAA+E+E   AALEAA++ +K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE
Subjt:  QSALAMKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALE

Query:  AKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA
         K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WA
Subjt:  AKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWA

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
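
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the share of identical columns in one displayed block could be computed as below. The function name `block_identity` is hypothetical, and the result is not expected to reproduce the % identity values listed on the page, since those may be computed over alignment regions that are not fully shown here.

```python
def block_identity(query: str, midline: str) -> float:
    """Percent of columns in one alignment block marked identical.

    `query` is the text after 'Query:' and `midline` is the unlabeled
    line beneath it. Trailing spaces are often lost in copied text, so
    the midline is padded back to the query length before counting.
    """
    if not query:
        raise ValueError("empty query line")
    padded = midline.ljust(len(query))[: len(query)]
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Short excerpt taken from one of the blocks above (gap columns omitted).
query_part = "KGACGIVKGPTSIKGWVR"
midline_part = "KGA GIVKGPTSIKGWVR"
print(round(block_identity(query_part, midline_part), 2))
```

Gap columns (`-` in the query) are counted in the denominator here; whether to include them is a design choice that changes the figure.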

Sequences
CDS sequence
ATGGAAAAGATACCTTCAATGTTCAAGTTGTTGACCATAAAGCAATACAATGGCCTCAAAGATTCAATAGACCACATAGATGTTGAATTCATTTTCCACAGTGGG
TTGGTGCACTACATCCTGCTAAGGGAAGTCGTCAGCTTCGAGATATTCTCTACTTCAAAAGACAAGTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATT
GCAGCTCGAACTCGGCCTCCAGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATGAAGTCAGTATAGGTCGGATTCCCAGTTTA
GTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACGTGCTTTGGACGCGTGGCGACTTC
CTATTTGTGGGAAAATATAACCGTTGCGGTAGATTTATCGTTGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACA
CGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTGGTTGCCATGTCGTCCTCGTTTAGCAGCAACTTAGGATCCGATGAGGATTTA
GCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTCATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCT
TCTAAGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCT
CTAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCG
GCTCAAGTGGCCCCCAATGGGAAAGGCGCATGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTT
GCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTAGGAACCTAGTTTTAGTCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACG
CTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGAAACCTTGGTGACCAACAAGCTGCTGCTTGAGTCCGGGCTGCTAGTTTACAACCCCGCAGTT
CGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCC
GCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCT
CTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTGGACGATCCTGAGGCC
AGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGCGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGC
TGCCTAAGGAGGGCGTCCAAATTTGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCATGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAA
GAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAG
GCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATAACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAG
AAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAAGAGCGTCTCAGCAATGGAGCCCTATTG
GAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGAC
CTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTGGGCCCAGCGGCATCCCTAG
mRNA sequence
ATGGAAAAGATACCTTCAATGTTCAAGTTGTTGACCATAAAGCAATACAATGGCCTCAAAGATTCAATAGACCACATAGATGTTGAATTCATTTTCCACAGTGGG
TTGGTGCACTACATCCTGCTAAGGGAAGTCGTCAGCTTCGAGATATTCTCTACTTCAAAAGACAAGTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATT
GCAGCTCGAACTCGGCCTCCAGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATGAAGTCAGTATAGGTCGGATTCCCAGTTTA
GTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACGTGCTTTGGACGCGTGGCGACTTC
CTATTTGTGGGAAAATATAACCGTTGCGGTAGATTTATCGTTGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACA
CGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTGGTTGCCATGTCGTCCTCGTTTAGCAGCAACTTAGGATCCGATGAGGATTTA
GCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTCATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCT
TCTAAGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCT
CTAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCG
GCTCAAGTGGCCCCCAATGGGAAAGGCGCATGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTT
GCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTAGGAACCTAGTTTTAGTCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACG
CTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGAAACCTTGGTGACCAACAAGCTGCTGCTTGAGTCCGGGCTGCTAGTTTACAACCCCGCAGTT
CGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCC
GCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCT
CTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTGGACGATCCTGAGGCC
AGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGCGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGC
TGCCTAAGGAGGGCGTCCAAATTTGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCATGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAA
GAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAG
GCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATAACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAG
AAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAAGAGCGTCTCAGCAATGGAGCCCTATTG
GAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGAC
CTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTGGGCCCAGCGGCATCCCTAG
Protein sequence
MEKIPSMFKLLTIKQYNGLKDSIDHIDVEFIFHSGLVHYILLREVVSFEIFSTSKDKCLCKDMHNSVFQIAARTRPPDRSEYLGGPAQKGEHSDDEVSIGRIPSL
VRGYSLPQTLAPSLSGPISTWQRSSFDVLWTRGDFLFVGKYNRCGRFIVGIFKYSDASDLREDPSRSLITRLEPLVGRSLPSLSLSNVVAMSSSFSSNLGSDEDL
ARRLESELEEIENFRFSDDGEDSHASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAP
AQVAPNGKGACGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFRNLVLVRPVPELTQASFDTLKYYKERFPRGRKVETLVTNKLLLESGLLVYNPAV
RPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFTDRVDDPEA
RMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFAFVASIQSALAMKAELDGREVLAAREKEEFSAALEAASSAMKDELLKAHSEVEILKAEVETK
AELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSD
LQIDLGGLKKRYAEQWAWAQRHP
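
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below shows one way to check this, assuming the standard nuclear genetic code applies to this gene (an assumption, not stated on the page). The names `CODON_TABLE` and `translate` are hypothetical helpers written for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGAAAAGATACCTTCAATGTTCAAGTTG"))
print(translate("CAGCGGCATCCCTAG"))
```

Passing the full CDS text and comparing the result with the listed protein sequence (line breaks removed) would confirm that the two records agree.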