; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g06090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g06090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:4360380..4366839
RNA-Seq ExpressionMoc10g06090
SyntenyMoc10g06090
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.6e-9591.79Show/hide
Query:  KGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSSKPATPA
        K   V+IRPVPELTQASFDTLKYYKE F RGRKVGTLVTDKLLLE GLLDYNPAV PIESSRPNSELAMVCG+ SNVK KSKG+AHALEAAQSSKP TPA
Subjt:  KGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSSKPATPA

Query:  VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMGGTSDVTTRFRVEPS
        VVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEV AR VLPASFADRVDDPEARMGGT DVTTRFRVEPS
Subjt:  VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMGGTSDVTTRFRVEPS

Query:  SSGVRDQ
        SSGVRDQ
Subjt:  SSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.3e-10175.46Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD--------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +V+QLLACFEAKRIAKKPGRFYMCARKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD--------------

Query:  -----------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYES
                                     VSIRPVPELTQASFDTLKYYKERF RGRKVGTLVTD+LLLE GLLDYNPAV PIE SRPNS LAMVC + S
Subjt:  -----------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYES

Query:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VK KSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.2e-14980.56Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQDLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWITLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQ LEYPS+IPEHYLGSLRRGFAIPENILLR+PEEGERADNP EGW+TLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQDLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWITLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD-------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +V+QLLACFEAKRIAKKPGRFYMCARKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD-------------------

Query:  ------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCK
                                VSIRPVPELTQASFDTLKYYKERF RGRKVGTLVTD+LLLE GLLDYNPAV PIESSRPNSELAMVCG+ S VK K
Subjt:  ------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]3.7e-9770.51Show/hide
Query:  MVCGYESNVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARV
        MVCG+ S+VK KSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A  
Subjt:  MVCGYESNVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARV

Query:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAR
        VLPASFADRVDDPEARMGGTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAR

Query:  EKEEFSAALEAASSTMKDELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKER
        EKEEFS                                                                        ALEAKD+EL+HATAELE AKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKER

Query:  LSNGALLEESFR
        LSNG LLEESFR
Subjt:  LSNGALLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.1e-12467.51Show/hide
Query:  FYMCARKGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ F R RK+ TLVTDKLLLE GLLDYNP V  IE+SRPNSELAMVCG+  +VK KSKGRAHAL+    +
Subjt:  FYMCARKGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSS

Query:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMG
        +P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  E  AR  LP S AD VDDPEARM 
Subjt:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMG

Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K 
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKERLSNGALLEESFRQHP
        ELLKA  EV+IL+AEV+AK++LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHP
Subjt:  ELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKERLSNGALLEESFRQHP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.2e-9591.79Show/hide
Query:  KGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSSKPATPA
        K   V+IRPVPELTQASFDTLKYYKE F RGRKVGTLVTDKLLLE GLLDYNPAV PIESSRPNSELAMVCG+ SNVK KSKG+AHALEAAQSSKP TPA
Subjt:  KGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSSKPATPA

Query:  VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMGGTSDVTTRFRVEPS
        VVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEV AR VLPASFADRVDDPEARMGGT DVTTRFRVEPS
Subjt:  VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMGGTSDVTTRFRVEPS

Query:  SSGVRDQ
        SSGVRDQ
Subjt:  SSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138263.5e-10175.46Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD--------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +V+QLLACFEAKRIAKKPGRFYMCARKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD--------------

Query:  -----------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYES
                                     VSIRPVPELTQASFDTLKYYKERF RGRKVGTLVTD+LLLE GLLDYNPAV PIE SRPNS LAMVC + S
Subjt:  -----------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYES

Query:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VK KSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1DXS5 uncharacterized protein LOC1110255023.5e-14980.56Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQDLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWITLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQ LEYPS+IPEHYLGSLRRGFAIPENILLR+PEEGERADNP EGW+TLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQDLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWITLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD-------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +V+QLLACFEAKRIAKKPGRFYMCARKGA                    
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGAD-------------------

Query:  ------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCK
                                VSIRPVPELTQASFDTLKYYKERF RGRKVGTLVTD+LLLE GLLDYNPAV PIESSRPNSELAMVCG+ S VK K
Subjt:  ------------------------VSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256061.8e-9770.51Show/hide
Query:  MVCGYESNVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARV
        MVCG+ S+VK KSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A  
Subjt:  MVCGYESNVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARV

Query:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAR
        VLPASFADRVDDPEARMGGTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAR

Query:  EKEEFSAALEAASSTMKDELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKER
        EKEEFS                                                                        ALEAKD+EL+HATAELE AKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKER

Query:  LSNGALLEESFR
        LSNG LLEESFR
Subjt:  LSNGALLEESFR

A0A6J1DZB3 uncharacterized protein LOC1110256651.0e-12467.51Show/hide
Query:  FYMCARKGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ F R RK+ TLVTDKLLLE GLLDYNP V  IE+SRPNSELAMVCG+  +VK KSKGRAHAL+    +
Subjt:  FYMCARKGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESSRPNSELAMVCGYESNVKCKSKGRAHALEAAQSS

Query:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMG
        +P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  E  AR  LP S AD VDDPEARM 
Subjt:  KPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLPASFADRVDDPEARMG

Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K 
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKERLSNGALLEESFRQHP
        ELLKA  EV+IL+AEV+AK++LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHP
Subjt:  ELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKERLSNGALLEESFRQHP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCCGACAAATCCTCAACGGCTCTTTAAGGATGATGTGAAGTGCGGCGTGATGTCACGGATTCTGTCGGAGAATAGACTGAGTCGAAGTGGTCCAACTCGAACTCG
GCCTCCGGACCGATCTGAAAACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCA
TCGTACCTGATCGTGGAGTCGGACCTCGGCCAGGTTCACCTCGGCCCTCATACATAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACT
CATGGGTCATCTTGGAGCACCAATAGGGGTCCTCCATGTGTCCAGGTTATTCTTTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCAATCTCGACCTGGCAGAGAAGT
TCATTCGACTTGTTTTGGACACGTGGCGACTTCCTATTCGTGGGAAGACACAACCGTTGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGCATCCTAGCC
GCTCGTTGATTACACGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGG
TTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGATTTGGAATACCCTTCTAAGATACCTGA
GCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGATCACTC
TCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGG
GGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTCGAGGTAAACCAACTCCTCGCGTGCTTCGAAGCGAAAAGGAT
AGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGACGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACA
AGGAGCGTTTTCTGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTTCGGGCTGCTAGATTACAACCCTGCAGTTCATCCCATTGAATCCTCA
AGGCCGAACTCCGAACTTGCCATGGTTTGCGGATATGAAAGCAACGTGAAGTGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCAC
CCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGG
ACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGAAGCTCGTGTGGTCTTGCCT
GCGAGCTTCGCAGATCGGGTGGATGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGT
GTCCCGCATCTCGGCTGCAAGTTTGGATCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGT
TTGTTGCTTCCATTCAATCGGCTTTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCT
TCCACCATGAAGGATGAGCTGCTGAAAGCTCACTTTGAGGTGGAAATTTTGAAGGCTGAGGTGGAAGCCAAGATCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGC
CCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAAGACGACATGTTCCAGGCGCTTGAAGCGAAGGATGAGGAGC
TGAAGCATGCGACTGCCGAGCTGGAGATGGCGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGACAACATCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCCCGACAAATCCTCAACGGCTCTTTAAGGATGATGTGAAGTGCGGCGTGATGTCACGGATTCTGTCGGAGAATAGACTGAGTCGAAGTGGTCCAACTCGAACTCG
GCCTCCGGACCGATCTGAAAACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCA
TCGTACCTGATCGTGGAGTCGGACCTCGGCCAGGTTCACCTCGGCCCTCATACATAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACT
CATGGGTCATCTTGGAGCACCAATAGGGGTCCTCCATGTGTCCAGGTTATTCTTTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCAATCTCGACCTGGCAGAGAAGT
TCATTCGACTTGTTTTGGACACGTGGCGACTTCCTATTCGTGGGAAGACACAACCGTTGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGCATCCTAGCC
GCTCGTTGATTACACGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGG
TTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGATTTGGAATACCCTTCTAAGATACCTGA
GCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGATCACTC
TCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGG
GGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTCGAGGTAAACCAACTCCTCGCGTGCTTCGAAGCGAAAAGGAT
AGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGACGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACA
AGGAGCGTTTTCTGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTTCGGGCTGCTAGATTACAACCCTGCAGTTCATCCCATTGAATCCTCA
AGGCCGAACTCCGAACTTGCCATGGTTTGCGGATATGAAAGCAACGTGAAGTGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCAC
CCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGG
ACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGAAGCTCGTGTGGTCTTGCCT
GCGAGCTTCGCAGATCGGGTGGATGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGT
GTCCCGCATCTCGGCTGCAAGTTTGGATCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGT
TTGTTGCTTCCATTCAATCGGCTTTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCT
TCCACCATGAAGGATGAGCTGCTGAAAGCTCACTTTGAGGTGGAAATTTTGAAGGCTGAGGTGGAAGCCAAGATCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGC
CCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAAGACGACATGTTCCAGGCGCTTGAAGCGAAGGATGAGGAGC
TGAAGCATGCGACTGCCGAGCTGGAGATGGCGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGACAACATCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MFPTNPQRLFKDDVKCGVMSRILSENRLSRSGPTRTRPPDRSENLGGPAQKGEHSDDQVSIGRIPSLVRGQKIIVPDRGVGPRPGSPRPSYIASVNASGGDLGGPSWGMT
HGSSWSTNRGPPCVQVILFPKHWPPLCLVQSRPGREVHSTCFGHVATSYSWEDTTVVGIFKYSDASDLREHPSRSLITRRSLPSLSLSNVVAMSSSFSSNLGSDEDLARR
LESELEEIENFRFSDDGEDSDASTSGQDLEYPSKIPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWITLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW
GVIFALAILFWLRARDSEEAELFEVNQLLACFEAKRIAKKPGRFYMCARKGADVSIRPVPELTQASFDTLKYYKERFLRGRKVGTLVTDKLLLEFGLLDYNPAVHPIESS
RPNSELAMVCGYESNVKCKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVEARVVLP
ASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAAS
STMKDELLKAHFEVEILKAEVEAKIELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMFQALEAKDEELKHATAELEMAKERLSNGALLEESFRQHPQAGS