; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:15868215..15876962
RNA-Seq ExpressionMoc06g20290
SyntenyMoc06g20290
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.2e-8958.11Show/hide
Query:  FEAKRIAKKPGEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL----
        F     + KP +  AK+ S        +   + +SIKPIPEL QA+FDTLKFYKD+FPRGRKIGTLVTDKLLLE GLLDYNPLVR I+ASRPNSEL    
Subjt:  FEAKRIAKKPGEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL----

Query:  ---GNVKRKSKGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCEMGGTSDVKMRFRMEPSSSGVKD
            +VKRKSKGRAHALK VQSSDPVT AVDQ A QDQAGPSS  PT VIELDSTGERSREKRSRSESEALDVSPL E+       ++   E   + ++ 
Subjt:  ---GNVKRKSKGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCEMGGTSDVKMRFRMEPSSSGVKD

Query:  QVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHAVEAELDVREALAAKERENSSAALEAA--------------TTLKGELLKSRSEVDILRAEVDLGN
          +   T  L++   +  K   D    L+R    A    L+  E  A KER  + A LEAA              +    + L      D+   EVDLG+
Subjt:  QVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHAVEAELDVREALAAKERENSSAALEAA--------------TTLKGELLKSRSEVDILRAEVDLGN

Query:  LKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVNLL
        LKKRYAEKWASGPNGT GPA+LVDKYVRDLDSDYSDLDED+ PSQEP EVGTTQE  PSQQ GSQEVNLL
Subjt:  LKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVNLL

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.8e-6761.67Show/hide
Query:  APNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GEWLAKDESGRPFHDVLVRFGN
        APNGWGVIFALAI FWLRARD +EAELL+V+QL  CFEAKRIAKKP                                GEWLAKDESGR F DV  RFGN
Subjt:  APNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GEWLAKDESGRPFHDVLVRFGN

Query:  LVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKGRAHALKTVQSSDPVTTAVDQ
        LVSI+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP VR I+ SRPNS L         VKRKSKGRAHAL+  QSS P T AV  
Subjt:  LVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKGRAHALKTVQSSDPVTTAVDQ

Query:  LAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD
               GP+SEDP  VIEL+S+G  SREKR R ++EA+D
Subjt:  LAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.3e-5750.33Show/hide
Query:  MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSSAALEAATTLK
        MGGT DV+ RFRMEPSSSGVKDQVSRIS +CLDRCL RASKFVSDPGSVLQRTID+A             V+AELD REALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSSAALEAATTLK

Query:  GELLKSRSEVDILRAEV-----------------------------------------------------------------------------------
        GELLK++ EV ILRAEV                                                                                   
Subjt:  GELLKSRSEVDILRAEV-----------------------------------------------------------------------------------

Query:  ----------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVN
                                    DL NLKK+Y+EKWASGPNGTPGP +LV KYVR+LDSDYSD++E+DAPSQEPNE+GTTQEE PSQQ GSQEVN
Subjt:  ----------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVN

Query:  LL
        LL
Subjt:  LL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.4e-7249.17Show/hide
Query:  SSPKPTDSGEDLALRLESELEEIENFRFSDDGEDSDTSPRA---------------------------------------------------------RT
        SS   ++   DLA RLES+LEEIEN R SDDGEDSD S                                                            R 
Subjt:  SSPKPTDSGEDLALRLESELEEIENFRFSDDGEDSDTSPRA---------------------------------------------------------RT

Query:  SSSPFCPRVSQPNWIGSCSSAPNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GE
           PF         +     APNGWGVIFALAI FWLRARD +EAEL +V+QL  CFEAKRIAKKP                                GE
Subjt:  SSSPFCPRVSQPNWIGSCSSAPNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GE

Query:  WLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKG
        WLAKDESGR F DV  RFGNLVSI+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP VR I++SRPNSEL         VKRKSKG
Subjt:  WLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKG

Query:  RAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD
        RAHAL+  QSS P T AV         GP+SEDP LVIEL+S+G  SREKR R ++EA+D
Subjt:  RAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.1e-11252.96Show/hide
Query:  GEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKS
        GEWLAKDESGR F DV  RFGNLVSIK IPEL QA+FDTLK YKDHFPR RKI TLVTDKLLLE GLLDYNPLVRLI+ASRPNSEL       G+VKRKS
Subjt:  GEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKS

Query:  KGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCE--------------------------------
        KGRAHALKTV  ++PVT  V +   Q  +GPSS  PT VIELD +G RS EKRSR ESEALDVSPL E                                
Subjt:  KGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCE--------------------------------

Query:  ----------MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSS
                  M GTS+V+MRF MEPSSSGVKDQVSRIS +CLDR L RASKFVSDPGSVLQRTID+              V+AELD REALAAKERENS 
Subjt:  ----------MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSS

Query:  AALEAATTLKGELLKSRSEVDILRAEV-------------------------------------------------------------------------
        AALEAATTLKGELLK++ EVDILRAEV                                                                         
Subjt:  AALEAATTLKGELLKSRSEVDILRAEV-------------------------------------------------------------------------

Query:  --------------------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAP
                                              DL  LKK+Y+EKWASGPNGTP P +LVDKYVR+LDSDYSD++E+DAPSQEP EVGTTQEE P
Subjt:  --------------------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAP

Query:  SQQGGS
        SQQGGS
Subjt:  SQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.1e-8958.11Show/hide
Query:  FEAKRIAKKPGEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL----
        F     + KP +  AK+ S        +   + +SIKPIPEL QA+FDTLKFYKD+FPRGRKIGTLVTDKLLLE GLLDYNPLVR I+ASRPNSEL    
Subjt:  FEAKRIAKKPGEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL----

Query:  ---GNVKRKSKGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCEMGGTSDVKMRFRMEPSSSGVKD
            +VKRKSKGRAHALK VQSSDPVT AVDQ A QDQAGPSS  PT VIELDSTGERSREKRSRSESEALDVSPL E+       ++   E   + ++ 
Subjt:  ---GNVKRKSKGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCEMGGTSDVKMRFRMEPSSSGVKD

Query:  QVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHAVEAELDVREALAAKERENSSAALEAA--------------TTLKGELLKSRSEVDILRAEVDLGN
          +   T  L++   +  K   D    L+R    A    L+  E  A KER  + A LEAA              +    + L      D+   EVDLG+
Subjt:  QVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHAVEAELDVREALAAKERENSSAALEAA--------------TTLKGELLKSRSEVDILRAEVDLGN

Query:  LKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVNLL
        LKKRYAEKWASGPNGT GPA+LVDKYVRDLDSDYSDLDED+ PSQEP EVGTTQE  PSQQ GSQEVNLL
Subjt:  LKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVNLL

A0A6J1CR42 uncharacterized protein LOC1110138268.7e-6861.67Show/hide
Query:  APNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GEWLAKDESGRPFHDVLVRFGN
        APNGWGVIFALAI FWLRARD +EAELL+V+QL  CFEAKRIAKKP                                GEWLAKDESGR F DV  RFGN
Subjt:  APNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GEWLAKDESGRPFHDVLVRFGN

Query:  LVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKGRAHALKTVQSSDPVTTAVDQ
        LVSI+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP VR I+ SRPNS L         VKRKSKGRAHAL+  QSS P T AV  
Subjt:  LVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKGRAHALKTVQSSDPVTTAVDQ

Query:  LAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD
               GP+SEDP  VIEL+S+G  SREKR R ++EA+D
Subjt:  LAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD

A0A6J1DF31 uncharacterized protein LOC1110199096.3e-5850.33Show/hide
Query:  MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSSAALEAATTLK
        MGGT DV+ RFRMEPSSSGVKDQVSRIS +CLDRCL RASKFVSDPGSVLQRTID+A             V+AELD REALAAKERENSSAALEAATTLK
Subjt:  MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSSAALEAATTLK

Query:  GELLKSRSEVDILRAEV-----------------------------------------------------------------------------------
        GELLK++ EV ILRAEV                                                                                   
Subjt:  GELLKSRSEVDILRAEV-----------------------------------------------------------------------------------

Query:  ----------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVN
                                    DL NLKK+Y+EKWASGPNGTPGP +LV KYVR+LDSDYSD++E+DAPSQEPNE+GTTQEE PSQQ GSQEVN
Subjt:  ----------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVN

Query:  LL
        LL
Subjt:  LL

A0A6J1DXS5 uncharacterized protein LOC1110255021.2e-7249.17Show/hide
Query:  SSPKPTDSGEDLALRLESELEEIENFRFSDDGEDSDTSPRA---------------------------------------------------------RT
        SS   ++   DLA RLES+LEEIEN R SDDGEDSD S                                                            R 
Subjt:  SSPKPTDSGEDLALRLESELEEIENFRFSDDGEDSDTSPRA---------------------------------------------------------RT

Query:  SSSPFCPRVSQPNWIGSCSSAPNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GE
           PF         +     APNGWGVIFALAI FWLRARD +EAEL +V+QL  CFEAKRIAKKP                                GE
Subjt:  SSSPFCPRVSQPNWIGSCSSAPNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKP--------------------------------GE

Query:  WLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKG
        WLAKDESGR F DV  RFGNLVSI+P+PELTQASFDTLK+YK+ FPRGRK+GTLVTD+LLLE GLLDYNP VR I++SRPNSEL         VKRKSKG
Subjt:  WLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKSKG

Query:  RAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD
        RAHAL+  QSS P T AV         GP+SEDP LVIEL+S+G  SREKR R ++EA+D
Subjt:  RAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256655.2e-11352.96Show/hide
Query:  GEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKS
        GEWLAKDESGR F DV  RFGNLVSIK IPEL QA+FDTLK YKDHFPR RKI TLVTDKLLLE GLLDYNPLVRLI+ASRPNSEL       G+VKRKS
Subjt:  GEWLAKDESGRPFHDVLVRFGNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSEL-------GNVKRKS

Query:  KGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCE--------------------------------
        KGRAHALKTV  ++PVT  V +   Q  +GPSS  PT VIELD +G RS EKRSR ESEALDVSPL E                                
Subjt:  KGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPTLVIELDSTGERSREKRSRSESEALDVSPLCE--------------------------------

Query:  ----------MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSS
                  M GTS+V+MRF MEPSSSGVKDQVSRIS +CLDR L RASKFVSDPGSVLQRTID+              V+AELD REALAAKERENS 
Subjt:  ----------MGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHA-------------VEAELDVREALAAKERENSS

Query:  AALEAATTLKGELLKSRSEVDILRAEV-------------------------------------------------------------------------
        AALEAATTLKGELLK++ EVDILRAEV                                                                         
Subjt:  AALEAATTLKGELLKSRSEVDILRAEV-------------------------------------------------------------------------

Query:  --------------------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAP
                                              DL  LKK+Y+EKWASGPNGTP P +LVDKYVR+LDSDYSD++E+DAPSQEP EVGTTQEE P
Subjt:  --------------------------------------DLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAP

Query:  SQQGGS
        SQQGGS
Subjt:  SQQGGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCATCTACCTCCTCGCTTCTCGCCCTGATGCCATGCCTGAGGTCGTCTGGGTCATTCCTCTTACACTGAAGAGTCTCGCGATCGTTTTCAGTATTGTTCGATACGG
TTACACTCGGAACGATCACTGGTCCTTCCGCGTAGCGGAGAGAGAATTCCACACTCCAATGCGGGGCCAACAGCGGTGTTCCATGTGGATACCTCTGATGGGTCAATGTG
GAGTGTTAGCTTTTGCCCGAGGAGTCAGGCTGGCTGAACATGCAGTGGAGGAAATTGATCCTCCACGTTCTCCTGAACCAGGGGGTTCTGGTTCGGTTGTAGGTTCACCA
AGGGGTCGTCAGCTCGGCGCCCTTCTCCCTGAGCCCCATTTGGGACTCCGTTCACGACCGGGCGATCGCCTACTGCTTGGAGCGTTGGGAGCCAGTCCCTTGAGGTCCCG
ACGACATGAGATCATCCTTGGGCTTCTCGAAGGAGAAGCCACGCGTGCCTCTAAATATCCAATCTGCGCAACGTTGGAGGGGGATGGACTTCTTTCCTCTTCGGCTTGGA
CGCGTTGGGCCTCCATTGAGCCCCCCCCCCCCCGGATTTCTCCTTCATCTTCTGACCCCGCCGCCCCCCTTCTCTTGTTTTTTCCGGCGAGGTGCAGCGGCTCCGGCGGC
GCCCTTCGGATCGAGCAGCAGCGCTGCGGTGCTCCTTCGAACAGCAACGACGACGTCCCGATCCAGCGGCGGCGCAGCCCCCCTCCTCGCAGCGGCGCACGGCGGACTGT
TACAGCGACGGACCGGCGACGCTCCCCGACGTTCCGACGGTCTGCAGTAGTGGCGGCGCGCCCCGGCGATCTGCAACGAACCAGCGCGGCCTCCTCCTCGCGGCGGCGCA
CGGCGGACGAGCGGTACAGCAGTGGAACAGCGGTGACTTGCTGCTACGACCTGACGAACAACGACTCTAGCGACGGGATTTGTACGTTACAGCGGTGTTTAGGATGTTTA
GCGGTGGCCCACTTCCGTTCGAAGCTCGATTTAGGCTACCCACACCTTGGCGAGTTAGATCTAGGTGACCCACATCTATACGAAGGTGAGGTCGACGCAAAACTTAAGGG
CGAGGTTGTGGAAAGTTGTCAAGGTTGTGAAAGCATGCTGAACTGTACTTGTGAGTTGAAAGATGCAGGGTTAATGGGTCGATACGAGGAGTCCTTTGGAGGGAAGACTA
TTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAATGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTCTGGGTATAAA
GGTCAGGGGCCGACCCTACTCGAAGGAAATAGCTGTGATAAGCGTAGAGGTACTGGAAATGTTAGGGAGACTGGAGGTTCCGAGCAGATCGAGCCCCAGCCAGGTCGAAT
CTCGACATCTATACTTAGCCTTTTTCAATGGGCGAACCCGGTCTCCTCGGTAGGGCCGAGGTCAAACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAACCAGCTC
GAACCTTCTGTAGTAGTGATAGCCTAGGTAGTGCAGGTCGGACTGTAAGTAGTTCGTCCCCCAAACCAACTGATTCTGGGGAGGACTTAGCCCTTAGGTTAGAGTCCGAG
CTGGAAGAGATAGAGAACTTTAGGTTTTCGGATGATGGTGAGGATAGCGACACTTCACCTCGGGCCAGGACTTCCTCTTCACCCTTTTGCCCAAGAGTTTCTCAACCGAA
CTGGATTGGCTCCTGCTCAAGTGCCCCCAACGGATGGGGCGTCATTTTTGCGTTGGCCATCTTTTTCTGGTTGCGAGCTCGGGACGTGGATGAGGCCGAGCTGCTGAACG
TTGAGCAGCTTTTTGGATGCTTCGAAGCCAAAAGGATAGCTAAGAAGCCTGGTGAATGGCTGGCAAAGGACGAATCAGGTCGTCCCTTCCATGACGTGCTTGTTAGGTTT
GGGAACCTAGTGTCGATCAAACCGATTCCCGAGCTCACACAAGCCTCTTTTGACACTCTCAAGTTTTACAAAGACCATTTCCCAAGGGGTCGGAAGATCGGAACCTTGGT
GACTGACAAACTGCTCCTAGAGTTGGGGCTGTTGGACTACAACCCTTTAGTTCGTCTCATCAAAGCTTCTAGGCCAAACTCCGAACTCGGTAACGTGAAGCGCAAGTCTA
AAGGTCGTGCTCACGCCCTTAAGACTGTTCAAAGCTCTGATCCAGTGACTACTGCTGTGGATCAACTTGCGGTTCAGGACCAGGCTGGGCCATCCTCTGAAGATCCAACT
CTGGTGATCGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAATCTGAGGCGCTAGACGTGTCGCCTCTTTGCGAGATGGGGGGGACGTCCGA
CGTAAAGATGCGGTTCCGAATGGAACCGTCAAGCTCCGGGGTGAAGGACCAAGTGTCACGCATCTCGACTTCATGCTTGGATCGCTGTCTCATAAGAGCATCCAAGTTTG
TGAGCGATCCGGGGTCCGTGCTGCAACGGACTATTGACCACGCTGTCGAGGCTGAGCTGGATGTAAGGGAGGCCTTGGCAGCGAAAGAGAGGGAGAATTCTTCTGCTGCC
TTAGAGGCTGCCACTACGCTCAAAGGCGAGCTGCTGAAGTCTCGGAGCGAGGTGGACATTTTGAGGGCCGAGGTCGACCTCGGCAATCTAAAGAAGAGGTATGCTGAGAA
ATGGGCTTCTGGGCCCAACGGCACTCCAGGTCCTGCAACCCTGGTGGACAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTGGACGAAGACGATGCCCCTAGTC
AGGAGCCTAACGAGGTCGGCACTACCCAAGAGGAAGCTCCTTCGCAGCAGGGCGGATCTCAGGAGGTCAACCTTCTGGTTCTCAGGGCGAGCTATCCTCTCATCTCGGGA
GCAGCTGAGATTCCTCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCATCTACCTCCTCGCTTCTCGCCCTGATGCCATGCCTGAGGTCGTCTGGGTCATTCCTCTTACACTGAAGAGTCTCGCGATCGTTTTCAGTATTGTTCGATACGG
TTACACTCGGAACGATCACTGGTCCTTCCGCGTAGCGGAGAGAGAATTCCACACTCCAATGCGGGGCCAACAGCGGTGTTCCATGTGGATACCTCTGATGGGTCAATGTG
GAGTGTTAGCTTTTGCCCGAGGAGTCAGGCTGGCTGAACATGCAGTGGAGGAAATTGATCCTCCACGTTCTCCTGAACCAGGGGGTTCTGGTTCGGTTGTAGGTTCACCA
AGGGGTCGTCAGCTCGGCGCCCTTCTCCCTGAGCCCCATTTGGGACTCCGTTCACGACCGGGCGATCGCCTACTGCTTGGAGCGTTGGGAGCCAGTCCCTTGAGGTCCCG
ACGACATGAGATCATCCTTGGGCTTCTCGAAGGAGAAGCCACGCGTGCCTCTAAATATCCAATCTGCGCAACGTTGGAGGGGGATGGACTTCTTTCCTCTTCGGCTTGGA
CGCGTTGGGCCTCCATTGAGCCCCCCCCCCCCCGGATTTCTCCTTCATCTTCTGACCCCGCCGCCCCCCTTCTCTTGTTTTTTCCGGCGAGGTGCAGCGGCTCCGGCGGC
GCCCTTCGGATCGAGCAGCAGCGCTGCGGTGCTCCTTCGAACAGCAACGACGACGTCCCGATCCAGCGGCGGCGCAGCCCCCCTCCTCGCAGCGGCGCACGGCGGACTGT
TACAGCGACGGACCGGCGACGCTCCCCGACGTTCCGACGGTCTGCAGTAGTGGCGGCGCGCCCCGGCGATCTGCAACGAACCAGCGCGGCCTCCTCCTCGCGGCGGCGCA
CGGCGGACGAGCGGTACAGCAGTGGAACAGCGGTGACTTGCTGCTACGACCTGACGAACAACGACTCTAGCGACGGGATTTGTACGTTACAGCGGTGTTTAGGATGTTTA
GCGGTGGCCCACTTCCGTTCGAAGCTCGATTTAGGCTACCCACACCTTGGCGAGTTAGATCTAGGTGACCCACATCTATACGAAGGTGAGGTCGACGCAAAACTTAAGGG
CGAGGTTGTGGAAAGTTGTCAAGGTTGTGAAAGCATGCTGAACTGTACTTGTGAGTTGAAAGATGCAGGGTTAATGGGTCGATACGAGGAGTCCTTTGGAGGGAAGACTA
TTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAATGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTCTGGGTATAAA
GGTCAGGGGCCGACCCTACTCGAAGGAAATAGCTGTGATAAGCGTAGAGGTACTGGAAATGTTAGGGAGACTGGAGGTTCCGAGCAGATCGAGCCCCAGCCAGGTCGAAT
CTCGACATCTATACTTAGCCTTTTTCAATGGGCGAACCCGGTCTCCTCGGTAGGGCCGAGGTCAAACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAACCAGCTC
GAACCTTCTGTAGTAGTGATAGCCTAGGTAGTGCAGGTCGGACTGTAAGTAGTTCGTCCCCCAAACCAACTGATTCTGGGGAGGACTTAGCCCTTAGGTTAGAGTCCGAG
CTGGAAGAGATAGAGAACTTTAGGTTTTCGGATGATGGTGAGGATAGCGACACTTCACCTCGGGCCAGGACTTCCTCTTCACCCTTTTGCCCAAGAGTTTCTCAACCGAA
CTGGATTGGCTCCTGCTCAAGTGCCCCCAACGGATGGGGCGTCATTTTTGCGTTGGCCATCTTTTTCTGGTTGCGAGCTCGGGACGTGGATGAGGCCGAGCTGCTGAACG
TTGAGCAGCTTTTTGGATGCTTCGAAGCCAAAAGGATAGCTAAGAAGCCTGGTGAATGGCTGGCAAAGGACGAATCAGGTCGTCCCTTCCATGACGTGCTTGTTAGGTTT
GGGAACCTAGTGTCGATCAAACCGATTCCCGAGCTCACACAAGCCTCTTTTGACACTCTCAAGTTTTACAAAGACCATTTCCCAAGGGGTCGGAAGATCGGAACCTTGGT
GACTGACAAACTGCTCCTAGAGTTGGGGCTGTTGGACTACAACCCTTTAGTTCGTCTCATCAAAGCTTCTAGGCCAAACTCCGAACTCGGTAACGTGAAGCGCAAGTCTA
AAGGTCGTGCTCACGCCCTTAAGACTGTTCAAAGCTCTGATCCAGTGACTACTGCTGTGGATCAACTTGCGGTTCAGGACCAGGCTGGGCCATCCTCTGAAGATCCAACT
CTGGTGATCGAGTTGGATTCTACTGGGGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAATCTGAGGCGCTAGACGTGTCGCCTCTTTGCGAGATGGGGGGGACGTCCGA
CGTAAAGATGCGGTTCCGAATGGAACCGTCAAGCTCCGGGGTGAAGGACCAAGTGTCACGCATCTCGACTTCATGCTTGGATCGCTGTCTCATAAGAGCATCCAAGTTTG
TGAGCGATCCGGGGTCCGTGCTGCAACGGACTATTGACCACGCTGTCGAGGCTGAGCTGGATGTAAGGGAGGCCTTGGCAGCGAAAGAGAGGGAGAATTCTTCTGCTGCC
TTAGAGGCTGCCACTACGCTCAAAGGCGAGCTGCTGAAGTCTCGGAGCGAGGTGGACATTTTGAGGGCCGAGGTCGACCTCGGCAATCTAAAGAAGAGGTATGCTGAGAA
ATGGGCTTCTGGGCCCAACGGCACTCCAGGTCCTGCAACCCTGGTGGACAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTGGACGAAGACGATGCCCCTAGTC
AGGAGCCTAACGAGGTCGGCACTACCCAAGAGGAAGCTCCTTCGCAGCAGGGCGGATCTCAGGAGGTCAACCTTCTGGTTCTCAGGGCGAGCTATCCTCTCATCTCGGGA
GCAGCTGAGATTCCTCATTGA
Protein sequenceShow/hide protein sequence
MVIYLLASRPDAMPEVVWVIPLTLKSLAIVFSIVRYGYTRNDHWSFRVAEREFHTPMRGQQRCSMWIPLMGQCGVLAFARGVRLAEHAVEEIDPPRSPEPGGSGSVVGSP
RGRQLGALLPEPHLGLRSRPGDRLLLGALGASPLRSRRHEIILGLLEGEATRASKYPICATLEGDGLLSSSAWTRWASIEPPPPRISPSSSDPAAPLLLFFPARCSGSGG
ALRIEQQRCGAPSNSNDDVPIQRRRSPPPRSGARRTVTATDRRRSPTFRRSAVVAARPGDLQRTSAASSSRRRTADERYSSGTAVTCCYDLTNNDSSDGICTLQRCLGCL
AVAHFRSKLDLGYPHLGELDLGDPHLYEGEVDAKLKGEVVESCQGCESMLNCTCELKDAGLMGRYEESFGGKTIGALGINGQGPIDGEVIGASGINGRGPMQQGRGSGYK
GQGPTLLEGNSCDKRRGTGNVRETGGSEQIEPQPGRISTSILSLFQWANPVSSVGPRSNLTFPEFLEFDLKPARTFCSSDSLGSAGRTVSSSSPKPTDSGEDLALRLESE
LEEIENFRFSDDGEDSDTSPRARTSSSPFCPRVSQPNWIGSCSSAPNGWGVIFALAIFFWLRARDVDEAELLNVEQLFGCFEAKRIAKKPGEWLAKDESGRPFHDVLVRF
GNLVSIKPIPELTQASFDTLKFYKDHFPRGRKIGTLVTDKLLLELGLLDYNPLVRLIKASRPNSELGNVKRKSKGRAHALKTVQSSDPVTTAVDQLAVQDQAGPSSEDPT
LVIELDSTGERSREKRSRSESEALDVSPLCEMGGTSDVKMRFRMEPSSSGVKDQVSRISTSCLDRCLIRASKFVSDPGSVLQRTIDHAVEAELDVREALAAKERENSSAA
LEAATTLKGELLKSRSEVDILRAEVDLGNLKKRYAEKWASGPNGTPGPATLVDKYVRDLDSDYSDLDEDDAPSQEPNEVGTTQEEAPSQQGGSQEVNLLVLRASYPLISG
AAEIPH