; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:12785790..12788957
RNA-Seq ExpressionMoc09g14840
SyntenyMoc09g14840
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]8.7e-4638.28Show/hide
Query:  VAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK----KAPTLMAID
        ++I PIP+L+Q+TFD LKFYKD F  GRKI   +TDKLLL SGLLDYNPL+ P EA RPNSELAMVCGF+  VK+K  S+G A + K      P   A+D
Subjt:  VAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK----KAPTLMAID

Query:  -----------LPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRASPKKLKKKKKHQSPSSEDVLDEGRAELMPRPMLEEGFQVCKRPESAIQ
                         V+E+       +   +  + E LDVSPLREVR                        E +AEL+ R   +E  +   R   AI 
Subjt:  -----------LPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRASPKKLKKKKKHQSPSSEDVLDEGRAELMPRPMLEEGFQVCKRPESAIQ

Query:  RMLDYYANAAIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHASSELEVLRTKLELAE
        + L+         K K        +KE++  L AL+   A  G L                  AEL++ K                              
Subjt:  RMLDYYANAAIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHASSELEVLRTKLELAE

Query:  SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYLKDLDSE
         +L+NG LLE  FRQHPDFDGFAKDF DAGF FLMKG+    P  E+DL  ++  YA+KWAS PN T  P  +VD+Y++DLDS+
Subjt:  SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYLKDLDSE

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.0e-4645.82Show/hide
Query:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP
        APAQVAPNGWGVI +LA+LFWL                                    RK AGG +KGPTSIK WV KWF AS  WL KDES + F  VP
Subjt:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP

Query:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG---VASSSKKAPT
         RFGNLV+I P+P+L+Q++FD LK+YK+RF  GRK+   +TD+LLL SGLLDYNP + P E  RPNS LAMVC F+ GVK+K   +     A+ S K PT
Subjt:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG---VASSSKKAPT

Query:  LMAIDLPTE--VEVVEVHQDALTPKGIGTVQDQ--------EILDVSPLRE
           +   +E    V+E+      P      +DQ        E  DV PL E
Subjt:  LMAIDLPTE--VEVVEVHQDALTPKGIGTVQDQ--------EILDVSPLRE

XP_022152115.1 uncharacterized protein LOC111019905 [Momordica charantia]9.0e-4366.91Show/hide
Query:  LRKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLL
        +RK AGG +KGPTSIK WVGKWF AS  WLTKDES + F  +P RFGNLV+I PIP+L Q+TFD LKFYK+ F  GRKI   +TDKLLL SGLLDYNPL+
Subjt:  LRKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLL

Query:  IPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK
         P EA RPNSELAMVCGF+  VK+K  S+G A + K
Subjt:  IPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.1e-4852.76Show/hide
Query:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP
        APAQVAPNGWGVI +LA+LFWL                                    RK AGG +KGPTSIK WV KWF AS  WL KDES + F  VP
Subjt:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP

Query:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG----VASSSKKA
         RFGNLV+I P+P+L+Q++FD LK+YK+RF  GRK+   +TD+LLL SGLLDYNP + P E+ RPNSELAMVCGF+ GVK+K   +      A SSK A
Subjt:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG----VASSSKKA

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.2e-8041.13Show/hide
Query:  RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLI
        RK  GG +KGPTSIK WVGKWF AS  WL KDES + F  VP RFGNLV+I  IP+L+Q+TFD LK YKD F   RKI   +TDKLLL SGLLDYNPL+ 
Subjt:  RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLI

Query:  PSEAHRPNSELAMVCGFSQGVKQKRPSQG-----VASSSKKAPTL----------MAIDLPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRA
          EA RPNSELAMVCGF+  VK+K   +      V  +    PT+           +  +PT V  +++       K   + ++ E LDVSPL EVR   
Subjt:  PSEAHRPNSELAMVCGFSQGVKQKRPSQG-----VASSSKKAPTL----------MAIDLPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRA

Query:  SPKKLKKKKKHQSPSSE----DVLDEGRAELMPRP-------------------------------------MLEEGFQVCKRPESAIQRMLDYYANA--
        SP + ++KKK  S SSE      L    A+L+  P                                      L    +    P S +QR +D  A A  
Subjt:  SPKKLKKKKKHQSPSSE----DVLDEGRAELMPRP-------------------------------------MLEEGFQVCKRPESAIQRMLDYYANA--

Query:  -----AIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHA------SSELEVLRTKLEL
             A+MVK +LDGR+ L  KERE S AAL+ A  L+GEL +A+ E    +  ++VD A+++ +K +  +H   LR AHA        + ++L+ K +L
Subjt:  -----AIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHA------SSELEVLRTKLEL

Query:  AE--------------------SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYL
        A+                     +L+NG LLEE+FRQHPDFDGFAKDF DAGF FLMKG+    P  ++DL  ++  Y++KWAS PN TP PQ +VD+Y+
Subjt:  AE--------------------SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYL

Query:  KDLDSEAERNEGE
        ++LDS+    E E
Subjt:  KDLDSEAERNEGE

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124674.2e-4638.28Show/hide
Query:  VAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK----KAPTLMAID
        ++I PIP+L+Q+TFD LKFYKD F  GRKI   +TDKLLL SGLLDYNPL+ P EA RPNSELAMVCGF+  VK+K  S+G A + K      P   A+D
Subjt:  VAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK----KAPTLMAID

Query:  -----------LPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRASPKKLKKKKKHQSPSSEDVLDEGRAELMPRPMLEEGFQVCKRPESAIQ
                         V+E+       +   +  + E LDVSPLREVR                        E +AEL+ R   +E  +   R   AI 
Subjt:  -----------LPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRASPKKLKKKKKHQSPSSEDVLDEGRAELMPRPMLEEGFQVCKRPESAIQ

Query:  RMLDYYANAAIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHASSELEVLRTKLELAE
        + L+         K K        +KE++  L AL+   A  G L                  AEL++ K                              
Subjt:  RMLDYYANAAIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHASSELEVLRTKLELAE

Query:  SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYLKDLDSE
         +L+NG LLE  FRQHPDFDGFAKDF DAGF FLMKG+    P  E+DL  ++  YA+KWAS PN T  P  +VD+Y++DLDS+
Subjt:  SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYLKDLDSE

A0A6J1CR42 uncharacterized protein LOC1110138261.5e-4645.82Show/hide
Query:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP
        APAQVAPNGWGVI +LA+LFWL                                    RK AGG +KGPTSIK WV KWF AS  WL KDES + F  VP
Subjt:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP

Query:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG---VASSSKKAPT
         RFGNLV+I P+P+L+Q++FD LK+YK+RF  GRK+   +TD+LLL SGLLDYNP + P E  RPNS LAMVC F+ GVK+K   +     A+ S K PT
Subjt:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG---VASSSKKAPT

Query:  LMAIDLPTE--VEVVEVHQDALTPKGIGTVQDQ--------EILDVSPLRE
           +   +E    V+E+      P      +DQ        E  DV PL E
Subjt:  LMAIDLPTE--VEVVEVHQDALTPKGIGTVQDQ--------EILDVSPLRE

A0A6J1DD09 uncharacterized protein LOC1110199054.4e-4366.91Show/hide
Query:  LRKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLL
        +RK AGG +KGPTSIK WVGKWF AS  WLTKDES + F  +P RFGNLV+I PIP+L Q+TFD LKFYK+ F  GRKI   +TDKLLL SGLLDYNPL+
Subjt:  LRKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLL

Query:  IPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK
         P EA RPNSELAMVCGF+  VK+K  S+G A + K
Subjt:  IPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSK

A0A6J1DXS5 uncharacterized protein LOC1110255023.5e-4852.76Show/hide
Query:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP
        APAQVAPNGWGVI +LA+LFWL                                    RK AGG +KGPTSIK WV KWF AS  WL KDES + F  VP
Subjt:  APAQVAPNGWGVIVSLAVLFWL------------------------------------RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVP

Query:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG----VASSSKKA
         RFGNLV+I P+P+L+Q++FD LK+YK+RF  GRK+   +TD+LLL SGLLDYNP + P E+ RPNSELAMVCGF+ GVK+K   +      A SSK A
Subjt:  CRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQG----VASSSKKA

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-8041.13Show/hide
Query:  RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLI
        RK  GG +KGPTSIK WVGKWF AS  WL KDES + F  VP RFGNLV+I  IP+L+Q+TFD LK YKD F   RKI   +TDKLLL SGLLDYNPL+ 
Subjt:  RKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFDILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLI

Query:  PSEAHRPNSELAMVCGFSQGVKQKRPSQG-----VASSSKKAPTL----------MAIDLPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRA
          EA RPNSELAMVCGF+  VK+K   +      V  +    PT+           +  +PT V  +++       K   + ++ E LDVSPL EVR   
Subjt:  PSEAHRPNSELAMVCGFSQGVKQKRPSQG-----VASSSKKAPTL----------MAIDLPTEVEVVEVHQDALTPKGIGTVQDQEILDVSPLREVRRRA

Query:  SPKKLKKKKKHQSPSSE----DVLDEGRAELMPRP-------------------------------------MLEEGFQVCKRPESAIQRMLDYYANA--
        SP + ++KKK  S SSE      L    A+L+  P                                      L    +    P S +QR +D  A A  
Subjt:  SPKKLKKKKKHQSPSSE----DVLDEGRAELMPRP-------------------------------------MLEEGFQVCKRPESAIQRMLDYYANA--

Query:  -----AIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHA------SSELEVLRTKLEL
             A+MVK +LDGR+ L  KERE S AAL+ A  L+GEL +A+ E    +  ++VD A+++ +K +  +H   LR AHA        + ++L+ K +L
Subjt:  -----AIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARAEARPWKSTSKVDKAELESVKAKAARHLDLLRGAHA------SSELEVLRTKLEL

Query:  AE--------------------SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYL
        A+                     +L+NG LLEE+FRQHPDFDGFAKDF DAGF FLMKG+    P  ++DL  ++  Y++KWAS PN TP PQ +VD+Y+
Subjt:  AE--------------------SKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAP--ELDLKPIRLWYAKKWASSPNDTPIPQDVVDQYL

Query:  KDLDSEAERNEGE
        ++LDS+    E E
Subjt:  KDLDSEAERNEGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAAATTGCACAACGTCAATCTAGCTCGAACCCGGTCCTTGCTCCGACTCGAAACGTTAAGGAAGACCTACACAAGAGGGTAAAATCTCCAACGCTCAAGTCAGC
GATTACCTTGGTCGGATTTGAGATCGATCATGTTGGGCTCCAGCAGATCGAGCTCGGACATTTTCCTGAAAATGACGGGTTGACCCTCAATCGGAATGGCAAGTCTTCCA
TCTCAGAAGGTTCCGAGATGGAGCTCCGTTCTTCTAGTTTGAATTCTTCTAATAGTGTAGATCGGAATCGGAGCCTTTCCCTTGGAAAGGCCAGTTATGTCGAGGAATTT
GCTAATAGGCTAGATTCCGAATTAGAAGAAGAGATAGATAATTTCAGGTTTCCTGATGAGGATGAGGATGATAGTGCTCCTGCTCAAGTGGCGCCCAATGGATGGGGTGT
AATTGTTAGTTTAGCCGTATTGTTCTGGCTTAGGAAGAGTGCAGGAGGCACCCTCAAAGGTCCGACCTCAATAAAGAAGTGGGTCGGAAAATGGTTTTTGGCCTCCGAAG
CATGGCTGACTAAGGACGAGTCCGACCAGCGTTTCCACCACGTTCCCTGTAGGTTTGGGAACTTAGTTGCTATCTGGCCCATTCCTCAGCTCTCTCAATCTACTTTCGAC
ATTCTGAAGTTTTACAAGGATAGGTTTAAGAGTGGCAGGAAAATCAGTGACTTTCTAACCGACAAGCTTCTCTTAGCTTCAGGTCTGCTCGACTACAACCCACTGCTCAT
TCCTTCTGAGGCCCACAGACCCAACTCAGAGCTTGCGATGGTTTGCGGATTTTCTCAAGGCGTGAAACAAAAGCGCCCTAGCCAAGGGGTTGCCTCTAGCTCCAAGAAAG
CGCCCACCCTTATGGCTATCGACCTTCCTACTGAGGTCGAGGTGGTGGAGGTTCACCAGGATGCTCTCACTCCCAAGGGAATTGGCACCGTTCAAGACCAGGAGATTTTG
GACGTTTCTCCCCTCAGGGAGGTTCGGAGACGCGCCTCCCCTAAGAAGTTGAAGAAGAAGAAGAAGCATCAGTCCCCTTCTTCCGAGGACGTGCTGGACGAGGGCCGAGC
AGAACTCATGCCTCGACCGATGCTGGAGGAAGGCTTCCAAGTTTGTAAGCGCCCCGAGTCTGCTATCCAACGCATGCTGGACTACTATGCGAATGCAGCCATCATGGTGA
AGCCTAAGCTGGACGGGCGTGACCTTTTGACTGTGAAGGAACGAGAAGCCTCCTTGGCTGCATTGAAGGTTGCTGCAGCCTTGGAAGGGGAGCTCAAGGAGGCCCGAGCT
GAGGCCCGACCGTGGAAATCCACTTCGAAGGTTGACAAGGCCGAGCTCGAGAGTGTCAAGGCGAAGGCTGCTCGCCACCTGGATCTTCTGCGGGGGGCGCACGCGAGCTC
CGAGTTGGAGGTGCTGAGGACCAAGCTTGAGCTTGCGGAGTCAAAGCTCAGCAACGGAGTACTCTTGGAGGAAACTTTTCGCCAACATCCTGATTTTGACGGGTTCGCCA
AAGATTTCTATGATGCGGGCTTCACGTTCCTAATGAAGGGACTTAAGGAAATAGCTCCCGAGCTTGACCTCAAGCCAATCAGGCTTTGGTACGCCAAGAAGTGGGCTTCG
AGCCCAAATGATACTCCCATCCCCCAGGACGTGGTGGACCAGTACCTGAAAGATCTCGACTCTGAAGCCGAGCGCAATGAGGGCGAATATACCGGCTTCTCTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAAATTGCACAACGTCAATCTAGCTCGAACCCGGTCCTTGCTCCGACTCGAAACGTTAAGGAAGACCTACACAAGAGGGTAAAATCTCCAACGCTCAAGTCAGC
GATTACCTTGGTCGGATTTGAGATCGATCATGTTGGGCTCCAGCAGATCGAGCTCGGACATTTTCCTGAAAATGACGGGTTGACCCTCAATCGGAATGGCAAGTCTTCCA
TCTCAGAAGGTTCCGAGATGGAGCTCCGTTCTTCTAGTTTGAATTCTTCTAATAGTGTAGATCGGAATCGGAGCCTTTCCCTTGGAAAGGCCAGTTATGTCGAGGAATTT
GCTAATAGGCTAGATTCCGAATTAGAAGAAGAGATAGATAATTTCAGGTTTCCTGATGAGGATGAGGATGATAGTGCTCCTGCTCAAGTGGCGCCCAATGGATGGGGTGT
AATTGTTAGTTTAGCCGTATTGTTCTGGCTTAGGAAGAGTGCAGGAGGCACCCTCAAAGGTCCGACCTCAATAAAGAAGTGGGTCGGAAAATGGTTTTTGGCCTCCGAAG
CATGGCTGACTAAGGACGAGTCCGACCAGCGTTTCCACCACGTTCCCTGTAGGTTTGGGAACTTAGTTGCTATCTGGCCCATTCCTCAGCTCTCTCAATCTACTTTCGAC
ATTCTGAAGTTTTACAAGGATAGGTTTAAGAGTGGCAGGAAAATCAGTGACTTTCTAACCGACAAGCTTCTCTTAGCTTCAGGTCTGCTCGACTACAACCCACTGCTCAT
TCCTTCTGAGGCCCACAGACCCAACTCAGAGCTTGCGATGGTTTGCGGATTTTCTCAAGGCGTGAAACAAAAGCGCCCTAGCCAAGGGGTTGCCTCTAGCTCCAAGAAAG
CGCCCACCCTTATGGCTATCGACCTTCCTACTGAGGTCGAGGTGGTGGAGGTTCACCAGGATGCTCTCACTCCCAAGGGAATTGGCACCGTTCAAGACCAGGAGATTTTG
GACGTTTCTCCCCTCAGGGAGGTTCGGAGACGCGCCTCCCCTAAGAAGTTGAAGAAGAAGAAGAAGCATCAGTCCCCTTCTTCCGAGGACGTGCTGGACGAGGGCCGAGC
AGAACTCATGCCTCGACCGATGCTGGAGGAAGGCTTCCAAGTTTGTAAGCGCCCCGAGTCTGCTATCCAACGCATGCTGGACTACTATGCGAATGCAGCCATCATGGTGA
AGCCTAAGCTGGACGGGCGTGACCTTTTGACTGTGAAGGAACGAGAAGCCTCCTTGGCTGCATTGAAGGTTGCTGCAGCCTTGGAAGGGGAGCTCAAGGAGGCCCGAGCT
GAGGCCCGACCGTGGAAATCCACTTCGAAGGTTGACAAGGCCGAGCTCGAGAGTGTCAAGGCGAAGGCTGCTCGCCACCTGGATCTTCTGCGGGGGGCGCACGCGAGCTC
CGAGTTGGAGGTGCTGAGGACCAAGCTTGAGCTTGCGGAGTCAAAGCTCAGCAACGGAGTACTCTTGGAGGAAACTTTTCGCCAACATCCTGATTTTGACGGGTTCGCCA
AAGATTTCTATGATGCGGGCTTCACGTTCCTAATGAAGGGACTTAAGGAAATAGCTCCCGAGCTTGACCTCAAGCCAATCAGGCTTTGGTACGCCAAGAAGTGGGCTTCG
AGCCCAAATGATACTCCCATCCCCCAGGACGTGGTGGACCAGTACCTGAAAGATCTCGACTCTGAAGCCGAGCGCAATGAGGGCGAATATACCGGCTTCTCTTTTTAG
Protein sequenceShow/hide protein sequence
MQEIAQRQSSSNPVLAPTRNVKEDLHKRVKSPTLKSAITLVGFEIDHVGLQQIELGHFPENDGLTLNRNGKSSISEGSEMELRSSSLNSSNSVDRNRSLSLGKASYVEEF
ANRLDSELEEEIDNFRFPDEDEDDSAPAQVAPNGWGVIVSLAVLFWLRKSAGGTLKGPTSIKKWVGKWFLASEAWLTKDESDQRFHHVPCRFGNLVAIWPIPQLSQSTFD
ILKFYKDRFKSGRKISDFLTDKLLLASGLLDYNPLLIPSEAHRPNSELAMVCGFSQGVKQKRPSQGVASSSKKAPTLMAIDLPTEVEVVEVHQDALTPKGIGTVQDQEIL
DVSPLREVRRRASPKKLKKKKKHQSPSSEDVLDEGRAELMPRPMLEEGFQVCKRPESAIQRMLDYYANAAIMVKPKLDGRDLLTVKEREASLAALKVAAALEGELKEARA
EARPWKSTSKVDKAELESVKAKAARHLDLLRGAHASSELEVLRTKLELAESKLSNGVLLEETFRQHPDFDGFAKDFYDAGFTFLMKGLKEIAPELDLKPIRLWYAKKWAS
SPNDTPIPQDVVDQYLKDLDSEAERNEGEYTGFSF