; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g30660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g30660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein FAF-like
Genome locationchr4:22983755..22986217
RNA-Seq ExpressionMoc04g30660
SyntenyMoc04g30660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021410 - The fantastic four family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579018.1 hypothetical protein SDJN03_23466, partial [Cucurbita argyrosperma subsp. sororia]6.7e-7255.9Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----
        M   K+ PV E+VRFSISGLKALISS+E   E G+E+  R+IRS G+GII SKLLT  SSSSSI SC LLMDDLIGTESGV LT +  EE +EK +    
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----

Query:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV
        D  ++  +     +QNQRC  +KQFPPPI  LA QAG RTR PW+LTR+ SD RL L LERV + Q MESHRENGRLIL  VP+P+P     D++DLQF+
Subjt:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV

Query:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADSKIITSRFPVTVSSL
        EED G        E+I+SIE EEGE  ++ E S    +SFTYGGEG+ GG+F DR+ FC V     +RHVV+ HF S PLRP+ DS ++  R   TV S+
Subjt:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADSKIITSRFPVTVSSL

Query:  DMLLEEMMQEAKDGAFGVIFQL
        D++L  MMQ+ KD AF  IF L
Subjt:  DMLLEEMMQEAKDGAFGVIFQL

KAG7016541.1 hypothetical protein SDJN02_21650, partial [Cucurbita argyrosperma subsp. argyrosperma]4.2e-6656.85Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----
        M   K+ PV E+VRFSISGLKALISS+E   E G+E+  R+IRS G+GII SKLLT  SSSSSI SC LLMDDLIGTESGV LT +  EE +EK +    
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----

Query:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV
        D  ++  +     +QNQRC  +KQFPPPI  LA QAG RTR PW+LTR+ SD RL L LERVR+ Q MESHRENGRLIL  VP+P+P     D++DLQF+
Subjt:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV

Query:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADSKIITSR
        EED G        E+I+SIE EEGE  ++ E S    +SFTYGGEG+ GG+F DR+ FC V     +RHVV+ HF S PLRP+ DS ++  R
Subjt:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADSKIITSR

XP_022141482.1 uncharacterized protein LOC111011861 [Momordica charantia]1.7e-147100Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV
        MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV

Query:  DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE
        DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE
Subjt:  DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE

Query:  KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM
        KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM
Subjt:  KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM

XP_022939162.1 uncharacterized protein LOC111445156 [Cucurbita moschata]9.3e-6657.69Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----
        M   K+ PV E+VRFSISGLKALISS+E   E G+EK  R+IRS G+GII SKLLT SSSSSSI SC LLMDDLIGTESGV LT +  EE +EK +    
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----

Query:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV
        D  ++  +     +QNQRC  +KQFPPPI  LA QAG RTR PW+LTR+ SD RL L LERV + Q MESHRENGRLIL  VP+P+P     D++DLQF+
Subjt:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV

Query:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADS
        EED G        E+I+SIE EEGE   + E S    +SFTYGGEG+ GG+F DR+ FC V     +RHVV+ HF S PLRP+  S
Subjt:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADS

XP_023551345.1 uncharacterized protein LOC111809192 [Cucurbita pepo subsp. pepo]4.3e-6356.01Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLT----------HSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEE
        M   K+ PV E+VRFSISGLKALISSEE   E G+EK  R+IRS G+GII SKLLT           SSSSSSI SC LLMDDLIGTESGV LT +  EE
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLT----------HSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEE

Query:  IDEKSSDYRYHRVDH----RDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDS
         +EK +   + R  +        +QNQRC  +KQFPPPI  LA QAG RTR PW+LTR+ SD RL L LERVR+ Q MESHRENGRLIL  VP+P+P   
Subjt:  IDEKSSDYRYHRVDH----RDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDS

Query:  EDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM
           ++DLQF+EED G        E+I+SIE  EGE      + + +SFTYGGEG+ GG+F DR+ FC V     +RHVV+GHF S PLRP+
Subjt:  EDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM

TrEMBL top hitse value%identityAlignment
A0A0A0KRM2 Uncharacterized protein2.0e-5049.49Show/hide
Query:  MQTHKRRPVPESVRFSIS-GLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSS----------------SSIRSC-LLMDDLIGTESGVCLTN
        +Q H R P P  + FSIS GLK+LISS++    +     +IRS G+ IIRS LLTHSSSS                SSIRSC   MDDLIGTESGVCLT+
Subjt:  MQTHKRRPVPESVRFSIS-GLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSS----------------SSIRSC-LLMDDLIGTESGVCLTN

Query:  SIAEEIDEKSSDYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQ-AGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPD
        +  E       D  Y+R D      QNQRC  +KQFPPPI F+A Q AG R R PWVLTR+ S+ RL L LERV   Q +ES RENGRLIL  VP     
Subjt:  SIAEEIDEKSSDYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQ-AGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPD

Query:  DSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM
        D  D D     +EE  G E      E +ESI+C+ GE     S+ + +S+TYGGEG GGG       FC       +RHVV+GHFGSAPLRPM
Subjt:  DSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM

A0A5D3DS64 Protein FAF-like1.9e-4047.39Show/hide
Query:  PESVRFSIS-GLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRVDHRDLDKQ
        P  + FSIS GLK+LISS++    +     LI S G+ IIRS   +             MDDLIGTESGVCLT++  E       D R +R +  +   Q
Subjt:  PESVRFSIS-GLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRVDHRDLDKQ

Query:  NQRCAAEKQFPPPISFLAAQ-AGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAE
        NQRC  +KQ+PPPI F+A Q AG R R PW+LTR+ S+ RL L LERV   Q +ES RENGRLIL  VP    +D +D  + L  +EE  G      D E
Subjt:  NQRCAAEKQFPPPISFLAAQ-AGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAE

Query:  DIESIECEEGEAIEEASDPAFKSFTYGGE-GIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM
         +ESIEC+ GE     S+ +FKS TYGGE G GGG       FC V     +RHVV+GH GSAPLRPM
Subjt:  DIESIECEEGEAIEEASDPAFKSFTYGGE-GIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPM

A0A6J1CJZ0 uncharacterized protein LOC1110118618.1e-148100Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV
        MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV

Query:  DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE
        DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE
Subjt:  DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGRE

Query:  KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM
        KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM
Subjt:  KRTEDAEDIESIECEEGEAIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPM

A0A6J1FG13 uncharacterized protein LOC1114451564.5e-6657.69Show/hide
Query:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----
        M   K+ PV E+VRFSISGLKALISS+E   E G+EK  R+IRS G+GII SKLLT SSSSSSI SC LLMDDLIGTESGV LT +  EE +EK +    
Subjt:  MQTHKRRPVPESVRFSISGLKALISSEE--GEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSC-LLMDDLIGTESGVCLTNSIAEEIDEKSS----

Query:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV
        D  ++  +     +QNQRC  +KQFPPPI  LA QAG RTR PW+LTR+ SD RL L LERV + Q MESHRENGRLIL  VP+P+P     D++DLQF+
Subjt:  DYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFV

Query:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADS
        EED G        E+I+SIE EEGE   + E S    +SFTYGGEG+ GG+F DR+ FC V     +RHVV+ HF S PLRP+  S
Subjt:  EEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHVVNGHFGSAPLRPMADS

A0A6J1JWW3 uncharacterized protein LOC1114890656.6e-4151.17Show/hide
Query:  MDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV----DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMES
        MDDLIGTE+ V LT++  EE +EK +   + R     +     + NQRC  +KQFPPPI  LA QAG +TR PW+LTR+ SD RL L LERV + Q MES
Subjt:  MDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRV----DHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMES

Query:  HRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHV
        HRENGRLIL  VP+P+P      ++DLQF+EED G        E+I+SIE E GE   + E S    +SFTYGG+G+  G+F DR+ FC V     +RHV
Subjt:  HRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGE--AIEEASDPAFKSFTYGGEGIGGGMFRDRQPFCSV-----DRHV

Query:  VNGHFGSAPLRPM
        V+GHF S PLRP+
Subjt:  VNGHFGSAPLRPM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22110.1 structural constituent of ribosome1.8e-1134.05Show/hide
Query:  IRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNS------IAEEIDEKSSDYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTR
        + S +L+ +SSSSS      + D IGTES   + ++      ++   +   S +RY     R  +++  R AA ++FPPPI  LA         PWVL R
Subjt:  IRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNS------IAEEIDEKSSDYRYHRVDHRDLDKQNQRCAAEKQFPPPISFLAAQAGPRTRPPWVLTR

Query:  -HCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASD
           SDGRL L  E+VRH +Y  ++R NGRL L  V  PL DD  D  ++    + D   +    D ED    EC++ +  E   D
Subjt:  -HCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAIEEASD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAACGCACAAACGCCGTCCAGTTCCCGAATCAGTACGATTTTCGATTTCTGGACTCAAAGCCCTAATTTCTTCGGAAGAAGGGGAAGAAGGAGAAGAAAAGGCACA
ACGACTTATTCGCAGTGCTGGAGTTGGCATTATCAGGTCAAAACTGTTGACTCATTCTTCTTCTTCTTCTTCGATCAGATCTTGTTTGTTAATGGACGATTTGATAGGCA
CTGAGAGCGGTGTTTGCTTGACCAATTCGATTGCAGAAGAAATCGATGAGAAATCCTCCGATTATCGTTATCATCGCGTTGATCACCGCGATCTTGATAAGCAAAATCAG
CGATGTGCGGCGGAAAAGCAGTTTCCACCGCCGATTTCTTTCCTAGCAGCACAGGCAGGGCCTCGAACGCGGCCGCCGTGGGTTTTAACCAGACATTGCTCCGATGGAAG
ATTAACTTTGACGCTGGAGAGAGTGAGGCACCTCCAGTACATGGAATCGCACCGGGAAAATGGGCGACTCATCCTGAAATTCGTTCCGGCGCCGTTACCCGACGATTCTG
AGGACGACGATCGAGATCTCCAATTCGTCGAAGAAGACGGAGGAAGAGAGAAGCGAACGGAGGACGCGGAGGACATCGAATCAATAGAATGCGAAGAAGGCGAAGCAATC
GAGGAGGCGTCAGATCCGGCTTTCAAGAGTTTTACGTACGGTGGCGAGGGCATCGGCGGTGGAATGTTTCGTGATCGGCAACCGTTTTGCAGTGTGGATCGACATGTCGT
CAATGGACACTTCGGTTCAGCGCCTCTTCGTCCGATGGCTGACTCCAAAATCATAACGTCTCGGTTCCCGGTGACGGTATCATCTCTCGATATGCTTCTTGAAGAGATGA
TGCAGGAAGCCAAAGATGGAGCCTTTGGTGTCATATTCCAACTGAATGTGATTTCAACAACTTGTGAAAGAGACCAGAGAGACCACACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAACGCACAAACGCCGTCCAGTTCCCGAATCAGTACGATTTTCGATTTCTGGACTCAAAGCCCTAATTTCTTCGGAAGAAGGGGAAGAAGGAGAAGAAAAGGCACA
ACGACTTATTCGCAGTGCTGGAGTTGGCATTATCAGGTCAAAACTGTTGACTCATTCTTCTTCTTCTTCTTCGATCAGATCTTGTTTGTTAATGGACGATTTGATAGGCA
CTGAGAGCGGTGTTTGCTTGACCAATTCGATTGCAGAAGAAATCGATGAGAAATCCTCCGATTATCGTTATCATCGCGTTGATCACCGCGATCTTGATAAGCAAAATCAG
CGATGTGCGGCGGAAAAGCAGTTTCCACCGCCGATTTCTTTCCTAGCAGCACAGGCAGGGCCTCGAACGCGGCCGCCGTGGGTTTTAACCAGACATTGCTCCGATGGAAG
ATTAACTTTGACGCTGGAGAGAGTGAGGCACCTCCAGTACATGGAATCGCACCGGGAAAATGGGCGACTCATCCTGAAATTCGTTCCGGCGCCGTTACCCGACGATTCTG
AGGACGACGATCGAGATCTCCAATTCGTCGAAGAAGACGGAGGAAGAGAGAAGCGAACGGAGGACGCGGAGGACATCGAATCAATAGAATGCGAAGAAGGCGAAGCAATC
GAGGAGGCGTCAGATCCGGCTTTCAAGAGTTTTACGTACGGTGGCGAGGGCATCGGCGGTGGAATGTTTCGTGATCGGCAACCGTTTTGCAGTGTGGATCGACATGTCGT
CAATGGACACTTCGGTTCAGCGCCTCTTCGTCCGATGGCTGACTCCAAAATCATAACGTCTCGGTTCCCGGTGACGGTATCATCTCTCGATATGCTTCTTGAAGAGATGA
TGCAGGAAGCCAAAGATGGAGCCTTTGGTGTCATATTCCAACTGAATGTGATTTCAACAACTTGTGAAAGAGACCAGAGAGACCACACTTAG
Protein sequenceShow/hide protein sequence
MQTHKRRPVPESVRFSISGLKALISSEEGEEGEEKAQRLIRSAGVGIIRSKLLTHSSSSSSIRSCLLMDDLIGTESGVCLTNSIAEEIDEKSSDYRYHRVDHRDLDKQNQ
RCAAEKQFPPPISFLAAQAGPRTRPPWVLTRHCSDGRLTLTLERVRHLQYMESHRENGRLILKFVPAPLPDDSEDDDRDLQFVEEDGGREKRTEDAEDIESIECEEGEAI
EEASDPAFKSFTYGGEGIGGGMFRDRQPFCSVDRHVVNGHFGSAPLRPMADSKIITSRFPVTVSSLDMLLEEMMQEAKDGAFGVIFQLNVISTTCERDQRDHT