CuGenDBv2

Moc04g21540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g21540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:15683996..15687643
RNA-Seq Expression: Moc04g21540
Synteny: Moc04g21540
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
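The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the locus. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr4:15683996..15687643")
print(chrom, start, end, span)  # chr4 15683996 15687643 3648
```

Under that assumption the locus spans 3648 bp on chr4.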


Homology
GenBank top hits | e value | % identity | Alignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] | 1.1e-74 | 57.14
Query:  MRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAME----------MRSILAREAISA-----------------TISTKRE
        MRTQM TME+MY+EMVQAAG  SRSE+R A +++HEQ G HL  V +  PE  E          +R  L R+  S+                   S+   
Subjt:  MRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAME----------MRSILAREAISA-----------------TISTKRE

Query:  AFPEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFH
          PEG+ITREEFDQLKSKFDAQVEALKA+CEKKE+ FDDGDLGESPFTS+ILEA IP KFKTPTMKPYD SKDPKDYVEVFEGLMDFQAATDAIKC  F 
Subjt:  AFPEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFH

Query:  IALTGSACLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEA
        IALTGSA LWYRRL                                                   EEQLKV HCSD SAMCYFLT LADETLTVKL EEA
Subjt:  IALTGSACLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEA

Query:  PATFAEVL
        PATF EVL
Subjt:  PATFAEVL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 3.4e-76 | 50.66
Query:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR
        MVQPANSTNTADRRALA N+G QREV A+  E Q  E L TEPL R ARITT VLPPAHPKPSK                                    
Subjt:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR

Query:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALKAR
              E  YN +                                 TP                           G+ITREEFDQLKSKFDAQVEALKAR
Subjt:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALKAR

Query:  CEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL---------------
        CEKKE+ FDDGDLGE  F+S+ILEA IPPKFKTPTMKPYD SKDPKDYVEVFE LMDFQAATDAIKC AF IALTGSA LWYRRL               
Subjt:  CEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL---------------

Query:  -----------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
                                            EEQLKV HCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVL
Subjt:  -----------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia] | 1.5e-63 | 73.41
Query:  PEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIA
        P G+ITREEFDQL+ + DAQVEALKA+CE+K+   +DGDLGE PFTS++LEAPIPPKFK PT+KPYD +KDPKDYVEVFEGLMDFQAA+DAIKC AF IA
Subjt:  PEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIA

Query:  LTGSACLWYRRL-----------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
        LTGSA LWYRRL                  EEQLKV HCSDDSAMCYF TGLADE LTVKLGEEAP TFAEVL
Subjt:  LTGSACLWYRRL-----------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] | 3.6e-94 | 70.03
Query:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR
        MVQP +STNT DRRAL  N+G QREV A+  E Q+ EGL TEP  R ARITT  L PAHPKP K NRG+ GASRRTT GAAPAP++ENFD LQKEMEAMR
Subjt:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR

Query:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGL--HLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALK
        TQM TMEEMYNEMVQA G GSRSEDRAA D   E+G L  HL     S+       S   + +     S+     PEG+ITREEFDQLKSKFDAQVE LK
Subjt:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGL--HLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALK

Query:  ARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL
        ARCE K + FDDGDLGESPFTS+ILEA IP KFKTPTMKPYD SKDPKDYVEVFEGLM FQAATDAIK  AF IALT SA LWYRRL
Subjt:  ARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] | 3.8e-11 | 79.59
Query:  WYRRLSEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
        W++   EEQLKV H SDDSA+CYFLT L DETLTVKLGEEAPATFAEVL
Subjt:  WYRRLSEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] | 1.8e-80 | 59.8
Query:  MEAMRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAF-----------------PEGMI
        MEAMRTQMRTMEEMYN+MVQ AG  SRS D+  +++VHEQG LH D VDE      ++R  L R+  S+    +   +                 PEG+I
Subjt:  MEAMRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAF-----------------PEGMI

Query:  TREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSA
        TREEF+QLKSKFDAQVEALK RCEKKE+ FDDGDLGESPFTS+ILEA IPPKFKTPTMK YD SKDPKDYVEVFEGLMDFQAATDAIKC AF IALTGSA
Subjt:  TREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSA

Query:  CLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEV
         LWYRRL                                                   EEQLKVVHCSDDS+MCYFLTGLADET TVKLGEEA ATFAEV
Subjt:  CLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEV

Query:  L
        L
Subjt:  L

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DDW5 uncharacterized protein LOC111019634 | 5.3e-75 | 57.14
Query:  MRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAME----------MRSILAREAISA-----------------TISTKRE
        MRTQM TME+MY+EMVQAAG  SRSE+R A +++HEQ G HL  V +  PE  E          +R  L R+  S+                   S+   
Subjt:  MRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAME----------MRSILAREAISA-----------------TISTKRE

Query:  AFPEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFH
          PEG+ITREEFDQLKSKFDAQVEALKA+CEKKE+ FDDGDLGESPFTS+ILEA IP KFKTPTMKPYD SKDPKDYVEVFEGLMDFQAATDAIKC  F 
Subjt:  AFPEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFH

Query:  IALTGSACLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEA
        IALTGSA LWYRRL                                                   EEQLKV HCSD SAMCYFLT LADETLTVKL EEA
Subjt:  IALTGSACLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEA

Query:  PATFAEVL
        PATF EVL
Subjt:  PATFAEVL

A0A6J1DHB3 uncharacterized protein LOC111020479 | 1.7e-76 | 50.66
Query:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR
        MVQPANSTNTADRRALA N+G QREV A+  E Q  E L TEPL R ARITT VLPPAHPKPSK                                    
Subjt:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR

Query:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALKAR
              E  YN +                                 TP                           G+ITREEFDQLKSKFDAQVEALKAR
Subjt:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALKAR

Query:  CEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL---------------
        CEKKE+ FDDGDLGE  F+S+ILEA IPPKFKTPTMKPYD SKDPKDYVEVFE LMDFQAATDAIKC AF IALTGSA LWYRRL               
Subjt:  CEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL---------------

Query:  -----------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
                                            EEQLKV HCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVL
Subjt:  -----------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

A0A6J1DXR9 uncharacterized protein LOC111025109 | 7.2e-64 | 73.41
Query:  PEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIA
        P G+ITREEFDQL+ + DAQVEALKA+CE+K+   +DGDLGE PFTS++LEAPIPPKFK PT+KPYD +KDPKDYVEVFEGLMDFQAA+DAIKC AF IA
Subjt:  PEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIA

Query:  LTGSACLWYRRL-----------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
        LTGSA LWYRRL                  EEQLKV HCSDDSAMCYF TGLADE LTVKLGEEAP TFAEVL
Subjt:  LTGSACLWYRRL-----------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

A0A6J1DZJ1 uncharacterized protein LOC111025738 | 1.8e-94 | 70.03
Query:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR
        MVQP +STNT DRRAL  N+G QREV A+  E Q+ EGL TEP  R ARITT  L PAHPKP K NRG+ GASRRTT GAAPAP++ENFD LQKEMEAMR
Subjt:  MVQPANSTNTADRRALAGNNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMR

Query:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGL--HLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALK
        TQM TMEEMYNEMVQA G GSRSEDRAA D   E+G L  HL     S+       S   + +     S+     PEG+ITREEFDQLKSKFDAQVE LK
Subjt:  TQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGL--HLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALK

Query:  ARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL
        ARCE K + FDDGDLGESPFTS+ILEA IP KFKTPTMKPYD SKDPKDYVEVFEGLM FQAATDAIK  AF IALT SA LWYRRL
Subjt:  ARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRL

A0A6J1DZJ1 uncharacterized protein LOC111025738 | 1.9e-11 | 79.59
Query:  WYRRLSEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
        W++   EEQLKV H SDDSA+CYFLT L DETLTVKLGEEAPATFAEVL
Subjt:  WYRRLSEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL

A0A6J1DZJ1 uncharacterized protein LOC111025738 | 8.5e-81 | 59.8
Query:  MEAMRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAF-----------------PEGMI
        MEAMRTQMRTMEEMYN+MVQ AG  SRS D+  +++VHEQG LH D VDE      ++R  L R+  S+    +   +                 PEG+I
Subjt:  MEAMRTQMRTMEEMYNEMVQAAGVGSRSEDRAAYDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAF-----------------PEGMI

Query:  TREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSA
        TREEF+QLKSKFDAQVEALK RCEKKE+ FDDGDLGESPFTS+ILEA IPPKFKTPTMK YD SKDPKDYVEVFEGLMDFQAATDAIKC AF IALTGSA
Subjt:  TREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKPYDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSA

Query:  CLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEV
         LWYRRL                                                   EEQLKVVHCSDDS+MCYFLTGLADET TVKLGEEA ATFAEV
Subjt:  CLWYRRL--------------------------------------------------SEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEV

Query:  L
        L
Subjt:  L

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGTCCGCCCAAGTATTCAGATCGGTCCACTTCAAACCTGCC
GAGGAGATCGGATTTGTCCATAAACAAGGCTAAGTGTAAAAAGTCGTGGTTCGACTATGACGGGTTCGATCTGCTCAAACCCGACAAGTTCGACATGAAAAAAGACAAAG
GTTCACTCCACGATTCGGGTCGAGTTGGTGATCGAGTTCGAGCTAGATATACAAAGATGGTTCAACCAGCAAACTCAACCAACACGGCAGACCGAAGAGCTCTGGCTGGT
AACAATGGCCTCCAGAGGGAGGTCGACGCTAAAGGGGCAGAGGATCAGGTTCAGGAAGGCCTAGAGACCGAGCCGCTCCGTAGGTTGGCACGCATTACCACGTCTGTTCT
GCCGCCAGCACATCCAAAACCATCTAAGGTCAATCGCGGCCAAGTTGGTGCCTCGAGAAGAACCACTCGAGGAGCAGCTCCAGCTCCTACTAAGGAGAACTTTGATACCC
TCCAGAAAGAAATGGAGGCAATGCGCACCCAGATGCGAACCATGGAAGAGATGTACAATGAAATGGTGCAAGCTGCTGGTGTCGGGTCTCGATCTGAAGACCGAGCAGCG
TACGACGAAGTGCACGAACAAGGGGGTCTTCACCTCGACCTAGTCGATGAGAGCACCCCGGAGGCGATGGAGATGAGGAGTATACTCGCCAGAGAAGCGATCTCCGCGAC
CATCTCAACAAAAAGAGAAGCTTTTCCTGAAGGAATGATCACAAGGGAGGAGTTCGACCAGCTCAAGAGCAAGTTTGATGCTCAAGTAGAAGCCTTGAAAGCAAGGTGCG
AGAAGAAAGAGACAGTGTTTGATGATGGCGACTTGGGAGAATCGCCGTTCACCTCAAATATTTTGGAGGCTCCAATTCCTCCAAAGTTCAAAACTCCCACTATGAAGCCG
TATGATAGGTCTAAGGACCCAAAGGATTATGTAGAGGTCTTTGAAGGCCTCATGGATTTTCAGGCGGCAACAGATGCCATAAAGTGTCTTGCCTTCCATATCGCGCTGAC
CGGCAGCGCGTGCTTATGGTACAGAAGGTTGTCGGAGGAGCAGCTGAAAGTCGTACACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCTTGGCCGATGAGA
CTCTTACTGTAAAACTTGGAGAGGAGGCTCCAGCCACTTTCGCCGAGGTCCTCTAA
mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGTCCGCCCAAGTATTCAGATCGGTCCACTTCAAACCTGCC
GAGGAGATCGGATTTGTCCATAAACAAGGCTAAGTGTAAAAAGTCGTGGTTCGACTATGACGGGTTCGATCTGCTCAAACCCGACAAGTTCGACATGAAAAAAGACAAAG
GTTCACTCCACGATTCGGGTCGAGTTGGTGATCGAGTTCGAGCTAGATATACAAAGATGGTTCAACCAGCAAACTCAACCAACACGGCAGACCGAAGAGCTCTGGCTGGT
AACAATGGCCTCCAGAGGGAGGTCGACGCTAAAGGGGCAGAGGATCAGGTTCAGGAAGGCCTAGAGACCGAGCCGCTCCGTAGGTTGGCACGCATTACCACGTCTGTTCT
GCCGCCAGCACATCCAAAACCATCTAAGGTCAATCGCGGCCAAGTTGGTGCCTCGAGAAGAACCACTCGAGGAGCAGCTCCAGCTCCTACTAAGGAGAACTTTGATACCC
TCCAGAAAGAAATGGAGGCAATGCGCACCCAGATGCGAACCATGGAAGAGATGTACAATGAAATGGTGCAAGCTGCTGGTGTCGGGTCTCGATCTGAAGACCGAGCAGCG
TACGACGAAGTGCACGAACAAGGGGGTCTTCACCTCGACCTAGTCGATGAGAGCACCCCGGAGGCGATGGAGATGAGGAGTATACTCGCCAGAGAAGCGATCTCCGCGAC
CATCTCAACAAAAAGAGAAGCTTTTCCTGAAGGAATGATCACAAGGGAGGAGTTCGACCAGCTCAAGAGCAAGTTTGATGCTCAAGTAGAAGCCTTGAAAGCAAGGTGCG
AGAAGAAAGAGACAGTGTTTGATGATGGCGACTTGGGAGAATCGCCGTTCACCTCAAATATTTTGGAGGCTCCAATTCCTCCAAAGTTCAAAACTCCCACTATGAAGCCG
TATGATAGGTCTAAGGACCCAAAGGATTATGTAGAGGTCTTTGAAGGCCTCATGGATTTTCAGGCGGCAACAGATGCCATAAAGTGTCTTGCCTTCCATATCGCGCTGAC
CGGCAGCGCGTGCTTATGGTACAGAAGGTTGTCGGAGGAGCAGCTGAAAGTCGTACACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCTTGGCCGATGAGA
CTCTTACTGTAAAACTTGGAGAGGAGGCTCCAGCCACTTTCGCCGAGGTCCTCTAA
Protein sequence
MLSMRAEVNLAEVRPTGKLGGGRCPPKYSDRSTSNLPRRSDLSINKAKCKKSWFDYDGFDLLKPDKFDMKKDKGSLHDSGRVGDRVRARYTKMVQPANSTNTADRRALAG
NNGLQREVDAKGAEDQVQEGLETEPLRRLARITTSVLPPAHPKPSKVNRGQVGASRRTTRGAAPAPTKENFDTLQKEMEAMRTQMRTMEEMYNEMVQAAGVGSRSEDRAA
YDEVHEQGGLHLDLVDESTPEAMEMRSILAREAISATISTKREAFPEGMITREEFDQLKSKFDAQVEALKARCEKKETVFDDGDLGESPFTSNILEAPIPPKFKTPTMKP
YDRSKDPKDYVEVFEGLMDFQAATDAIKCLAFHIALTGSACLWYRRLSEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVL
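The protein sequence should correspond to the translation of the CDS. The sketch below shows one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state. `translate` is a hypothetical helper written for illustration, and the two short inputs are the first seven codons and the last four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


print(translate("ATGCTAAGTATGAGGGCCGAG"))  # MLSMRAE
print(translate("GAGGTCCTCTAA"))           # EVL
```

Both fragments agree with the start (`MLSMRAE`) and end (`EVL`) of the protein sequence above. Passing the full CDS, with line breaks left in, would allow the same comparison over the whole sequence.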