; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g18010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g18010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:11961302..11963151
RNA-Seq ExpressionMoc03g18010
SyntenyMoc03g18010
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.1e-8771.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        VITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCR F+IALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLPA SISTY+QLR+EFLA FSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQKAKKVIDGQELLRTKT RPER+IGR RSGKD E ADPKSKDKGSFSSGRAE+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]2.2e-8569.4Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        +ITR EFDQLRG+LDAQ EALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCR FQIALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLPARSISTY+QLR+EFLAQFSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQK KKVIDG ELLRTKT RPER+I R RSGKD EK DPKSKDKGSFSSGR E+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]2.0e-8655.88Show/hide
Query:  MRTQMRSMEEMYNEMVLAASAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR----------------------------------
        MRTQM +ME+MY+EMV AA A SRSENRV R D+ EQRG HLGP ++  PE  E E +THQRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMVLAASAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR----------------------------------

Query:  -----VITRAEFDQLRGKLDAQVEALKAK---------------SP---------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQ
             VITR EFDQL+ K DAQVEALKAK               SP         IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----VITRAEFDQLRGKLDAQVEALKAK---------------SP---------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQ

Query:  IALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTY+QLRKEF+ QFSS HYD+KTATHL TIRQKE                                  ADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEA

Query:  PATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKSKDKGSFS-SGRAEFRRAVNGPTRSRP
        PATF E+LQKAKK+IDGQELLRTKT RPE++I + R+ KD+ K D K++DKG  S S R  +RR+ N   RSRP
Subjt:  PATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKSKDKGSFS-SGRAEFRRAVNGPTRSRP

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]5.7e-8971.64Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        +ITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCR FQIALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLP RSISTY+QLR+EFLAQFSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQKAKKVIDGQELLRTKT RPER+IGR RSGKD E+ADPKSKDKGSFSSGRAE+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]2.7e-9955.53Show/hide
Query:  DSTIREVGAAVVEGQGHDGLATEPLRRLARITALVLPPAQPRTSKATGGRGGTSKKGARGLAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMVLAASA
        D   REVGA VVEGQ H+GL TEP  R ARIT   L PA P+  KA  GRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +MEEMYNEMV A  A
Subjt:  DSTIREVGAAVVEGQGHDGLATEPLRRLARITALVLPPAQPRTSKATGGRGGTSKKGARGLAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMVLAASA

Query:  GSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR------------VITRAEFDQLRGKLDAQVEALKAK---------------SP--
        GSRSE+R  R D R     HL            S   +H+  + +            VITR EFDQL+ K DAQVE LKA+               SP  
Subjt:  GSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR------------VITRAEFDQLRGKLDAQVEALKAK---------------SP--

Query:  -------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLAT
               IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK R FQIALT SARLWYRRLPARSISTY+QLRKEF +QFSS HY++KTATHLAT
Subjt:  -------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLAT

Query:  IRQKE----------------------------------ADEALTVKLGEEAPATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKS
        IRQKE                                   DE LTVKLGEEAPATFAE+LQKAKKVIDGQEL RTKT R E++I + +  +++ KA+ KS
Subjt:  IRQKE----------------------------------ADEALTVKLGEEAPATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKS

Query:  KDKGSFSSGRAEFRRAVNGPTRSRP
        KDK       AE+RR+ +GP+RSRP
Subjt:  KDKGSFSSGRAEFRRAVNGPTRSRP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.5e-8771.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        VITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCR F+IALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLPA SISTY+QLR+EFLA FSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQKAKKVIDGQELLRTKT RPER+IGR RSGKD E ADPKSKDKGSFSSGRAE+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

A0A6J1CKB3 uncharacterized protein LOC1110120811.1e-8569.4Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        +ITR EFDQLRG+LDAQ EALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCR FQIALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLPARSISTY+QLR+EFLAQFSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQK KKVIDG ELLRTKT RPER+I R RSGKD EK DPKSKDKGSFSSGR E+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

A0A6J1DDW5 uncharacterized protein LOC1110196349.8e-8755.88Show/hide
Query:  MRTQMRSMEEMYNEMVLAASAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR----------------------------------
        MRTQM +ME+MY+EMV AA A SRSENRV R D+ EQRG HLGP ++  PE  E E +THQRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMVLAASAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR----------------------------------

Query:  -----VITRAEFDQLRGKLDAQVEALKAK---------------SP---------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQ
             VITR EFDQL+ K DAQVEALKAK               SP         IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----VITRAEFDQLRGKLDAQVEALKAK---------------SP---------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQ

Query:  IALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEA
        IALTGSARLWYRRLPARSISTY+QLRKEF+ QFSS HYD+KTATHL TIRQKE                                  ADE LTVKL EEA
Subjt:  IALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEA

Query:  PATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKSKDKGSFS-SGRAEFRRAVNGPTRSRP
        PATF E+LQKAKK+IDGQELLRTKT RPE++I + R+ KD+ K D K++DKG  S S R  +RR+ N   RSRP
Subjt:  PATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKSKDKGSFS-SGRAEFRRAVNGPTRSRP

A0A6J1DS95 uncharacterized protein LOC1110234212.8e-8971.64Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG
        +ITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCR FQIALTG
Subjt:  VITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTG

Query:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA
        SARLWYRRLP RSISTY+QLR+EFLAQFSS HYDKKTATHLATIRQKE                                  ADEALTVKLGEEAPATFA
Subjt:  SARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKE----------------------------------ADEALTVKLGEEAPATFA

Query:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP
        E+LQKAKKVIDGQELLRTKT RPER+IGR RSGKD E+ADPKSKDKGSFSSGRAE+RRA NGPTRSRP
Subjt:  ELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKD-EKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP

A0A6J1DZJ1 uncharacterized protein LOC1110257381.3e-9955.53Show/hide
Query:  DSTIREVGAAVVEGQGHDGLATEPLRRLARITALVLPPAQPRTSKATGGRGGTSKKGARGLAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMVLAASA
        D   REVGA VVEGQ H+GL TEP  R ARIT   L PA P+  KA  GRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +MEEMYNEMV A  A
Subjt:  DSTIREVGAAVVEGQGHDGLATEPLRRLARITALVLPPAQPRTSKATGGRGGTSKKGARGLAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMVLAASA

Query:  GSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR------------VITRAEFDQLRGKLDAQVEALKAK---------------SP--
        GSRSE+R  R D R     HL            S   +H+  + +            VITR EFDQL+ K DAQVE LKA+               SP  
Subjt:  GSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLR------------VITRAEFDQLRGKLDAQVEALKAK---------------SP--

Query:  -------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLAT
               IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK R FQIALT SARLWYRRLPARSISTY+QLRKEF +QFSS HY++KTATHLAT
Subjt:  -------IPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLAT

Query:  IRQKE----------------------------------ADEALTVKLGEEAPATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKS
        IRQKE                                   DE LTVKLGEEAPATFAE+LQKAKKVIDGQEL RTKT R E++I + +  +++ KA+ KS
Subjt:  IRQKE----------------------------------ADEALTVKLGEEAPATFAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDE-KADPKS

Query:  KDKGSFSSGRAEFRRAVNGPTRSRP
        KDK       AE+RR+ +GP+RSRP
Subjt:  KDKGSFSSGRAEFRRAVNGPTRSRP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGTATGACGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATTCCCGAGATTACCGGGTGTAACCATCGCGAGATTAGCAAG
ACTACCACCCATAAGTAGAGGACCTGACTCCACGATCAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTTGG
CACGAATCACCGCGCTTGTCCTACCACCTGCGCAGCCTCGGACATCCAAGGCCACCGGTGGCCGAGGTGGTACCTCTAAGAAGGGCGCCCGAGGTCTAGCTCCGGCTCCA
ACAAGCGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATGTATAACGAAATGGTACTAGCTGCAAGCGCAGGGTC
TCGATCTGAAAATCGAGTGACGCGCGTTGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAATGAGAGCGAGGGACACACTC
ACCAGAGGGGAGACCTCCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAG
TTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGATGCAATCAAATG
CCGCCCCTTTCAGATCGCGCTTACCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAAAGAGTTCCTTGCCCAAT
TCTCCTCTTGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGCTGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAACTGCTGCAGAAGGCGAAAAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGGAAAAGA
TGAAAAGGCGGATCCCAAGTCCAAGGATAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGTATGACGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATTCCCGAGATTACCGGGTGTAACCATCGCGAGATTAGCAAG
ACTACCACCCATAAGTAGAGGACCTGACTCCACGATCAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTTGG
CACGAATCACCGCGCTTGTCCTACCACCTGCGCAGCCTCGGACATCCAAGGCCACCGGTGGCCGAGGTGGTACCTCTAAGAAGGGCGCCCGAGGTCTAGCTCCGGCTCCA
ACAAGCGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATGTATAACGAAATGGTACTAGCTGCAAGCGCAGGGTC
TCGATCTGAAAATCGAGTGACGCGCGTTGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAATGAGAGCGAGGGACACACTC
ACCAGAGGGGAGACCTCCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAG
TTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGATGCAATCAAATG
CCGCCCCTTTCAGATCGCGCTTACCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAAAGAGTTCCTTGCCCAAT
TCTCCTCTTGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAAGCTGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACC
TTCGCCGAACTGCTGCAGAAGGCGAAAAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCAGCCGACCAGAACGAAGGATCGGCCGGGATAGAAGCGGAAAAGA
TGAAAAGGCGGATCCCAAGTCCAAGGATAAGGGATCTTTCTCCAGTGGCCGAGCTGAGTTTCGAAGGGCGGTGAACGGACCCACCAGGAGCCGACCTTAG
Protein sequenceShow/hide protein sequence
MRSMTAEVNLAEVRPTGKLGGGRFPRLPGVTIARLARLPPISRGPDSTIREVGAAVVEGQGHDGLATEPLRRLARITALVLPPAQPRTSKATGGRGGTSKKGARGLAPAP
TSENFDALQREMEAMRTQMRSMEEMYNEMVLAASAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKAKSPIPPK
FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRPFQIALTGSARLWYRRLPARSISTYAQLRKEFLAQFSSWHYDKKTATHLATIRQKEADEALTVKLGEEAPAT
FAELLQKAKKVIDGQELLRTKTSRPERRIGRDRSGKDEKADPKSKDKGSFSSGRAEFRRAVNGPTRSRP