; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g26730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g26730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionSWIM-type domain-containing protein
Genome locationchr9:19916047..19922064
RNA-Seq ExpressionMoc09g26730
SyntenyMoc09g26730
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7138490.1 hypothetical protein RHSIM_Rhsim07G0255600 [Rhododendron simsii]2.0e-5333.58Show/hide
Query:  EPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEV-SGYRVIESDKDVIELTSLLSDGMTFEIYVEH-----------------
        E  ++Y+ G V+   + D D  S  EV D+VE+LGY  Y  LW R P   +  G RVIE+DKD + + +++      EIYVEH                 
Subjt:  EPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEV-SGYRVIESDKDVIELTSLLSDGMTFEIYVEH-----------------

Query:  --------------------------------------------EYDDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEV
                                                    +YDDVL + Y +K+ E T K +     +D+    D +++ +               
Subjt:  --------------------------------------------EYDDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEV

Query:  CVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRL
            ++  ++     E   F  GMLF+SL +FK A+ +YAV GG+ I+F KND++RVRA C D C +    +KV  + T+QLKT   EH C+R++ N RL
Subjt:  CVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRL

Query:  TSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALL-NLDVRQEQEEVDAEKAF
        +S++L+ ++V  VK+QP++RL  IQ+KV  K++ QI++ KA+RAK+ ALD+V GSHKEQY+ LW+YC E++RSNP S+ ++  LD   +  +   E  F
Subjt:  TSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALL-NLDVRQEQEEVDAEKAF

KAF7153765.1 hypothetical protein RHSIM_Rhsim01G0020300 [Rhododendron simsii]2.0e-4533.72Show/hide
Query:  DVVEQLGYYDYLKLWGRDPTKEVS--GYRVIESDKDVIELTSLLSDGMTFEIYVEHEYDDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDGRSDIDSSE
        ++V  LGY  +  +W R P  E++  G RV+  D D +++ S   +    E+YVEH  ++ L    + +D        D +  ++ +  ++ ++D     
Subjt:  DVVEQLGYYDYLKLWGRDPTKEVS--GYRVIESDKDVIELTSLLSDGMTFEIYVEHEYDDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDGRSDIDSSE

Query:  LLSLKDSSDSEVCV-------------------------------------------------VTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDY
          S+ DS DS V V                                                 V  + +YR +   E   FE GM+F SL++FK AV +Y
Subjt:  LLSLKDSSDSEVCV-------------------------------------------------VTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDY

Query:  AVKGGWQIRFVKNDKTRVRAKC--VDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQIN
        A+ GG+ I+F KNDK RVRA C     C +   V+KV  ++T+Q+KT V EH+C+RS+ N RL S++LS +++N VK+QP+++L  IQ++VQ+KY+  I+
Subjt:  AVKGGWQIRFVKNDKTRVRAKC--VDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQIN

Query:  KVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLD
        + KA+RAK+ A+D V GSH EQY+ LW+YC E+RRSN  S+  ++++
Subjt:  KVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLD

KAG5541483.1 hypothetical protein RHGRI_021341 [Rhododendron griersonianum]3.2e-5133.25Show/hide
Query:  EPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEV--SGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------
        E   +Y+ G V+   + DSD WS  E+ +++  LGY  +  +W R P  E+   G RV+  D D +++ S   +    E+YVEH                
Subjt:  EPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEV--SGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------

Query:  -------------------------------------------------DDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDG-------------RSDI
                                                         DD L + Y D  V+G  +  D     D +  +D              + D 
Subjt:  -------------------------------------------------DDVLSNIYVDKDVEGTSKPSDIIRTYDNVENNDG-------------RSDI

Query:  DSSELLSLKDSSDS----EVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKC--VDSCKWLAYVAKVQGEM
        +S ELLS   SS+S    E   V  + +YR +   E   F+ GM+F SL++FK AV +YAV GG+ I+F KNDK RVRA C     C +   V+KV  ++
Subjt:  DSSELLSLKDSSDS----EVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKC--VDSCKWLAYVAKVQGEM

Query:  TYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSS
        T+Q+KT V EH+C+RS+ N RL S++LS +++N VK+QP+++L  IQ++VQ+KY+  I++ KA+RAK+ ALD V GSH EQY+ LW+YC E+RRSN  S+
Subjt:  TYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSS

Query:  ALLNLDVRQEQEEVDAEK
          +N++   E   + A +
Subjt:  ALLNLDVRQEQEEVDAEK

XP_022153708.1 uncharacterized protein LOC111021158 [Momordica charantia]1.9e-15779.89Show/hide
Query:  MHEPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------
        +HEP I+YI+GKVDRYTNFD DYWSFFEV+ VVEQLGYY YLKLWGRDPTKEV GYR+IESDKDV+EL SLLSDGM FEIYVEHE               
Subjt:  MHEPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------

Query:  -------------DDVLSNIYVDKDVEGT-SKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEF
                     DD+LSNIY+DKDVEGT SKPS+II   DNVENNDG SDIDSSELLSLKDSSDSEV V  KY +YRE TGDEKP FELGM FNSL EF
Subjt:  -------------DDVLSNIYVDKDVEGT-SKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEF

Query:  KNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKY
        KN VDDYAVKGGWQIRFVKNDKTRVRAKCVD CKWLAYVAKVQGEMTYQLKTFVGEHSCSRSF+NP LTSRWL RQIVNDVKEQPD+RLRAIQ+KVQRKY
Subjt:  KNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKY

Query:  ISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQEEVDAE
        ISQINKVKAFRAKK+ALDIVHGSHKEQ S LWEYC EI RSNP SSALLNLDVRQEQEEVD +
Subjt:  ISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQEEVDAE

XP_023896782.1 uncharacterized protein LOC112008678 [Quercus suber]9.9e-4543.72Show/hide
Query:  NNDGRSDIDSSELLSLKDSSDSEVCVVT---KYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAK
        ++ G   +D S+     D++   V  VT   K+ ++++++  E   FE  MLF S  +FK+A+ +YAV GGW ++FVKNDK RVRAKC   CK+ AY+AK
Subjt:  NNDGRSDIDSSELLSLKDSSDSEVCVVT---KYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAK

Query:  VQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRS
        +  EM+YQLKT   EH+C+RS+ NPR T+++L+R+++  V+ QP+++L+ IQ+ V  KY+  IN  KA RA++ A + V GS+ EQY++LW+YC E+RRS
Subjt:  VQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRS

Query:  NPSSSALLNLDVRQE
        +P S+ L+      E
Subjt:  NPSSSALLNLDVRQE

TrEMBL top hitse value%identityAlignment
A0A2N9EMB4 SWIM-type domain-containing protein3.3e-5441.18Show/hide
Query:  EVSGYRV--IESDKDVIELTSLLSDGMTFEIYVEH---------EYDDVLSNIYVDKDVEGTSKPSD--------------IIRTYDNVENND-------
        E  G RV  I+SDKD + +   +      E++VEH         E+  + +   VD+D   + + SD               + TY ++E+++       
Subjt:  EVSGYRV--IESDKDVIELTSLLSDGMTFEIYVEH---------EYDDVLSNIYVDKDVEGTSKPSD--------------IIRTYDNVENND-------

Query:  -------------GRSDID----SSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCV
                     G SD D    S ELLS+ D       V  +Y +Y+     +   FE+GM FNSL EFKNAV DYAV GG  IRF KNDK RVRA C 
Subjt:  -------------GRSDID----SSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCV

Query:  DSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSR
          CKW+AY  K+ GEMT QL+TFV EH+CSRS+ N R TS+WL +++ N + EQPDM    IQ+KV +K++  I++ KA+RAKK+AL+ + GSHKEQYSR
Subjt:  DSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSR

Query:  LWEYCEEIRRSNPSSSALLNLDV
        + +YC E+ R+N  SS LL  D+
Subjt:  LWEYCEEIRRSNPSSSALLNLDV

A0A2N9H0G1 Uncharacterized protein4.0e-5254.05Show/hide
Query:  YHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWL
        + I+R +   +   FELGMLF S  +FK A+  YAV+GGW IRFVKNDK RVRA C + CK++AY+AK+  E+T+QLKT   EHSCSR F NPR+T+++L
Subjt:  YHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWL

Query:  SRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQE
        ++++V  VK+QPD++LR+IQ KV++KY++ I++ KA+RAK  A+DI+ GSH EQY+ LW+YCEE+RRSNP S+ L+ +    E E
Subjt:  SRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQE

A0A2N9HKG1 SWIM-type domain-containing protein1.0e-5538.89Show/hide
Query:  VVEQLGYYDYLKLWGRDP--TKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEYDDVLSNI---YVDKDVEGT-------------------SKPSD
        +V +LGY    +LW R P  + E  G + + SD D + +T L+      E++VEH  ++ + N     VD D  G                        D
Subjt:  VVEQLGYYDYLKLWGRDP--TKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEYDDVLSNI---YVDKDVEGT-------------------SKPSD

Query:  IIRTYDNV------ENNDGRSDIDSSELLSLKDSS---DSEVCVVTK----------YHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIR
        +   Y ++      + N   SD D+S+  +  +++   +S   V  +          + I+R +   +   FELGMLF S  +FK A+ +YAV+GGW IR
Subjt:  IIRTYDNV------ENNDGRSDIDSSELLSLKDSS---DSEVCVVTK----------YHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIR

Query:  FVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMA
        FVKNDK RVRA C + CK++AY+AK+  E+T+QLKT   EHSCSR F NPR+T+++L++++V  VK+QPD++LR+IQ KV +KY++ I++ KA+RAK  A
Subjt:  FVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMA

Query:  LDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQE
        +DI+ GSH EQY+ LW+YCEE+RRSNP S+ L+ +    E E
Subjt:  LDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQE

A0A2N9HUS0 Uncharacterized protein6.9e-5252.86Show/hide
Query:  GRSDID----SSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQ
        G SD D    S ELLS+ D       V  +Y +Y+     +   FE+GM FNSL EFKNAV DYAV GG  IRF KNDK RVRA C   CKW+AY  K+ 
Subjt:  GRSDID----SSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQ

Query:  GEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNP
        GEMT QL+TFV EH+CSRS+ N R TS+WL +++ N + EQPDM    IQ+KV +K++  I++ KA+RAKK+AL+ + GSHKEQYSR+ +YC E+ R+N 
Subjt:  GEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNP

Query:  SSSALLNLDV
         SS LL  D+
Subjt:  SSSALLNLDV

A0A6J1DI69 uncharacterized protein LOC1110211589.3e-15879.89Show/hide
Query:  MHEPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------
        +HEP I+YI+GKVDRYTNFD DYWSFFEV+ VVEQLGYY YLKLWGRDPTKEV GYR+IESDKDV+EL SLLSDGM FEIYVEHE               
Subjt:  MHEPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEY--------------

Query:  -------------DDVLSNIYVDKDVEGT-SKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEF
                     DD+LSNIY+DKDVEGT SKPS+II   DNVENNDG SDIDSSELLSLKDSSDSEV V  KY +YRE TGDEKP FELGM FNSL EF
Subjt:  -------------DDVLSNIYVDKDVEGT-SKPSDIIRTYDNVENNDGRSDIDSSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEF

Query:  KNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKY
        KN VDDYAVKGGWQIRFVKNDKTRVRAKCVD CKWLAYVAKVQGEMTYQLKTFVGEHSCSRSF+NP LTSRWL RQIVNDVKEQPD+RLRAIQ+KVQRKY
Subjt:  KNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTYQLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKY

Query:  ISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQEEVDAE
        ISQINKVKAFRAKK+ALDIVHGSHKEQ S LWEYC EI RSNP SSALLNLDVRQEQEEVD +
Subjt:  ISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQEEVDAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGAGCCATCTATTCAATACATAACTGGTAAGGTGGATCGCTATACAAATTTTGATTCAGATTATTGGTCTTTCTTTGAGGTAGTTGATGTTGTTGAACAATTGGG
GTACTATGATTATCTAAAATTGTGGGGTAGGGATCCAACAAAAGAGGTGAGTGGTTATCGAGTAATAGAATCTGATAAAGATGTCATAGAATTAACTAGCCTTTTAAGTG
ATGGAATGACATTTGAGATATATGTAGAACATGAATATGATGATGTATTGTCTAATATATATGTTGATAAGGATGTAGAGGGTACTTCAAAACCATCGGATATCATAAGA
ACATATGATAATGTTGAGAATAATGATGGCAGAAGTGATATTGACTCTAGCGAGCTACTCTCATTGAAAGACTCGTCTGATTCTGAAGTTTGTGTTGTAACAAAATATCA
TATTTATAGAGAAATAACAGGTGATGAGAAACCATACTTTGAATTAGGCATGTTATTTAACTCATTGAGTGAGTTCAAGAATGCTGTAGATGATTATGCTGTTAAGGGAG
GATGGCAAATACGATTTGTGAAAAATGATAAGACTAGAGTTAGAGCCAAATGTGTGGATAGTTGTAAATGGTTGGCTTATGTTGCCAAGGTACAAGGAGAAATGACTTAT
CAGTTGAAGACCTTTGTTGGTGAACACTCGTGCAGTAGATCCTTCAACAATCCACGCCTAACATCTAGGTGGCTAAGTAGGCAAATTGTAAATGATGTGAAGGAACAACC
AGATATGAGATTGAGAGCTATCCAGGATAAGGTACAACGTAAGTACATTTCACAAATTAATAAAGTGAAAGCTTTTAGAGCAAAGAAAATGGCTTTAGATATAGTGCATG
GATCACACAAAGAGCAATATAGTCGATTATGGGAGTATTGTGAGGAGATACGCAGATCCAATCCTAGTAGCAGTGCTCTTTTGAACTTGGATGTGCGACAAGAACAAGAA
GAGGTTGATGCTGAGAAAGCTTTTTTGGCGAGCAGCCAACGCAACTTATTATCAGCAGATTGTCAGGTGTCTTCGCTTACGAGGGGTCATGACGTCTGGATTGAAGAGCT
TATCATGAACACCTTTATGGAGGCTGAGGCAAATATTATCCTTCAAATCCATATTCCAAGGAGGGATTCCGACGATGAGATTATTTGGGATTATGACAAAAAAGGAATTT
TCTCCATCAAAAGTGCGTACCATGTGGGGATGATGGATGAATCCAAAGAAGAGACTTCCTCCTCAAATAATGAAAATCAAGGACGTTGGTGGAAAGGCTTGTGGAAGGCG
AATGTTCTCCCAAAAGTCAAAATATGCTACTGGAAACTTATTCATGACATTATGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGAGCCATCTATTCAATACATAACTGGTAAGGTGGATCGCTATACAAATTTTGATTCAGATTATTGGTCTTTCTTTGAGGTAGTTGATGTTGTTGAACAATTGGG
GTACTATGATTATCTAAAATTGTGGGGTAGGGATCCAACAAAAGAGGTGAGTGGTTATCGAGTAATAGAATCTGATAAAGATGTCATAGAATTAACTAGCCTTTTAAGTG
ATGGAATGACATTTGAGATATATGTAGAACATGAATATGATGATGTATTGTCTAATATATATGTTGATAAGGATGTAGAGGGTACTTCAAAACCATCGGATATCATAAGA
ACATATGATAATGTTGAGAATAATGATGGCAGAAGTGATATTGACTCTAGCGAGCTACTCTCATTGAAAGACTCGTCTGATTCTGAAGTTTGTGTTGTAACAAAATATCA
TATTTATAGAGAAATAACAGGTGATGAGAAACCATACTTTGAATTAGGCATGTTATTTAACTCATTGAGTGAGTTCAAGAATGCTGTAGATGATTATGCTGTTAAGGGAG
GATGGCAAATACGATTTGTGAAAAATGATAAGACTAGAGTTAGAGCCAAATGTGTGGATAGTTGTAAATGGTTGGCTTATGTTGCCAAGGTACAAGGAGAAATGACTTAT
CAGTTGAAGACCTTTGTTGGTGAACACTCGTGCAGTAGATCCTTCAACAATCCACGCCTAACATCTAGGTGGCTAAGTAGGCAAATTGTAAATGATGTGAAGGAACAACC
AGATATGAGATTGAGAGCTATCCAGGATAAGGTACAACGTAAGTACATTTCACAAATTAATAAAGTGAAAGCTTTTAGAGCAAAGAAAATGGCTTTAGATATAGTGCATG
GATCACACAAAGAGCAATATAGTCGATTATGGGAGTATTGTGAGGAGATACGCAGATCCAATCCTAGTAGCAGTGCTCTTTTGAACTTGGATGTGCGACAAGAACAAGAA
GAGGTTGATGCTGAGAAAGCTTTTTTGGCGAGCAGCCAACGCAACTTATTATCAGCAGATTGTCAGGTGTCTTCGCTTACGAGGGGTCATGACGTCTGGATTGAAGAGCT
TATCATGAACACCTTTATGGAGGCTGAGGCAAATATTATCCTTCAAATCCATATTCCAAGGAGGGATTCCGACGATGAGATTATTTGGGATTATGACAAAAAAGGAATTT
TCTCCATCAAAAGTGCGTACCATGTGGGGATGATGGATGAATCCAAAGAAGAGACTTCCTCCTCAAATAATGAAAATCAAGGACGTTGGTGGAAAGGCTTGTGGAAGGCG
AATGTTCTCCCAAAAGTCAAAATATGCTACTGGAAACTTATTCATGACATTATGATGTGA
Protein sequenceShow/hide protein sequence
MHEPSIQYITGKVDRYTNFDSDYWSFFEVVDVVEQLGYYDYLKLWGRDPTKEVSGYRVIESDKDVIELTSLLSDGMTFEIYVEHEYDDVLSNIYVDKDVEGTSKPSDIIR
TYDNVENNDGRSDIDSSELLSLKDSSDSEVCVVTKYHIYREITGDEKPYFELGMLFNSLSEFKNAVDDYAVKGGWQIRFVKNDKTRVRAKCVDSCKWLAYVAKVQGEMTY
QLKTFVGEHSCSRSFNNPRLTSRWLSRQIVNDVKEQPDMRLRAIQDKVQRKYISQINKVKAFRAKKMALDIVHGSHKEQYSRLWEYCEEIRRSNPSSSALLNLDVRQEQE
EVDAEKAFLASSQRNLLSADCQVSSLTRGHDVWIEELIMNTFMEAEANIILQIHIPRRDSDDEIIWDYDKKGIFSIKSAYHVGMMDESKEETSSSNNENQGRWWKGLWKA
NVLPKVKICYWKLIHDIMM