; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g27880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g27880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:20942408..20951618
RNA-Seq ExpressionMoc06g27880
SyntenyMoc06g27880
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]1.9e-7989.89Show/hide
Query:  MFQYKRREKKSSKRRAVQAKKLTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNATNLATRTYLMKSRKIMTELGFDLTLGDVPD
        MFQYKRRE KSSKRRAVQ  K TVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRT LMK  KIMTELGFDLTLGDVPD
Subjt:  MFQYKRREKKSSKRRAVQAKKLTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNATNLATRTYLMKSRKIMTELGFDLTLGDVPD

Query:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPESHIAIVRGKEIRFDATQINCTFNIKNIRDAVGNKILVTPTLE
        DWR+TAR KEWRPLIQPIQCEALELVREFYAA HP+SHIAIVRGKEIRFDATQIN TFNIKNI+DAVGNK+LVTPTLE
Subjt:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPESHIAIVRGKEIRFDATQINCTFNIKNIRDAVGNKILVTPTLE

XP_022153526.1 uncharacterized protein LOC111021009 [Momordica charantia]1.5e-5256.68Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        T+GQF GL +EDP+SHLKSF ++AN+F+LP +S DALRLK+FPFSL   A  WLNA   +SIN+W  + +KFLAKY   T+N D+RE+I+SFRQ+ENE V
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIHATCRVCRGQSDCSSLSRVEP
         EAWERFKEL+R CP+  +PACVQIE FYRG D  ++MMLNTAANG    K+ NEIV IL+++ +             CS  SR +P
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIHATCRVCRGQSDCSSLSRVEP

XP_022155016.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022160 [Momordica charantia]2.2e-5160.74Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        T+GQFGG  ++DP+ HLK+F +IA AF+ P ++ DALRL +FPFSL D ARTWLN     SI TW  L EKFL KY   TR+ D+ E+IV+FRQ + E V
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKM
         EAWERFKELLR+CP+HGLPAC+QIE F+RGLD  ++MMLN AANG+  +K+ NEIVDIL  +
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKM

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]3.8e-8090.42Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        TMGQFGGLTNEDPYSHLKSFIEIANAFQLP  S DALRLKMFPFSL DGARTW+NALE NSINTWAELT+KFLAKYHTLT+N DLREDIVSFRQKENEAV
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH
        QEAWERFKELLRRCPSHGLP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKS NEIVD+LNKM DI+
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]1.4e-7182.53Show/hide
Query:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAVQ
        M QFGG TNEDPYSHLKSFI+IANAFQLP VS DALRLKMFPFSL DGA TW+N LEQN I TWAELT+KFLAKYHTLTRN DL+EDIVSFRQ+E+EAVQ
Subjt:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAVQ

Query:  EAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH
        EAWERFKELL+RC SHGLP CVQI+QFYRGLD   RMM +TAAN SLLEKS NEI+DILNKMIDI+
Subjt:  EAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH

TrEMBL top hitse value%identityAlignment
A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC1110195159.1e-8089.89Show/hide
Query:  MFQYKRREKKSSKRRAVQAKKLTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNATNLATRTYLMKSRKIMTELGFDLTLGDVPD
        MFQYKRRE KSSKRRAVQ  K TVPMNEPKTRAAKAKAAEAKKKVVAPGPVD IELDLSEGE+VET WNA NLATRT LMK  KIMTELGFDLTLGDVPD
Subjt:  MFQYKRREKKSSKRRAVQAKKLTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNATNLATRTYLMKSRKIMTELGFDLTLGDVPD

Query:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPESHIAIVRGKEIRFDATQINCTFNIKNIRDAVGNKILVTPTLE
        DWR+TAR KEWRPLIQPIQCEALELVREFYAA HP+SHIAIVRGKEIRFDATQIN TFNIKNI+DAVGNK+LVTPTLE
Subjt:  DWRETARDKEWRPLIQPIQCEALELVREFYAAVHPESHIAIVRGKEIRFDATQINCTFNIKNIRDAVGNKILVTPTLE

A0A6J1DKX0 uncharacterized protein LOC1110210097.3e-5356.68Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        T+GQF GL +EDP+SHLKSF ++AN+F+LP +S DALRLK+FPFSL   A  WLNA   +SIN+W  + +KFLAKY   T+N D+RE+I+SFRQ+ENE V
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIHATCRVCRGQSDCSSLSRVEP
         EAWERFKEL+R CP+  +PACVQIE FYRG D  ++MMLNTAANG    K+ NEIV IL+++ +             CS  SR +P
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIHATCRVCRGQSDCSSLSRVEP

A0A6J1DQF5 LOW QUALITY PROTEIN: uncharacterized protein LOC1110221601.1e-5160.74Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        T+GQFGG  ++DP+ HLK+F +IA AF+ P ++ DALRL +FPFSL D ARTWLN     SI TW  L EKFL KY   TR+ D+ E+IV+FRQ + E V
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKM
         EAWERFKELLR+CP+HGLPAC+QIE F+RGLD  ++MMLN AANG+  +K+ NEIVDIL  +
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKM

A0A6J1DYY9 uncharacterized protein LOC1110255575.4e-7283.13Show/hide
Query:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAVQ
        M QFGG TNEDPYSHLKSFI+IANAFQLP VS DALRLKMFPFSL DGA TWLN LEQN I TWAELT+KFLAKYHTLTRN DL+EDIVSFRQ+E+EAVQ
Subjt:  MGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAVQ

Query:  EAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH
        EAWERFKELL+RC SHGLP CVQI+QFYRGLD   RMM +TAAN SLLEKS NEI+DILNKMIDI+
Subjt:  EAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH

A0A6J1E251 uncharacterized protein LOC1110253021.8e-8090.42Show/hide
Query:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV
        TMGQFGGLTNEDPYSHLKSFIEIANAFQLP  S DALRLKMFPFSL DGARTW+NALE NSINTWAELT+KFLAKYHTLT+N DLREDIVSFRQKENEAV
Subjt:  TMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQKENEAV

Query:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH
        QEAWERFKELLRRCPSHGLP+CVQIEQFYRGLDRSS+MMLNT ANGSLLEKS NEIVD+LNKM DI+
Subjt:  QEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCTGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAA
GCTAAAGCAGCTGAAGCTAAGAAAAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCG
ACAAATTTAGCCACTCGAACTTACTTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACC
GCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCGAGTCACATATAGCC
ATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTGCACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATTTTAGTGACTCCG
ACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCTTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCC
CTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTA
AAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACATGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTA
TGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGA
AGGGAAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACTGACACTGATCCTAGTCCACAACCTAGCTGGGCTAGTAGCCGGTCTTTTAGATGTAAAAGT
CTTAGTGTAAGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGGAAATACCTACATGGGAGGCGTCAGG
CGCCTGGGAAGCCTGCAGAAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAATGCGTCTTCCCATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAA
GAAGTGTTCCACTATCAGTTTGAGCACGATTTGACGATGGGTCAGTTCGGAGGACTGACTAATGAAGATCCTTACTCCCACCTCAAATCTTTTATTGAAATAGCT
AATGCATTTCAACTTCCTATTGTCTCTGGGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCATGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACAA
AATTCTATCAACACATGGGCGGAACTGACGGAGAAATTTTTGGCGAAGTACCACACTTTGACTAGGAACACAGACCTTCGAGAAGACATTGTGTCTTTTCGACAA
AAGGAAAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTAAGAAGATGCCCGAGCCATGGATTACCCGCATGTGTGCAAATTGAACAATTCTAT
AGAGGATTGGATCGTTCATCAAGGATGATGTTAAACACTGCAGCCAATGGCTCATTGTTAGAAAAGTCGGCTAATGAGATCGTTGATATCTTAAACAAGATGATA
GACATACATGCCACGTGTAGGGTCTGTAGAGGTCAGAGCGACTGTAGCTCGTTGTCTAGGGTAGAGCCAAACCAAAATATGGGAGGGAACGCAGTTGCCTCAACT
TCTAGTCGAGTCAAACCCTGTCCTGAGCTTCCCCCTTCTATTTGCTCTCTGTCCAACCTGTTCAAGCTTCGGGAAGTTTACAGGATCCCTGATAACATAGAAATG
AGATTACCTCTGGTCGATAAAAGCTTAGATAACCCTTCACCTGGGTCGGTAGGTTTCTATCCCGAGATGTTCGACCATGGGGTCAGATTGCCCTTACACCCCTTT
GTACAAAGGTTCCTTTCAACTACGAACTTGGCCCCGGCTCAACTGGTACCGAACGGGTGGACCACACTACTCGATCTGTGGTTCACCTGGTGGAGCTTCAGTGCA
GGCGAAGAGAGTGTCCTCCTAGACGTAGGACAATTCTTTGACACTCACATCATCAAACCTTACAAGGAGCATCCGGGTCGGTACTACATATCGGCTCGGAATAGC
TTCTGCAAGATAGTGGACTCTCCCTCGGCCAATAAGCATTGGAGAGACAAATGGTTCCTCGTCTCGGGTCTCCTCGACGAGCTTGAACCCAACCGAGCTCGCCAC
CTCGAAATCATGGTGTTCAAACCATACAACTCTATGAACCGAAAGCGATCCACCGATCGGCCTGCAGCTGGTGATGCTTCTAAGAAACGTGGGAGGTCAGATGAG
GCTTCATCCGGGCGAAAAAGTCGATTTTCCGACCCGAAGGGGAAGCGCTTCTTCGATGCCTCGTCTCATTCTCGTTCAAAGCCAATCAGCATTCACTCGAAGGAC
GAAATGAGCACTCAGTACCTGCCGATAATGGACTTTTCATACCCCTTCAAGGGCAGTTCTGTTAGGGAGAGCATTCGTGAGGCTGCCCTGACAGCCTACAAGGCC
AGCTCGGCTCAGATGCTGGAATCAAGCCAGTCTTCTTTCTTGGAAAAGCCAAGTGATTACGTGCAGTGTCTGATTGACGACATTGCTCAGCTTCACTTTACAGCC
TTCCATACTAGGGCCATCGTCAGCCAGGAGTTGACTGCTAGGAAGGCCAGCTTCACCCGCGAATGTGAAGCTACCAAACAGACTGAGGAGCTTAGAGTTGAGGTG
GTGAAGCTCTGCAAGGCTAAGGCGACTGAAGAGCGTGTCGACAGGGAAGTGACCGAACGTGCTTCTGAAAAGGCTGACTTTGAAGCAAAGTTGAAGAACTTCGAC
TTCTTGGAATCAGCCATGAAGAAAGTTCCTAATTTTGATGACTTGGTGCGGGACTTAGATGACAGGGGCTTCGACATTGTCGTAGCCGAGGTAAAAAAGCTTGCC
TCTACACTGGACTTGGCTCCTATATATGCAGCCTTTGAGGCAGTTATGGAAGAGGATGAGGAAGGAGAAGCTCAAGCTGACCATCCTGTTGATGAGGGTACTGAA
GTGCGTGCGTTCAGCAAAACGCCCAGTCAGCGCCTCATCATTGGCTCGCTGGTCGAAATGCTCGACTCTGAAGGTGAACTGACCTTTCCTAGCAGGGACTACTTC
GATCAGGTTGAGGCCTCACTAAGCGAACGACCAATGACTGAAAATGGGTGTGAGGGGCTCCAGAGGCCGGTGCGGAATGGTGGCGAAGCGTTGACGGTTATCACA
CTTCTTCACAAACTGCTTTTTTGCAGCTCGACGCGCTACCTTCCTGGCTTCGGCACGGTCATCAAGAAGCTTCCCATTAAAGTAATCTTTGATGGGATCATCAAG
GTTAGAGAAGTTGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCAATACAAGAGGCGGGAGAAAAAGAGTTCAAAACGTCGTGCAGTTCAGGCTAAGAAGCTGACAGTGCCCATGAATGAACCTAAAACGAGAGCTGCGAAA
GCTAAAGCAGCTGAAGCTAAGAAAAAGGTAGTGGCACCTGGGCCAGTTGATACAATCGAACTGGACTTGTCTGAGGGAGAGGAGGTCGAGACGAAATGGAACGCG
ACAAATTTAGCCACTCGAACTTACTTAATGAAATCCCGTAAGATTATGACAGAATTGGGATTCGATCTCACTCTAGGAGATGTGCCTGATGATTGGAGGGAGACC
GCTAGAGATAAAGAATGGAGACCACTCATTCAGCCCATACAATGTGAGGCTTTGGAGTTAGTCAGAGAGTTCTATGCTGCTGTCCATCCCGAGTCACATATAGCC
ATAGTGCGCGGGAAGGAAATACGGTTTGATGCCACTCAGATCAACTGCACCTTCAACATTAAGAATATCAGAGATGCTGTGGGCAATAAGATTTTAGTGACTCCG
ACTCTGGAACAGCTTGATGAGGCTCTAGAATGTGTTGGGAAGCCTTCTGCCACTTGGGATTTGACTACTCATGGCAAGGTACGACTAAAACCCGAGGATGTTTCC
CTAGCTGCTGCAGGATGGTTATACATAGTCAAAAACAGAATTCTGCCAACGGAGCATGATGAGCATGTCACTCAGGATAGGGCACTGCTGGTTTATGCCATGCTA
AAGGGCATAGATGTGAATTATGGAGAATTGATCAATACCAGTATCCATGAGTGTGCCCACCGGACATGTGGTAAGCTTTATCACCCACGTTTGGTCACTTCTTTA
TGCTTGCGACAAGGTGTACAGCTCCCTGAGGATCAAATTAAGAGAGATGCCCCAATTGTGGAAGAGAAGAATATTCGGCGTATTATCGCCCATGCGTTACAAAGA
AGGGAAGCGCTAGCTGCTATCCTTGGTCATCCATCTTCCAGTACTGACACTGATCCTAGTCCACAACCTAGCTGGGCTAGTAGCCGGTCTTTTAGATGTAAAAGT
CTTAGTGTAAGGAGGGAGTGTGCAGATTCCTTAGGGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGGAAATACCTACATGGGAGGCGTCAGG
CGCCTGGGAAGCCTGCAGAAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAATGCGTCTTCCCATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAA
GAAGTGTTCCACTATCAGTTTGAGCACGATTTGACGATGGGTCAGTTCGGAGGACTGACTAATGAAGATCCTTACTCCCACCTCAAATCTTTTATTGAAATAGCT
AATGCATTTCAACTTCCTATTGTCTCTGGGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCATGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACAA
AATTCTATCAACACATGGGCGGAACTGACGGAGAAATTTTTGGCGAAGTACCACACTTTGACTAGGAACACAGACCTTCGAGAAGACATTGTGTCTTTTCGACAA
AAGGAAAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTTAAGGAGTTACTAAGAAGATGCCCGAGCCATGGATTACCCGCATGTGTGCAAATTGAACAATTCTAT
AGAGGATTGGATCGTTCATCAAGGATGATGTTAAACACTGCAGCCAATGGCTCATTGTTAGAAAAGTCGGCTAATGAGATCGTTGATATCTTAAACAAGATGATA
GACATACATGCCACGTGTAGGGTCTGTAGAGGTCAGAGCGACTGTAGCTCGTTGTCTAGGGTAGAGCCAAACCAAAATATGGGAGGGAACGCAGTTGCCTCAACT
TCTAGTCGAGTCAAACCCTGTCCTGAGCTTCCCCCTTCTATTTGCTCTCTGTCCAACCTGTTCAAGCTTCGGGAAGTTTACAGGATCCCTGATAACATAGAAATG
AGATTACCTCTGGTCGATAAAAGCTTAGATAACCCTTCACCTGGGTCGGTAGGTTTCTATCCCGAGATGTTCGACCATGGGGTCAGATTGCCCTTACACCCCTTT
GTACAAAGGTTCCTTTCAACTACGAACTTGGCCCCGGCTCAACTGGTACCGAACGGGTGGACCACACTACTCGATCTGTGGTTCACCTGGTGGAGCTTCAGTGCA
GGCGAAGAGAGTGTCCTCCTAGACGTAGGACAATTCTTTGACACTCACATCATCAAACCTTACAAGGAGCATCCGGGTCGGTACTACATATCGGCTCGGAATAGC
TTCTGCAAGATAGTGGACTCTCCCTCGGCCAATAAGCATTGGAGAGACAAATGGTTCCTCGTCTCGGGTCTCCTCGACGAGCTTGAACCCAACCGAGCTCGCCAC
CTCGAAATCATGGTGTTCAAACCATACAACTCTATGAACCGAAAGCGATCCACCGATCGGCCTGCAGCTGGTGATGCTTCTAAGAAACGTGGGAGGTCAGATGAG
GCTTCATCCGGGCGAAAAAGTCGATTTTCCGACCCGAAGGGGAAGCGCTTCTTCGATGCCTCGTCTCATTCTCGTTCAAAGCCAATCAGCATTCACTCGAAGGAC
GAAATGAGCACTCAGTACCTGCCGATAATGGACTTTTCATACCCCTTCAAGGGCAGTTCTGTTAGGGAGAGCATTCGTGAGGCTGCCCTGACAGCCTACAAGGCC
AGCTCGGCTCAGATGCTGGAATCAAGCCAGTCTTCTTTCTTGGAAAAGCCAAGTGATTACGTGCAGTGTCTGATTGACGACATTGCTCAGCTTCACTTTACAGCC
TTCCATACTAGGGCCATCGTCAGCCAGGAGTTGACTGCTAGGAAGGCCAGCTTCACCCGCGAATGTGAAGCTACCAAACAGACTGAGGAGCTTAGAGTTGAGGTG
GTGAAGCTCTGCAAGGCTAAGGCGACTGAAGAGCGTGTCGACAGGGAAGTGACCGAACGTGCTTCTGAAAAGGCTGACTTTGAAGCAAAGTTGAAGAACTTCGAC
TTCTTGGAATCAGCCATGAAGAAAGTTCCTAATTTTGATGACTTGGTGCGGGACTTAGATGACAGGGGCTTCGACATTGTCGTAGCCGAGGTAAAAAAGCTTGCC
TCTACACTGGACTTGGCTCCTATATATGCAGCCTTTGAGGCAGTTATGGAAGAGGATGAGGAAGGAGAAGCTCAAGCTGACCATCCTGTTGATGAGGGTACTGAA
GTGCGTGCGTTCAGCAAAACGCCCAGTCAGCGCCTCATCATTGGCTCGCTGGTCGAAATGCTCGACTCTGAAGGTGAACTGACCTTTCCTAGCAGGGACTACTTC
GATCAGGTTGAGGCCTCACTAAGCGAACGACCAATGACTGAAAATGGGTGTGAGGGGCTCCAGAGGCCGGTGCGGAATGGTGGCGAAGCGTTGACGGTTATCACA
CTTCTTCACAAACTGCTTTTTTGCAGCTCGACGCGCTACCTTCCTGGCTTCGGCACGGTCATCAAGAAGCTTCCCATTAAAGTAATCTTTGATGGGATCATCAAG
GTTAGAGAAGTTGGCTGA
Protein sequenceShow/hide protein sequence
MFQYKRREKKSSKRRAVQAKKLTVPMNEPKTRAAKAKAAEAKKKVVAPGPVDTIELDLSEGEEVETKWNATNLATRTYLMKSRKIMTELGFDLTLGDVPDDWRET
ARDKEWRPLIQPIQCEALELVREFYAAVHPESHIAIVRGKEIRFDATQINCTFNIKNIRDAVGNKILVTPTLEQLDEALECVGKPSATWDLTTHGKVRLKPEDVS
LAAAGWLYIVKNRILPTEHDEHVTQDRALLVYAMLKGIDVNYGELINTSIHECAHRTCGKLYHPRLVTSLCLRQGVQLPEDQIKRDAPIVEEKNIRRIIAHALQR
REALAAILGHPSSSTDTDPSPQPSWASSRSFRCKSLSVRRECADSLGDHLGVQINQKKQKVGNTYMGGVRRLGSLQKNSFSSNFALNEMRLPMRFGGSNRCIRVE
EVFHYQFEHDLTMGQFGGLTNEDPYSHLKSFIEIANAFQLPIVSGDALRLKMFPFSLMDGARTWLNALEQNSINTWAELTEKFLAKYHTLTRNTDLREDIVSFRQ
KENEAVQEAWERFKELLRRCPSHGLPACVQIEQFYRGLDRSSRMMLNTAANGSLLEKSANEIVDILNKMIDIHATCRVCRGQSDCSSLSRVEPNQNMGGNAVAST
SSRVKPCPELPPSICSLSNLFKLREVYRIPDNIEMRLPLVDKSLDNPSPGSVGFYPEMFDHGVRLPLHPFVQRFLSTTNLAPAQLVPNGWTTLLDLWFTWWSFSA
GEESVLLDVGQFFDTHIIKPYKEHPGRYYISARNSFCKIVDSPSANKHWRDKWFLVSGLLDELEPNRARHLEIMVFKPYNSMNRKRSTDRPAAGDASKKRGRSDE
ASSGRKSRFSDPKGKRFFDASSHSRSKPISIHSKDEMSTQYLPIMDFSYPFKGSSVRESIREAALTAYKASSAQMLESSQSSFLEKPSDYVQCLIDDIAQLHFTA
FHTRAIVSQELTARKASFTRECEATKQTEELRVEVVKLCKAKATEERVDREVTERASEKADFEAKLKNFDFLESAMKKVPNFDDLVRDLDDRGFDIVVAEVKKLA
STLDLAPIYAAFEAVMEEDEEGEAQADHPVDEGTEVRAFSKTPSQRLIIGSLVEMLDSEGELTFPSRDYFDQVEASLSERPMTENGCEGLQRPVRNGGEALTVIT
LLHKLLFCSSTRYLPGFGTVIKKLPIKVIFDGIIKVREVG