; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g26610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g26610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:20055723..20061065
RNA-Seq ExpressionMoc06g26610
SyntenyMoc06g26610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.3e-8568.32Show/hide
Query:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGW  KWF+ASGEWLAKDES              V+I+P+ ELTQA+FDTLKYYK+HFPR RK+GTLV DKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP

Query:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSP-PCEVRE
         VRP+E+SRPNSELA+VCGF S+VKRKSKG+AHAL+  QSS P TPAV         GP+SE P PVIEL+S+   SREKR   ++ A+DVSP   EVRE
Subjt:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSP-PCEVRE

Query:  GSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQ
          PLKRRRKKKK TS  EVG R  LP S AD VDDPEARMGGT DV  RFR+EPSSS V+DQ
Subjt:  GSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQ

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.1e-7052.99Show/hide
Query:  VSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQP
        +SIKPI EL QATFDTLK+YKD+FPR RKIGTLV DKLLLESGLLDYNPLVRP+EASRPNSELA+VCGFTSSVKRKSKGRAHALK VQSS P TPAVDQ 
Subjt:  VSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQP

Query:  AVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREGSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRM
        A QDQ GPSS  PTPVIELDSTGERSREKRS SES ALDVSP  EVR                                                     
Subjt:  AVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREGSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRM

Query:  EPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAERRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRA
                                                                                        EAKAELLK+EDERHKAHLRA
Subjt:  EPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAERRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRA

Query:  AHAITKGLKKEKFQLLKEKDDMLQALETKDAAIG
        AHAITKGL+KEKFQLLKEKDDMLQALE KDAAIG
Subjt:  AHAITKGLKKEKFQLLKEKDDMLQALETKDAAIG

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.7e-7773.76Show/hide
Query:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK
        ++RIAKKPGR+YMCARKGAGGIVKGPTSIKGW  KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+P+ ELTQA+FDTLKYYK+ FPR RK+GTLV D+
Subjt:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK

Query:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA
        LLLESGLLDYNP VRP+E SRPNS LA+VC F S VKRKSKGRAHAL+  QSS P TPAV         GP+SE P PVIEL+S+G  SREKR   ++ A
Subjt:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA

Query:  LD
        +D
Subjt:  LD

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.5e-7874.75Show/hide
Query:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK
        ++RIAKKPGR+YMCARKGAGGIVKGPTSIKGW  KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+P+ ELTQA+FDTLKYYK+ FPR RK+GTLV D+
Subjt:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK

Query:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA
        LLLESGLLDYNP VRP+E+SRPNSELA+VCGF S VKRKSKGRAHAL+  QSS PATPAV         GP+SE P  VIEL+S+G  SREKR   ++ A
Subjt:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA

Query:  LD
        +D
Subjt:  LD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.6e-15875.79Show/hide
Query:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGW GKWFFASGEWLAKDESGR FFDVPTRFGNLVSIK I EL QATFDTLK+YKDHFPRDRKI TLV DKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP

Query:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREG
        LVR +EASRPNSELA+VCGFT SVKRKSKGRAHALKTV  + P TP V +   Q  +GPSS VPTPVIELD +G RS EKRS  ES ALDVSP  EVR  
Subjt:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREG

Query:  SPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAE---
        SPL+RRRKKKK +SSSE G R  LPTSHADLVDDPEARM GTS+V+MRF MEPSSS VKDQV RISA CLDR +RRASKFVSDPGSVLQRTID+ AE   
Subjt:  SPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAE---

Query:  ---------------------RRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRAAHAITKGLKKEKFQLLKEKDDMLQA
                             + R +S AALEAATTLKGELLKAQ EVDILRAE++AK +LLKKE E+HKAHLRAAHAITKGL+KEKFQLLKEKDD+ Q 
Subjt:  ---------------------RRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRAAHAITKGLKKEKFQLLKEKDDMLQA

Query:  LETKDAAIG
        LE KDA+IG
Subjt:  LETKDAAIG

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092986.5e-8668.32Show/hide
Query:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGW  KWF+ASGEWLAKDES              V+I+P+ ELTQA+FDTLKYYK+HFPR RK+GTLV DKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP

Query:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSP-PCEVRE
         VRP+E+SRPNSELA+VCGF S+VKRKSKG+AHAL+  QSS P TPAV         GP+SE P PVIEL+S+   SREKR   ++ A+DVSP   EVRE
Subjt:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSP-PCEVRE

Query:  GSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQ
          PLKRRRKKKK TS  EVG R  LP S AD VDDPEARMGGT DV  RFR+EPSSS V+DQ
Subjt:  GSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQ

A0A6J1CLV1 uncharacterized protein LOC1110124671.0e-7052.99Show/hide
Query:  VSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQP
        +SIKPI EL QATFDTLK+YKD+FPR RKIGTLV DKLLLESGLLDYNPLVRP+EASRPNSELA+VCGFTSSVKRKSKGRAHALK VQSS P TPAVDQ 
Subjt:  VSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQP

Query:  AVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREGSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRM
        A QDQ GPSS  PTPVIELDSTGERSREKRS SES ALDVSP  EVR                                                     
Subjt:  AVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREGSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRM

Query:  EPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAERRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRA
                                                                                        EAKAELLK+EDERHKAHLRA
Subjt:  EPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAERRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRA

Query:  AHAITKGLKKEKFQLLKEKDDMLQALETKDAAIG
        AHAITKGL+KEKFQLLKEKDDMLQALE KDAAIG
Subjt:  AHAITKGLKKEKFQLLKEKDDMLQALETKDAAIG

A0A6J1CR42 uncharacterized protein LOC1110138263.2e-7773.76Show/hide
Query:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK
        ++RIAKKPGR+YMCARKGAGGIVKGPTSIKGW  KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+P+ ELTQA+FDTLKYYK+ FPR RK+GTLV D+
Subjt:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK

Query:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA
        LLLESGLLDYNP VRP+E SRPNS LA+VC F S VKRKSKGRAHAL+  QSS P TPAV         GP+SE P PVIEL+S+G  SREKR   ++ A
Subjt:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA

Query:  LD
        +D
Subjt:  LD

A0A6J1DXS5 uncharacterized protein LOC1110255021.7e-7874.75Show/hide
Query:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK
        ++RIAKKPGR+YMCARKGAGGIVKGPTSIKGW  KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+P+ ELTQA+FDTLKYYK+ FPR RK+GTLV D+
Subjt:  SQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDK

Query:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA
        LLLESGLLDYNP VRP+E+SRPNSELA+VCGF S VKRKSKGRAHAL+  QSS PATPAV         GP+SE P  VIEL+S+G  SREKR   ++ A
Subjt:  LLLESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGA

Query:  LD
        +D
Subjt:  LD

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-15875.79Show/hide
Query:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGW GKWFFASGEWLAKDESGR FFDVPTRFGNLVSIK I EL QATFDTLK+YKDHFPRDRKI TLV DKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLLLESGLLDYNP

Query:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREG
        LVR +EASRPNSELA+VCGFT SVKRKSKGRAHALKTV  + P TP V +   Q  +GPSS VPTPVIELD +G RS EKRS  ES ALDVSP  EVR  
Subjt:  LVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPCEVREG

Query:  SPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAE---
        SPL+RRRKKKK +SSSE G R  LPTSHADLVDDPEARM GTS+V+MRF MEPSSS VKDQV RISA CLDR +RRASKFVSDPGSVLQRTID+ AE   
Subjt:  SPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAE---

Query:  ---------------------RRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRAAHAITKGLKKEKFQLLKEKDDMLQA
                             + R +S AALEAATTLKGELLKAQ EVDILRAE++AK +LLKKE E+HKAHLRAAHAITKGL+KEKFQLLKEKDD+ Q 
Subjt:  ---------------------RRRGSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRAAHAITKGLKKEKFQLLKEKDDMLQA

Query:  LETKDAAIG
        LE KDA+IG
Subjt:  LETKDAAIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCGAAGCCAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGG
GGAGGGAAGTGGTTCTTTGCCTCTGGGGAATGGCTGGCAAAGGACGAGTCAGGTCGTACCTTCTTTGACGTTCCCACCAGGTTTGGGAACTTAGTATCAATCAAA
CCAATTCTCGAGCTCACTCAAGCCACCTTCGACACCCTCAAGTACTACAAGGATCACTTCCCAAGGGACCGGAAGATCGGAACCTTAGTGATCGACAAGCTGCTC
CTAGAGTCTGGGTTGTTAGATTACAACCCCTTGGTGCGTCCGGTTGAAGCTTCAAGGCCAAACTCCGAACTCGCCATAGTGTGTGGATTCACCAGCAGCGTGAAG
CGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACTGTTCAAAGCTCTCATCCAGCGACTCCTGCTGTGGATCAACCTGCGGTTCAGGACCAGACTGGGCCATCC
TCTGAAGTTCCAACTCCGGTGATCGAGTTGGATTCTACTGGGGAGCGCTCTAGGGAGAAGCGTTCGATGAGCGAATCCGGGGCGCTGGACGTGTCGCCTCCTTGC
GAGGTGAGGGAGGGCTCTCCTTTAAAGAGGAGAAGGAAAAAGAAGAAAGCCACCTCTTCCTCGGAGGTTGGACCTCGCAGCCCCCTGCCCACGAGCCATGCCGAC
CTGGTGGACGACCCTGAAGCTCGGATGGGGGGGACGTCTGACGTGAAGATGCGGTTCAGAATGGAACCGTCAAGCTCCGCGGTGAAGGACCAGGTATTTCGCATC
TCGGCTGCATGCTTGGATCGCTGTATTAGGAGAGCATCCAAGTTTGTGAGCGATCCAGGGTCCGTGCTGCAACGGACAATTGACCACGCTGCCGAGCGAAGGAGA
GGGAGCTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTGAAGGCCCAGAGCGAGGTGGATATTTTGAGGGCCGAGATAGAAGCCAAGGCT
GAGCTGCTGAAGAAGGAGGATGAGAGGCATAAGGCCCACCTCCGAGCTGCCCACGCAATCACTAAAGGGCTGAAGAAAGAGAAGTTCCAACTCCTAAAGGAGAAG
GATGACATGCTTCAGGCCCTTGAGACGAAGGATGCTGCAATTGGGTTCCCAAGGCGAGGCAGAGCTGCAAGGAGTTACACACTTAAGCAGAGGCAAAGAAAGCCA
CGTCGGTACAATGCTCCATCTCGGAGTATGAACCGGGCTGCTTTTCGCGCCATCTTCTTTTGCTCCTTCAGATCTTGCGGCGGATTTCCTTTGATGAACTCCATG
ACTGGATCCATCCATGAGGGTGATGGAGTGTCAATCTCCATCACGTCTGGGTCCAAGATTGAAGGATTGTCCAAGATCTTGACTGGGACCGACCTGGCCAGACCA
TGCAAAGACATCCGAGTTGGACCTGAGGAAGTTGATCAGCTCCTCCCTAGCAGTGGCCCTCAGCTTGGTTCCTATGCTTACTTGTCTTTTAGGGCTAAGCAAAGG
AACAAGCTCAAGCTCCTTTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTCGAAGCCAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGTGGTATAGTCAAGGGGCCGACCTCCATCAAAGGATGG
GGAGGGAAGTGGTTCTTTGCCTCTGGGGAATGGCTGGCAAAGGACGAGTCAGGTCGTACCTTCTTTGACGTTCCCACCAGGTTTGGGAACTTAGTATCAATCAAA
CCAATTCTCGAGCTCACTCAAGCCACCTTCGACACCCTCAAGTACTACAAGGATCACTTCCCAAGGGACCGGAAGATCGGAACCTTAGTGATCGACAAGCTGCTC
CTAGAGTCTGGGTTGTTAGATTACAACCCCTTGGTGCGTCCGGTTGAAGCTTCAAGGCCAAACTCCGAACTCGCCATAGTGTGTGGATTCACCAGCAGCGTGAAG
CGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACTGTTCAAAGCTCTCATCCAGCGACTCCTGCTGTGGATCAACCTGCGGTTCAGGACCAGACTGGGCCATCC
TCTGAAGTTCCAACTCCGGTGATCGAGTTGGATTCTACTGGGGAGCGCTCTAGGGAGAAGCGTTCGATGAGCGAATCCGGGGCGCTGGACGTGTCGCCTCCTTGC
GAGGTGAGGGAGGGCTCTCCTTTAAAGAGGAGAAGGAAAAAGAAGAAAGCCACCTCTTCCTCGGAGGTTGGACCTCGCAGCCCCCTGCCCACGAGCCATGCCGAC
CTGGTGGACGACCCTGAAGCTCGGATGGGGGGGACGTCTGACGTGAAGATGCGGTTCAGAATGGAACCGTCAAGCTCCGCGGTGAAGGACCAGGTATTTCGCATC
TCGGCTGCATGCTTGGATCGCTGTATTAGGAGAGCATCCAAGTTTGTGAGCGATCCAGGGTCCGTGCTGCAACGGACAATTGACCACGCTGCCGAGCGAAGGAGA
GGGAGCTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGCGAGCTGCTGAAGGCCCAGAGCGAGGTGGATATTTTGAGGGCCGAGATAGAAGCCAAGGCT
GAGCTGCTGAAGAAGGAGGATGAGAGGCATAAGGCCCACCTCCGAGCTGCCCACGCAATCACTAAAGGGCTGAAGAAAGAGAAGTTCCAACTCCTAAAGGAGAAG
GATGACATGCTTCAGGCCCTTGAGACGAAGGATGCTGCAATTGGGTTCCCAAGGCGAGGCAGAGCTGCAAGGAGTTACACACTTAAGCAGAGGCAAAGAAAGCCA
CGTCGGTACAATGCTCCATCTCGGAGTATGAACCGGGCTGCTTTTCGCGCCATCTTCTTTTGCTCCTTCAGATCTTGCGGCGGATTTCCTTTGATGAACTCCATG
ACTGGATCCATCCATGAGGGTGATGGAGTGTCAATCTCCATCACGTCTGGGTCCAAGATTGAAGGATTGTCCAAGATCTTGACTGGGACCGACCTGGCCAGACCA
TGCAAAGACATCCGAGTTGGACCTGAGGAAGTTGATCAGCTCCTCCCTAGCAGTGGCCCTCAGCTTGGTTCCTATGCTTACTTGTCTTTTAGGGCTAAGCAAAGG
AACAAGCTCAAGCTCCTTTATTAG
Protein sequenceShow/hide protein sequence
MLRSQRIAKKPGRYYMCARKGAGGIVKGPTSIKGWGGKWFFASGEWLAKDESGRTFFDVPTRFGNLVSIKPILELTQATFDTLKYYKDHFPRDRKIGTLVIDKLL
LESGLLDYNPLVRPVEASRPNSELAIVCGFTSSVKRKSKGRAHALKTVQSSHPATPAVDQPAVQDQTGPSSEVPTPVIELDSTGERSREKRSMSESGALDVSPPC
EVREGSPLKRRRKKKKATSSSEVGPRSPLPTSHADLVDDPEARMGGTSDVKMRFRMEPSSSAVKDQVFRISAACLDRCIRRASKFVSDPGSVLQRTIDHAAERRR
GSSSAALEAATTLKGELLKAQSEVDILRAEIEAKAELLKKEDERHKAHLRAAHAITKGLKKEKFQLLKEKDDMLQALETKDAAIGFPRRGRAARSYTLKQRQRKP
RRYNAPSRSMNRAAFRAIFFCSFRSCGGFPLMNSMTGSIHEGDGVSISITSGSKIEGLSKILTGTDLARPCKDIRVGPEEVDQLLPSSGPQLGSYAYLSFRAKQR
NKLKLLY