CuGenDBv2

Moc01g10230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10230
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:6355579..6357773
RNA-Seq Expression: Moc01g10230
Synteny: Moc01g10230
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
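The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, it can be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr1:6355579..6357773'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr1:6355579..6357773")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2195 bases on chr1.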


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e value: 2.8e-97; % identity: 56.05
Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        +SI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP V PIE+SR NSELAMVCGF SSVKRKSKGRAHAL+  QSS PVTPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDG
        A++D A                   G S       +E  S+G R +  R  + S                                    AL V    + 
Subjt:  ASEDPAQVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDG

Query:  REVWQRGRKRTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDA
        RE       + ELLK+E +R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQALE KD  +     EL+  KE L+NG LLE +FRQHPDFDGFA+DFSDA
Subjt:  REVWQRGRKRTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDA

Query:  GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP
        GFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        +VG+TQEG P
Subjt:  GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value: 4.6e-100; % identity: 90.43
Query:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
        P G GV F+L   FWLRARDSEEAELLDV+QLLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
Subjt:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL

Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIE SR NS LAMVC F S VKRKSKGRAHALEAAQSSKP TPAV GP
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVL
        ASEDPA V+
Subjt:  ASEDPAQVL

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value: 2.3e-107; % identity: 77.89
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDGREVWQRGRK----------------
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCL RASKFVS PGSVLQRTIDYA EAFVASIQSALAVKAELDGREV     K                
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDGREVWQRGRK----------------

Query:  ------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFD
                          + ELLKKE DRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHAT ELETAKE LSNGVLLEE+FRQHPDFD
Subjt:  ------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFD

Query:  GFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQADS
        GFA+DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA    S
Subjt:  GFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQADS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value: 8.3e-102; % identity: 91.39
Query:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
        P G GV F+L   FWLRARDSEEAEL DV+QLLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
Subjt:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL

Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSELAMVCGF S VKRKSKGRAHALEAAQSSKP TPAV GP
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVL
        ASEDPA V+
Subjt:  ASEDPAQVL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value: 7.2e-154; % identity: 59.28
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAV--------AGPASEDPAQV---------------------------------
         V  IE+SR NSELAMVCGF  SVKRKSKGRAHAL+    ++PVTP V        +GP+S  P  V                                 
Subjt:  AVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAV--------AGPASEDPAQV---------------------------------

Query:  -----------------------LPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFV
                               LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR L RASKFVSDPGSVLQRTID   EAF+
Subjt:  -----------------------LPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFV

Query:  ASIQSALAVKAELDGREVWQRGRK---------------------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQA
        ASI  A+ VKAELDGRE      +                                 + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q 
Subjt:  ASIQSALAVKAELDGREVWQRGRK---------------------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQA

Query:  LEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDL
        LE KD  +   TTEL+  KE L+NG LLEESFRQHPDFDGFA+DFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+L
Subjt:  LEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDL

Query:  DSDYSDPEED--------QVGSTQEGAP
        DSDYSD EE+        +VG+TQE  P
Subjt:  DSDYSDPEED--------QVGSTQEGAP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467; e value: 1.3e-97; % identity: 56.05
Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        +SI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP V PIE+SR NSELAMVCGF SSVKRKSKGRAHAL+  QSS PVTPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDG
        A++D A                   G S       +E  S+G R +  R  + S                                    AL V    + 
Subjt:  ASEDPAQVLPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDG

Query:  REVWQRGRKRTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDA
        RE       + ELLK+E +R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQALE KD  +     EL+  KE L+NG LLE +FRQHPDFDGFA+DFSDA
Subjt:  REVWQRGRKRTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDA

Query:  GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP
        GFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        +VG+TQEG P
Subjt:  GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP

A0A6J1CR42 uncharacterized protein LOC111013826; e value: 2.2e-100; % identity: 90.43
Query:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
        P G GV F+L   FWLRARDSEEAELLDV+QLLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
Subjt:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL

Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIE SR NS LAMVC F S VKRKSKGRAHALEAAQSSKP TPAV GP
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVL
        ASEDPA V+
Subjt:  ASEDPAQVL

A0A6J1D971 uncharacterized protein LOC111018538; e value: 1.1e-107; % identity: 77.89
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDGREVWQRGRK----------------
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCL RASKFVS PGSVLQRTIDYA EAFVASIQSALAVKAELDGREV     K                
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDGREVWQRGRK----------------

Query:  ------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFD
                          + ELLKKE DRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHAT ELETAKE LSNGVLLEE+FRQHPDFD
Subjt:  ------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFD

Query:  GFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQADS
        GFA+DFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA    S
Subjt:  GFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQADS

A0A6J1DXS5 uncharacterized protein LOC111025502; e value: 4.0e-102; % identity: 91.39
Query:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
        P G GV F+L   FWLRARDSEEAEL DV+QLLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL
Subjt:  PMGGGVSFSLWPSFWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNL

Query:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP
        VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAV PIESSR NSELAMVCGF S VKRKSKGRAHALEAAQSSKP TPAV GP
Subjt:  VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGP

Query:  ASEDPAQVL
        ASEDPA V+
Subjt:  ASEDPAQVL

A0A6J1DZB3 uncharacterized protein LOC111025665; e value: 3.5e-154; % identity: 59.28
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAV--------AGPASEDPAQV---------------------------------
         V  IE+SR NSELAMVCGF  SVKRKSKGRAHAL+    ++PVTP V        +GP+S  P  V                                 
Subjt:  AVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAV--------AGPASEDPAQV---------------------------------

Query:  -----------------------LPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFV
                               LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR L RASKFVSDPGSVLQRTID   EAF+
Subjt:  -----------------------LPASFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFV

Query:  ASIQSALAVKAELDGREVWQRGRK---------------------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQA
        ASI  A+ VKAELDGRE      +                                 + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q 
Subjt:  ASIQSALAVKAELDGREVWQRGRK---------------------------------RTELLKKEVDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQA

Query:  LEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDL
        LE KD  +   TTEL+  KE L+NG LLEESFRQHPDFDGFA+DFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+L
Subjt:  LEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDL

Query:  DSDYSDPEED--------QVGSTQEGAP
        DSDYSD EE+        +VG+TQE  P
Subjt:  DSDYSDPEED--------QVGSTQEGAP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCC
CTGAGAACATCCTCCTCAGGCTTCCGTAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGAC
TTCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCAACTGGGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGGGGTGTCATTTTCGCTTTGGCCATCC
TTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAAACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTTGTCGGTTC
TATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAG
GACGAGTCAGGTCGTTCCTTCTTTGACGTCCCTACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAA
TACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCATCCC
ATTGAATCCTCAAGGTCGAACTCTGAACTTGCCATGGTTTGCGGATTTGTAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAG
AGTTCGAAACCTGTCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCAGGTCTTGCCTGCAAGTTTCGCAGATCGTGTGGACGATCCTGCTGCCAGG
ATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGC
CTAATGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGTCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCT
GTAAAGGCCGAGCTGGATGGGAGGGAAGTTTGGCAGCGAGGGAGAAAGAGGACCGAGCTGCTGAAGAAGGAAGTGGACAGGCGCAAGGCCCAACTCCGAGCTGCC
CACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTTGAGCATGCG
ACTACCGAGCTGGAGACGGCGAAAGAGCACCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCCAAGACTTTTCT
GACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTAGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAA
GAGGGCGCTCCTCAAGCAGACTCTTAG
mRNA sequence
ATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCC
CTGAGAACATCCTCCTCAGGCTTCCGTAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGAC
TTCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCAACTGGGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGGGGTGTCATTTTCGCTTTGGCCATCC
TTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAAACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTTGTCGGTTC
TATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAG
GACGAGTCAGGTCGTTCCTTCTTTGACGTCCCTACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAA
TACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCATCCC
ATTGAATCCTCAAGGTCGAACTCTGAACTTGCCATGGTTTGCGGATTTGTAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAG
AGTTCGAAACCTGTCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCAGGTCTTGCCTGCAAGTTTCGCAGATCGTGTGGACGATCCTGCTGCCAGG
ATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGC
CTAATGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGTCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCT
GTAAAGGCCGAGCTGGATGGGAGGGAAGTTTGGCAGCGAGGGAGAAAGAGGACCGAGCTGCTGAAGAAGGAAGTGGACAGGCGCAAGGCCCAACTCCGAGCTGCC
CACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTTGAGCATGCG
ACTACCGAGCTGGAGACGGCGAAAGAGCACCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCCAAGACTTTTCT
GACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTAGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAA
GAGGGCGCTCCTCAAGCAGACTCTTAG
Protein sequence
MTGRIVTPPLQVRVWNTLLGYLSTTSDPFVGGSLSLRTSSSGFRRRGRELTILQRDGSLSTSKCLSTASDFPSPFCPRISLPTGGWLRLKWPPMGGGVSFSLWPS
FWLRARDSEEAELLDVNQLLACFEAKRIAKKPCRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLK
YYKERFPRGRKVGTLVTDELLLESGLLDYNPAVHPIESSRSNSELAMVCGFVSSVKRKSKGRAHALEAAQSSKPVTPAVAGPASEDPAQVLPASFADRVDDPAAR
MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLMRASKFVSDPGSVLQRTIDYAVEAFVASIQSALAVKAELDGREVWQRGRKRTELLKKEVDRRKAQLRAA
HAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATTELETAKEHLSNGVLLEESFRQHPDFDGFAQDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWAS
GPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQADS
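
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code, which this page does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical and are not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 11 codons and last 9 codons of the CDS listed above.
print(translate("ATGACGGGGAGGATAGTGACGCCTCCACTTCAG"))
print(translate("GAGGGCGCTCCTCAAGCAGACTCTTAG"))
```

The two fragments translate to `MTGRIVTPPLQ` and `EGAPQADS*`, which agree with the start and end of the listed protein sequence. The trailing `*` is the TAG stop codon.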