; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g39480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g39480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:29321972..29324260
RNA-Seq ExpressionMoc04g39480
SyntenyMoc04g39480
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.1e-9384.44Show/hide
Query:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRR
        AVRPIESSRPNSELAMVC FAS VKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PL EE REE P KRRR
Subjt:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRR

Query:  KKKKAISPSEVGACRVLPASFADRV
        KKKK  SP EVGA  VLPASFADRV
Subjt:  KKKKAISPSEVGACRVLPASFADRV

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]8.9e-12385.82Show/hide
Query:  MFEYGLRLPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLP HPFVQEFLFRTGLAPAQ                        EA+LLDVDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLDEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPL E A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLDEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]8.1e-10075.96Show/hide
Query:  EVGACRVL------PAS--FADRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTM
        EVG  R+L      P+S    D+VSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEF AALE ASSTM
Subjt:  EVGACRVL------PAS--FADRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTM

Query:  KDELLKAHSEVETLKAE---------------------------------------------ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPD
        KDELLKAHSEVETLKAE                                             ALEAKDKELEHATAELET KERL+NGVLLEEAFRQHPD
Subjt:  KDELLKAHSEVETLKAE---------------------------------------------ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
        FDGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEGA P GS
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.9e-17390.37Show/hide
Query:  MSSSISSNLGSELARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL S+LARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSELARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASG
        LP HPFVQEFLFRTGLAPAQ                        EA+L DVDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]9.4e-14959.06Show/hide
Query:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEARE
         VR IE+SRPNSELAMVC F   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ E R 
Subjt:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEARE

Query:  EAPPKRRRKKKKAISPSEVGACRVLPASFA------------------------------DRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF
        E+P +RRRKKKK  S SE GA   LP S A                              D+VSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVLPASFA------------------------------DRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------------
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                            
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------------

Query:  -ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVR
          LE KD  +   T EL+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  -ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDPEED--------QVGSPQEGAP
        +LDSDYSD EE+        +VG+ QE  P
Subjt:  DLDSDYSDPEED--------QVGSPQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092985.5e-9484.44Show/hide
Query:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRR
        AVRPIESSRPNSELAMVC FAS VKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PL EE REE P KRRR
Subjt:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRR

Query:  KKKKAISPSEVGACRVLPASFADRV
        KKKK  SP EVGA  VLPASFADRV
Subjt:  KKKKAISPSEVGACRVLPASFADRV

A0A6J1CR42 uncharacterized protein LOC1110138264.3e-12385.82Show/hide
Query:  MFEYGLRLPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLP HPFVQEFLFRTGLAPAQ                        EA+LLDVDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLDEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPL E A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLDEEA

A0A6J1D971 uncharacterized protein LOC1110185383.9e-10075.96Show/hide
Query:  EVGACRVL------PAS--FADRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTM
        EVG  R+L      P+S    D+VSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEF AALE ASSTM
Subjt:  EVGACRVL------PAS--FADRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTM

Query:  KDELLKAHSEVETLKAE---------------------------------------------ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPD
        KDELLKAHSEVETLKAE                                             ALEAKDKELEHATAELET KERL+NGVLLEEAFRQHPD
Subjt:  KDELLKAHSEVETLKAE---------------------------------------------ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
        FDGFAKDFSDAGFKFLMKGIASDMPDLQIDL GLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEGA P GS
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

A0A6J1DXS5 uncharacterized protein LOC1110255029.2e-17490.37Show/hide
Query:  MSSSISSNLGSELARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL S+LARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSELARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASG
        LP HPFVQEFLFRTGLAPAQ                        EA+L DVDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPFHPFVQEFLFRTGLAPAQ------------------------EAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256654.6e-14959.06Show/hide
Query:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEARE
         VR IE+SRPNSELAMVC F   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ E R 
Subjt:  AVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEARE

Query:  EAPPKRRRKKKKAISPSEVGACRVLPASFA------------------------------DRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF
        E+P +RRRKKKK  S SE GA   LP S A                              D+VSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVLPASFA------------------------------DRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------------
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AE                                            
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFLAALEAASSTMKDELLKAHSEVETLKAE--------------------------------------------

Query:  -ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVR
          LE KD  +   T EL+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  -ALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDPEED--------QVGSPQEGAP
        +LDSDYSD EE+        +VG+ QE  P
Subjt:  DLDSDYSDPEED--------QVGSPQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTATTAGTAGCAACCTAGGATCCGAGTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGCTTGGAATACCCTTCAAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCTTTCACCCTTTTGTCCAAGAATTT
CTTTTCCGGACTGGGTTGGCTCCGGCTCAAGAGGCCAAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAGAGGATAGCTAAGAAGCCTGGTCGGTTCTA
TATGTGCCCAAGGAAAGGCGCAGGCGGCATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCGAAGGACGAGT
CAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAG
CGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTAGTGACTGACGAACTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCC
GAACTCCGAACTTGCCATGGTTTGCAGATTTGCAAGCGGTGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTG
CCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CCGCCTTTGGACGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGAGTCTTGCCTGCAAG
TTTTGCAGATCGGGTGTCTCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGAACATCGACT
ACGCCGCCGAGGCGTTCGTTGCTTCCATCCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGACGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCCTCGCTGCC
TTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAAGCCGAGGCGCTCGAAGCGAAAGATAAGGAGCTGGAGCATGC
GACTGCCGAGCTGGAGACGACGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCCGACG
CGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGGGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGC
GGCACCCCTGGCCCCCAAGCATTGGTGGATCAGTACGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCGCTCCCCC
AGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTATTAGTAGCAACCTAGGATCCGAGTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGCTTGGAATACCCTTCAAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCTTTCACCCTTTTGTCCAAGAATTT
CTTTTCCGGACTGGGTTGGCTCCGGCTCAAGAGGCCAAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAGAGGATAGCTAAGAAGCCTGGTCGGTTCTA
TATGTGCCCAAGGAAAGGCGCAGGCGGCATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCGAAGGACGAGT
CAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAG
CGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTAGTGACTGACGAACTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCC
GAACTCCGAACTTGCCATGGTTTGCAGATTTGCAAGCGGTGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTG
CCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CCGCCTTTGGACGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGAGTCTTGCCTGCAAG
TTTTGCAGATCGGGTGTCTCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGAACATCGACT
ACGCCGCCGAGGCGTTCGTTGCTTCCATCCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGACGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCCTCGCTGCC
TTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAAGCCGAGGCGCTCGAAGCGAAAGATAAGGAGCTGGAGCATGC
GACTGCCGAGCTGGAGACGACGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCCGACG
CGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGGGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGC
GGCACCCCTGGCCCCCAAGCATTGGTGGATCAGTACGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCGCTCCCCC
AGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSISSNLGSELARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPFHPFVQEF
LFRTGLAPAQEAKLLDVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE
RFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
PPLDEEAREEAPPKRRRKKKKAISPSEVGACRVLPASFADRVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFLAA
LEAASSTMKDELLKAHSEVETLKAEALEAKDKELEHATAELETTKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLRGLKRRYAEKWASGPG
GTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS