; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:15366330..15368989
RNA-Seq ExpressionMoc04g21100
SyntenyMoc04g21100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.1e-10349.8Show/hide
Query:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHF
        +RRKKKKAIS SEVGACRVLPAG+ADRVDDPAARMGGTSDVT RFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAE   V   
Subjt:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHF

Query:  LLDFAQQAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ--------------------------------------------
            A +AELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                            
Subjt:  LLDFAQQAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETA
                                                AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  
Subjt:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETA

Query:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDS----
        KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++D   LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYSD +ED+V S    
Subjt:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDS----

Query:  ----TQEGAP
            TQEG P
Subjt:  ----TQEGAP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]5.1e-11280.73Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA--------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIL WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA--------------

Query:  -----------------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
                                     VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  -----------------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  GVKRKSKGRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEA
        GVKRKSKGRA+A EAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPR QTEAVDAQTEA D PPLGE A
Subjt:  GVKRKSKGRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.1e-12688.46Show/hide
Query:  GTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALEAASSTMK
        G   +  + RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAE   V       A +AELDGRE LAAREKEEFSAALE ASSTMK
Subjt:  GTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALEAASSTMK

Query:  DELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEESFRQHPDF
        DELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSNGVLLEE+FRQHPDF
Subjt:  DELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        DGFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.0e-13275.07Show/hide
Query:  MSSSISSNLGSDLAR----------------------------------RIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLAR                                  RIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLAR----------------------------------RIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA---------------------
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIL WLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                      
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA---------------------

Query:  ----------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK
                              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FASGVKRKSK
Subjt:  ----------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK

Query:  GRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVD
        GRA+A EAAQSSKP TPAVVGPASEDPA VIELESSGGPSREKRPR QTEAVD
Subjt:  GRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.1e-15062.98Show/hide
Query:  FYMCARKGAAVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAYAFEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR IE+SRPNSELAMVC F   VKRKSKGRA+A +    +
Subjt:  FYMCARKGAAVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAYAFEAAQSS

Query:  KPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVD
        +P TP V         GP+S  P PVIEL+ SGG S EKR R        ++EA+D  PL  E R E+PL+RRRKKKK  S SE GA   LP   AD VD
Subjt:  KPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVD

Query:  DPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALE
        DP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PGSVLQRTID  AE   +    L    +AELDGREALAA+E+E   AALE
Subjt:  DPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALE

Query:  AASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEES
        AA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEES
Subjt:  AASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEES

Query:  FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEED--------QVDSTQEGAP
        FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP+GTP PQ+LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEED--------QVDSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124675.5e-10449.8Show/hide
Query:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHF
        +RRKKKKAIS SEVGACRVLPAG+ADRVDDPAARMGGTSDVT RFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAE   V   
Subjt:  RRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHF

Query:  LLDFAQQAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ--------------------------------------------
            A +AELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                            
Subjt:  LLDFAQQAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETA
                                                AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  
Subjt:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETA

Query:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDS----
        KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++D   LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYSD +ED+V S    
Subjt:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDS----

Query:  ----TQEGAP
            TQEG P
Subjt:  ----TQEGAP

A0A6J1CR42 uncharacterized protein LOC1110138262.5e-11280.73Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA--------------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIL WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA               
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA--------------

Query:  -----------------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
                                     VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS
Subjt:  -----------------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  GVKRKSKGRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEA
        GVKRKSKGRA+A EAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPR QTEAVDAQTEA D PPLGE A
Subjt:  GVKRKSKGRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEA

A0A6J1D971 uncharacterized protein LOC1110185381.0e-12688.46Show/hide
Query:  GTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALEAASSTMK
        G   +  + RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAE   V       A +AELDGRE LAAREKEEFSAALE ASSTMK
Subjt:  GTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALEAASSTMK

Query:  DELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEESFRQHPDF
        DELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSNGVLLEE+FRQHPDF
Subjt:  DELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS
        DGFAKDFSDAGFKFLMKGIASDMPDLQID SGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDPEEDQV STQEGA P GS
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS

A0A6J1DXS5 uncharacterized protein LOC1110255029.6e-13375.07Show/hide
Query:  MSSSISSNLGSDLAR----------------------------------RIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLAR                                  RIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLAR----------------------------------RIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA---------------------
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIL WLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA                      
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAA---------------------

Query:  ----------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK
                              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVC FASGVKRKSK
Subjt:  ----------------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSK

Query:  GRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVD
        GRA+A EAAQSSKP TPAVVGPASEDPA VIELESSGGPSREKRPR QTEAVD
Subjt:  GRAYAFEAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256653.0e-15062.98Show/hide
Query:  FYMCARKGAAVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAYAFEAAQSS
        F +  R G  VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR IE+SRPNSELAMVC F   VKRKSKGRA+A +    +
Subjt:  FYMCARKGAAVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAYAFEAAQSS

Query:  KPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVD
        +P TP V         GP+S  P PVIEL+ SGG S EKR R        ++EA+D  PL  E R E+PL+RRRKKKK  S SE GA   LP   AD VD
Subjt:  KPPTPAV--------VGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVD

Query:  DPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALE
        DP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PGSVLQRTID  AE   +    L    +AELDGREALAA+E+E   AALE
Subjt:  DPAARMGGTSDVTTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALE

Query:  AASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEES
        AA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEES
Subjt:  AASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEES

Query:  FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEED--------QVDSTQEGAP
        FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQID +GLK++Y+EKWASGP+GTP PQ+LVD+YVR+LDSDYSD EE+        +V +TQE  P
Subjt:  FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEED--------QVDSTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATCTTGGAGCACCAATAGGGGACTTCCACGTGTCCAGAGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACACTTCGGATCTCAGA
GAGAATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTA
GGATCCGATTTAGCTCGTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGC
TGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGG
CTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCGTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCTGAGCTGTTGGACGTAGACCAG
CTCCTCGCGTGCTTCGAGGCAAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGCGGTTTCAATCCGACCAGTCCCCGAGCTTACGCA
GGCCTCTTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACA
ACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCAGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCTATGCTTTT
GAGGCTGCCCAGAGTTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAA
GCGCCCCAGGGGTCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGAGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAAA
AAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTG
ACGACGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAG
CGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGTAAGACTAGTGCCCCATTTTTTGCTTGATTTCGCCCAACAGGCCGAGCTGGATGGGAGGGAAG
CTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCC
GAGGTGGAGTCTCAAGCCGAGCTACTGAAGAAGGAGGAGGATAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCT
CCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGTGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCC
TGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGAC
CTTCAGATCGATTTCAGTGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCT
GGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATCTTGGAGCACCAATAGGGGACTTCCACGTGTCCAGAGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACACTTCGGATCTCAGA
GAGAATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTA
GGATCCGATTTAGCTCGTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGC
TGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGG
CTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCGTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCTGAGCTGTTGGACGTAGACCAG
CTCCTCGCGTGCTTCGAGGCAAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGCGGTTTCAATCCGACCAGTCCCCGAGCTTACGCA
GGCCTCTTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACA
ACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCAGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCTATGCTTTT
GAGGCTGCCCAGAGTTCGAAACCTCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAA
GCGCCCCAGGGGTCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGAGAGGAGGCGAGGGAGGAAGCCCCTCTAAAGCGAAGAAGGAAAA
AAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAGGTTGGGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTG
ACGACGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACCCGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAG
CGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGTAAGACTAGTGCCCCATTTTTTGCTTGATTTCGCCCAACAGGCCGAGCTGGATGGGAGGGAAG
CTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCC
GAGGTGGAGTCTCAAGCCGAGCTACTGAAGAAGGAGGAGGATAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCT
CCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGTGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCC
TGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGAC
CTTCAGATCGATTTCAGTGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCT
GGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSHLGAPIGDFHVSRVFPSPNIGPLSVWSDLDLAEKFIRLALDTWRLPIRGKIQPSRKIYRRNIQIFRHFGSQRESQPLVDYTSRTLGRSVSSLSLSNIIAMSSSISSNL
GSDLARRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILVWLRARDSEEAELLDVDQ
LLACFEAKRIAKKPGRFYMCARKGAAVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCRFASGVKRKSKGRAYAF
EAAQSSKPPTPAVVGPASEDPAPVIELESSGGPSREKRPRGQTEAVDAQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDV
TTRFRIEPSSLGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEVRLVPHFLLDFAQQAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVETLKA
EVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEVKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPD
LQIDFSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEEDQVDSTQEGAPPAGS