; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:3800434..3803385
RNA-Seq ExpressionMoc07g04430
SyntenyMoc07g04430
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.9e-11152.06Show/hide
Query:  RRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEV ACRVLPA FADRVDDPAARMGG SDVTARFR+EPSSSGVRDQVSRISAASLDRCL+RASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQ LE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLD DYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]5.1e-13693.68Show/hide
Query:  GMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCL+RASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLD DYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]7.9e-10582.12Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCL+RASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQ LE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS
                      +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLD DYSDP+EDQVGSTQEGAP AGS
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.8e-13496.41Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASISSQGLEYPSRIPEHYLGSLRRGFAIPENILLRLSEEGERADNPLEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDAS S QGLEYPSRIPEHYLGSLRRGFAIPENILLRL EEGERADNP EGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASISSQGLEYPSRIPEHYLGSLRRGFAIPENILLRLSEEGERADNPLEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKCAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARK AGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKCAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.6e-14858.77Show/hide
Query:  MCARKCAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKFVPL--------------NP
        MCARK  GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK V L              NP
Subjt:  MCARKCAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKFVPL--------------NP

Query:  QGRTLNLVCPRDQ---------------------------TEAVD--------------------------------------AQTEAVDAPPLGEEVRE
          R +    P  +                           TE V                                        ++EA+D  PL  EVR 
Subjt:  QGRTLNLVCPRDQ---------------------------TEAVD--------------------------------------AQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE  A   LP S AD VDDP ARM G S+V  RF +EPSSSGV+DQVSRISA  LDR L+RASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML

Query:  QVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        QVLE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDFDYSDPEED--------QVGSTQEGAP--QAGS
        +LD DYSD EE+        +VG+TQE  P  Q GS
Subjt:  DLDFDYSDPEED--------QVGSTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124679.4e-11252.06Show/hide
Query:  RRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEV ACRVLPA FADRVDDPAARMGG SDVTARFR+EPSSSGVRDQVSRISAASLDRCL+RASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQ LE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLD DYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185382.4e-13693.68Show/hide
Query:  GMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCL+RASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLD DYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS

A0A6J1DVF6 uncharacterized protein LOC1110247403.8e-10582.12Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCL+RASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQ LE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS
                      +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLD DYSDP+EDQVGSTQEGAP AGS
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255021.3e-13496.41Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASISSQGLEYPSRIPEHYLGSLRRGFAIPENILLRLSEEGERADNPLEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDAS S QGLEYPSRIPEHYLGSLRRGFAIPENILLRL EEGERADNP EGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASISSQGLEYPSRIPEHYLGSLRRGFAIPENILLRLSEEGERADNPLEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKCAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARK AGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKCAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRK

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-14858.77Show/hide
Query:  MCARKCAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKFVPL--------------NP
        MCARK  GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK V L              NP
Subjt:  MCARKCAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKFVPL--------------NP

Query:  QGRTLNLVCPRDQ---------------------------TEAVD--------------------------------------AQTEAVDAPPLGEEVRE
          R +    P  +                           TE V                                        ++EA+D  PL  EVR 
Subjt:  QGRTLNLVCPRDQ---------------------------TEAVD--------------------------------------AQTEAVDAPPLGEEVRE

Query:  EAPLKRRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE  A   LP S AD VDDP ARM G S+V  RF +EPSSSGV+DQVSRISA  LDR L+RASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGGMSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML

Query:  QVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        QVLE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDFDYSDPEED--------QVGSTQEGAP--QAGS
        +LD DYSD EE+        +VG+TQE  P  Q GS
Subjt:  DLDFDYSDPEED--------QVGSTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGGTAAGCAAGCTTTTGTTCGATTTAAAAGACGGTGTTTATGCAAGGATATGCACAACAGTGTGTTCCTGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGA
ACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCAAGTTTAGTTCGAGGCCAGAAATCGTCGTACCTGATCGGGGAAT
CATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCTGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCATTTC
AAGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTTCGGAGGAGGGGG
AGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACT
GGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGT
AGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAATGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCA
TCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCA
ATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGTTCGTCCCATTGAATCCTCAAGGCCGAAC
TCTGAACTTGGTATGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCCCGCCTTTAGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGC
GAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCAGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGATGATCCTGCGGCCAGGATGGGCGGG
ATGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGAGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAAGAGGGCGTC
CAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATG
GGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACT
TTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAA
GTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCA
ATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAGGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGAC
ATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGT
CAGAGATCTGGACTTTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGGTAAGCAAGCTTTTGTTCGATTTAAAAGACGGTGTTTATGCAAGGATATGCACAACAGTGTGTTCCTGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGA
ACACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCAAGTTTAGTTCGAGGCCAGAAATCGTCGTACCTGATCGGGGAAT
CATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCT
ATTAGCAGCAACCTAGGATCTGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCATTTC
AAGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTTCGGAGGAGGGGG
AGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACT
GGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGT
AGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAATGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCA
TCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCA
ATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGTTCGTCCCATTGAATCCTCAAGGCCGAAC
TCTGAACTTGGTATGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCCCGCCTTTAGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGC
GAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCAGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGATGATCCTGCGGCCAGGATGGGCGGG
ATGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGAGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAAGAGGGCGTC
CAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATG
GGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACT
TTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAA
GTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCA
ATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAGGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGAC
ATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGT
CAGAGATCTGGACTTTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSGKQAFVRFKRRCLCKDMHNSVFLIVARTRPPDRPEHLGGPAQKGEHSDDQVSIGRIPSLVRGQKSSYLIGESYLTFPEFLEFDLKAARTLGRSVSSLSLSNVVAMSSS
ISSNLGSDLARRLESELEEIENFRISDDGEDSDASISSQGLEYPSRIPEHYLGSLRRGFAIPENILLRLSEEGERADNPLEGWVTLYFKMFEYGLRLPLHPFVQEFLFRT
GLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKCAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVS
IRPVPELTQASFDTLKYYKERFPRGRKFVPLNPQGRTLNLVCPRDQTEAVDAQTEAVDAPPLGEEVREEAPLKRRRKKKKAISPSEVRACRVLPASFADRVDDPAARMGG
MSDVTARFRVEPSSSGVRDQVSRISAASLDRCLKRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASD
MPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDFDYSDPEEDQVGSTQEGAPQAGS