CuGenDBv2

Moc03g29760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g29760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome location: chr3:21239622..21242830
RNA-Seq Expression: Moc03g29760
Synteny: Moc03g29760
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domains: NA
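
The genome location field can be read programmatically. The following is a minimal sketch, assuming the `chrom:start..end` string uses 1-based inclusive coordinates (the page does not state the convention); the function names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:21239622..21242830")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 3209 bp on chr3.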


Homology
GenBank top hits | e value | % identity
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | 4.5e-109 | 83.33
Query:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWF+ASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGEEVREEAPL
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TP V GPASEDPAPVIELESS GPSREKRPRD TE VD      PLGEEVREE PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGEEVREEAPL

Query:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
        KRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 1.4e-142 | 94.51
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQT---DAPPLGE
        GVKRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRD TE VD QT   D PPLGE
Subjt:  GVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQT---DAPPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | 5.0e-108 | 95.3
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 8.0e-191 | 96.03
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLKYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGL+YPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEG VTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLKYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWF+ASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVD
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRD TE VD
Subjt:  GRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 4.4e-173 | 71.19
Query:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWFFASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTV--------AGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGE
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TPTV        +GP+S  P PVIEL+ SGG S EKR R+ +E +D      PL  
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTV--------AGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGE

Query:  EVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYA
        EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQRTID  
Subjt:  EVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYA

Query:  AEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEK
        AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEK
Subjt:  AEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEK

Query:  DDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK
        DD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK
Subjt:  DDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK

TrEMBL top hits | e value | % identity
A0A6J1C8K9 uncharacterized protein LOC111009298 | 2.2e-109 | 83.33
Query:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWF+ASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGEEVREEAPL
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TP V GPASEDPAPVIELESS GPSREKRPRD TE VD      PLGEEVREE PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGEEVREEAPL

Query:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
        KRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | 6.7e-143 | 94.51
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWF+ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQT---DAPPLGE
        GVKRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRD TE VD QT   D PPLGE
Subjt:  GVKRKSKGRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQT---DAPPLGE

A0A6J1D971 uncharacterized protein LOC111018538 | 2.4e-108 | 95.3
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK

A0A6J1DXS5 uncharacterized protein LOC111025502 | 3.9e-191 | 96.03
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLKYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGL+YPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEG VTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLKYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWF+ASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVD
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRD TE VD
Subjt:  GRAHALEAAQSSKPATPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | 2.1e-173 | 71.19
Query:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWFFASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTV--------AGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGE
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TPTV        +GP+S  P PVIEL+ SGG S EKR R+ +E +D      PL  
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPTV--------AGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGE

Query:  EVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYA
        EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQRTID  
Subjt:  EVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYA

Query:  AEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEK
        AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEK
Subjt:  AEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEK

Query:  DDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK
        DD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK
Subjt:  DDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT5G38190.1 INVOLVED IN: biological_process unknown | 7.6e-06 | 23.68
Query:  RLESELEEIENFRISDDGEDSDASTSGQGLKY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMF-EYGLRLPLHPFVQ
        R +++ +   N    D+ E +D + SG+  K       P+      +G       +P  + +RIP + +R  + PEG + L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRISDDGEDSDASTSGQGLKY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFA
         F     +A +Q+       I   A L  L AR       L V+ +       ++  K G+ Y+ + +G   +   P+  + W+  +F+A
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFA


Sequences
CDS sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGAT
AGTGACGCCTCCACTTCAGGTCAGGGTTTGAAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTT
CTCAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGTGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCT
TTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAGGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCT
CGGGATAGTGAGGAGACCGAGCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGG
AAAGGCGCAGGCGGTATAGTTAAGGGGCCGACTTCCATCAAGGGATGGGTGAGGAAGTGGTTCTTCGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGC
TTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGG
CCGAACTCTGAACTTGCCATGGTTTGCGGTTTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCC
ACCCCTACCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCACACCGAG
GTGGTGGACACCCAGACCGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAG
GTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCGCGGTTCAGAATTGAG
CCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCCGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTT
CTGCAGAGGACCATCGACTATGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGG
GAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAG
TCTCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTG
AAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTC
CTGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATG
CCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCTTGGCCCCCAAGCGTTGGTGGATCAGTATG
TCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGAACCAGGTCGGCTCCACTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAGGCGACCACCCTTCACGAGG
CCTTTTGCTGTTCTCCCTCCCTTTTCTTTTTTGTTTGTAAGTGTCAGGGCAGAGCTGCAAGGTTTGAATTTTAAGTTCGTCAGTGGTTTTGGCATCGCACCTCGT
ACCCTTAGGCTTCGAAGGATTGAATTTGAACCAACTGCTGGAATGCGATCGCGTGGGAGTCCGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAG
GACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATGCCCTTGATCTCAAACGGGCCCTCCCAGGCCGGATCAAGGGCACCCACATGCGTTTG
GACACTCCTTAA
mRNA sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGAT
AGTGACGCCTCCACTTCAGGTCAGGGTTTGAAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTT
CTCAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGTGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCT
TTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAGGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCT
CGGGATAGTGAGGAGACCGAGCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGG
AAAGGCGCAGGCGGTATAGTTAAGGGGCCGACTTCCATCAAGGGATGGGTGAGGAAGTGGTTCTTCGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGT
TCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGC
TTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGG
CCGAACTCTGAACTTGCCATGGTTTGCGGTTTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCC
ACCCCTACCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCACACCGAG
GTGGTGGACACCCAGACCGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAG
GTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCGCGGTTCAGAATTGAG
CCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCCGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTT
CTGCAGAGGACCATCGACTATGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGG
GAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAG
TCTCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTG
AAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTC
CTGCTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATG
CCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAAAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCTTGGCCCCCAAGCGTTGGTGGATCAGTATG
TCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGAACCAGGTCGGCTCCACTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAGGCGACCACCCTTCACGAGG
CCTTTTGCTGTTCTCCCTCCCTTTTCTTTTTTGTTTGTAAGTGTCAGGGCAGAGCTGCAAGGTTTGAATTTTAAGTTCGTCAGTGGTTTTGGCATCGCACCTCGT
ACCCTTAGGCTTCGAAGGATTGAATTTGAACCAACTGCTGGAATGCGATCGCGTGGGAGTCCGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAG
GACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATGCCCTTGATCTCAAACGGGCCCTCCCAGGCCGGATCAAGGGCACCCACATGCGTTTG
GACACTCCTTAA
Protein sequence
MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLKYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGCVTLYFKMFEYGLRLPLHP
FVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEETELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFFASGEWLAKDESGR
SFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPA
TPTVAGPASEDPAPVIELESSGGPSREKRPRDHTEVVDTQTDAPPLGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIE
PSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVE
SQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDM
PDLQIDLSGLKKGMPRSGRLGLAAPLAPKRWWISMSGIWTLTTPIPKRTRSAPLRRALPQQALRRPPFTRPFAVLPPFSFLFVSVRAELQGLNFKFVSGFGIAPR
TLRLRRIEFEPTAGMRSRGSPIITLQVLRVPRVREDVSFQIGQYVRPRSDYALDLKRALPGRIKGTHMRLDTP
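
The relationship between the CDS and protein sequences can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code applies and that the CDS is read from its first base in frame; the `translate` function is a hypothetical helper written for illustration, and the two fragments below are the first 18 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Assumes the sequence starts in frame; trailing bases that do not
    form a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the CDS: should give the start of the protein.
print(translate("ATGTCGTCCTCTATTAGC"))
# Last 12 bases of the CDS, ending in the TAA stop codon.
print(translate("GACACTCCTTAA"))
```

The two calls print `MSSSIS` and `DTP`, which match the first six and last three residues of the protein sequence shown above.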