; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g09370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g09370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:6646395..6648308
RNA-Seq ExpressionMoc02g09370
SyntenyMoc02g09370
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.6e-9994.58Show/hide
Query:  VSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGP
        V+IRPVPELTQASF+TLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALE AQSSKP TPAVVGP
Subjt:  VSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGP

Query:  ASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARGVLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGV
        ASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEV LKRRRKKKKTTSPLEVGARGVLP SFADRVDDPEARMGGT DVT RFR+EPSSSGV
Subjt:  ASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARGVLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGV

Query:  RDQ
        RDQ
Subjt:  RDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.3e-7765.2Show/hide
Query:  MFEYGLRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL-------------------------------------------------------------
        MFEYGLRLPLHPFVQE  FR GLA AQVAP G GV F+L                                                             
Subjt:  MFEYGLRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL-------------------------------------------------------------

Query:  -WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
         W    G  L  DESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  -WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALE AQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.5e-12673.24Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYSSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEY SRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYSSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL--------------------------------------------------------------WPSF
        LRLPLHPFVQE  FR GLA AQVAP G GV F+L                                                              W   
Subjt:  LRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL--------------------------------------------------------------WPSF

Query:  FGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
         G  L  DESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  FGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALE AQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]1.3e-9386.58Show/hide
Query:  MVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG
        MVCGFAS+VKRKSKGRAHA E AQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEV LKRRRKKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG

Query:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR
        VLP SFADRVDDPEARMGGTSDVTARFR++PSS+GVRDQVSRISAASL+RCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVK ELDGRE LAAR
Subjt:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR

Query:  EKEEFSTALEAASSTMKDELLKAHSEVEILK
        EKEEFS ALEA       EL  A +E+E  K
Subjt:  EKEEFSTALEAASSTMKDELLKAHSEVEILK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.3e-12770.21Show/hide
Query:  WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASN
        W    G  L  DESGR+FFDVPTRFGNLVSI+ +PEL QA+F+TLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVCGF  +
Subjt:  WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASN

Query:  VKRKSKGRAHALEVAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG
        VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E  L+RRRKKKKT+S  E GARG
Subjt:  VKRKSKGRAHALEVAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG

Query:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR
         LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VK ELDGREALAA+
Subjt:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR

Query:  EKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDM
        E+E    ALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+
Subjt:  EKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDM

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.3e-9994.58Show/hide
Query:  VSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGP
        V+IRPVPELTQASF+TLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALE AQSSKP TPAVVGP
Subjt:  VSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGP

Query:  ASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARGVLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGV
        ASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEV LKRRRKKKKTTSPLEVGARGVLP SFADRVDDPEARMGGT DVT RFR+EPSSSGV
Subjt:  ASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARGVLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGV

Query:  RDQ
        RDQ
Subjt:  RDQ

A0A6J1CR42 uncharacterized protein LOC1110138266.1e-7865.2Show/hide
Query:  MFEYGLRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL-------------------------------------------------------------
        MFEYGLRLPLHPFVQE  FR GLA AQVAP G GV F+L                                                             
Subjt:  MFEYGLRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL-------------------------------------------------------------

Query:  -WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
         W    G  L  DESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  -WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALE AQSSKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1DXS5 uncharacterized protein LOC1110255027.1e-12773.24Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYSSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEY SRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYSSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL--------------------------------------------------------------WPSF
        LRLPLHPFVQE  FR GLA AQVAP G GV F+L                                                              W   
Subjt:  LRLPLHPFVQE--FRIGLASAQVAPMG-GVSFSL--------------------------------------------------------------WPSF

Query:  FGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
         G  L  DESGRSFFDVPTRFGNLVSIRPVPELTQASF+TLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  FGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALE AQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256066.1e-9486.58Show/hide
Query:  MVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG
        MVCGFAS+VKRKSKGRAHA E AQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEV LKRRRKKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG

Query:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR
        VLP SFADRVDDPEARMGGTSDVTARFR++PSS+GVRDQVSRISAASL+RCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVK ELDGRE LAAR
Subjt:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR

Query:  EKEEFSTALEAASSTMKDELLKAHSEVEILK
        EKEEFS ALEA       EL  A +E+E  K
Subjt:  EKEEFSTALEAASSTMKDELLKAHSEVEILK

A0A6J1DZB3 uncharacterized protein LOC1110256651.1e-12770.21Show/hide
Query:  WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASN
        W    G  L  DESGR+FFDVPTRFGNLVSI+ +PEL QA+F+TLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVCGF  +
Subjt:  WPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASN

Query:  VKRKSKGRAHALEVAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG
        VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E  L+RRRKKKKT+S  E GARG
Subjt:  VKRKSKGRAHALEVAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARG

Query:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR
         LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VK ELDGREALAA+
Subjt:  VLPTSFADRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAR

Query:  EKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDM
        E+E    ALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+
Subjt:  EKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACTCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCACCCTTTCGTCCAA
GAGTTCCGAATTGGGCTGGCTTCGGCTCAAGTGGCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGACGAGTCAGGTCG
TTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACTTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGAAACGTTGAAATATTATAAGGAGCATTTTC
CGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCC
GAATTAGCCATGGTTTGTGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGTCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGT
AGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAAAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCT
TGGGCGAGGAGGTGAGGGAGGAAGTCCATCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTACGAGCTTCGCA
GATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAATCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTC
GGCTGCAAGTTTGAACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCA
TTCAATCGGCTCTGGCCGTGAAGGGCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAG
GATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGC
TGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACTCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCACCCTTTCGTCCAA
GAGTTCCGAATTGGGCTGGCTTCGGCTCAAGTGGCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGACGAGTCAGGTCG
TTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACTTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGAAACGTTGAAATATTATAAGGAGCATTTTC
CGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCC
GAATTAGCCATGGTTTGTGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGTCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGT
AGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAAAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCT
TGGGCGAGGAGGTGAGGGAGGAAGTCCATCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTACGAGCTTCGCA
GATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAATCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTC
GGCTGCAAGTTTGAACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCA
TTCAATCGGCTCTGGCCGTGAAGGGCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAG
GATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGC
TGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYSSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
EFRIGLASAQVAPMGGVSFSLWPSFFGYELGIDESGRSFFDVPTRFGNLVSIRPVPELTQASFETLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNS
ELAMVCGFASNVKRKSKGRAHALEVAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVHLKRRRKKKKTTSPLEVGARGVLPTSFA
DRVDDPEARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKGELDGREALAAREKEEFSTALEAASSTMK
DELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML