; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:11153533..11161760
RNA-Seq ExpressionMoc04g14630
SyntenyMoc04g14630
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]9.7e-3955.49Show/hide
Query:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRPI------------------------
        MNRN +DPPP Q+P VN D  GE AAN+AGE+PN ILLA+N+DVAM NYVT AF+NLNSGINN  PQAAQ +L+P+                        
Subjt:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRPI------------------------

Query:  ---------------EQFYR---GLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN
                       E   R   GLDRSSRM+LNT  NGSLL+KSVNEIVDILNKM DINDQGE GRSL KK+   GIF+L+
Subjt:  ---------------EQFYR---GLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]2.3e-4034.94Show/hide
Query:  IEQFYRGLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLNVNVPSGISIYIQTPTILDGGTIQTSLGVTKEELVA
        +E FY GL+ +++ +++ + NG++L K+ NE  +IL ++   N Q    RS P +K   G+  L V+  S I+  + + T      I  +L + ++ ++ 
Subjt:  IEQFYRGLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLNVNVPSGISIYIQTPTILDGGTIQTSLGVTKEELVA

Query:  VRKEQCKAIITRSKLS--YDGPTLPDEGTEIAIPV-PASICTLQPEEEAEPVTSKEKSKKADKSKQVVPCT---TPQVGNPLPVKCKDPGSSTIPYSIGG
                I   +  S  Y G    +E T    P  PASI  +  +   + +T++ K ++     +VVP     +  + N +P+K KDPGS TIP SIGG
Subjt:  VRKEQCKAIITRSKLS--YDGPTLPDEGTEIAIPV-PASICTLQPEEEAEPVTSKEKSKKADKSKQVVPCT---TPQVGNPLPVKCKDPGSSTIPYSIGG

Query:  KNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMPIIL------------------
        K LGRAL DL +SINL  LS +K+L I EARPTTVTLQLADRS    +GKIED+L+ VDKFIFP DFIILD EAD  +PIIL                  
Subjt:  KNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMPIIL------------------

Query:  -----GSRGVLYNRGNHGGTP--KDDCRRL--------RSKFGGCRQNRRNCASRN--------FDKKKENFEFLQPTAADLKALQPSIIEPPELEKKTL
             G + V +N  +    P   ++C  +          ++      +   +S N          +    FE L+        ++PSI E P+L+ K L
Subjt:  -----GSRGVLYNRGNHGGTP--KDDCRRL--------RSKFGGCRQNRRNCASRN--------FDKKKENFEFLQPTAADLKALQPSIIEPPELEKKTL

Query:  PSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI
        P +LKYAYLG   TL +IIS+ L++  E +LL+T+
Subjt:  PSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI

XP_030485610.1 uncharacterized protein LOC115702304 [Cannabis sativa]2.2e-3841.16Show/hide
Query:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV
        +T K + ++ +       C+   + + +P K KDPGS TIP SIGG+N+GRAL DL ASINL  +S F++L I EARPTTVTLQLADRS+   +GKIEDV
Subjt:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV

Query:  LVMVDKFIFPVDFIILDCEADIQMPIILG------SRGVLYNRGNHGGTPKDDCRRLRSKFGGCR--QNRRNCA---------SRNFDKK----------
        LV VDKFIFP DFIILD EAD ++PIILG       R ++  +        +D +   + F   R       C+         + +F KK          
Subjt:  LVMVDKFIFPVDFIILDCEADIQMPIILG------SRGVLYNRGNHGGTPKDDCRRLRSKFGGCR--QNRRNCA---------SRNFDKK----------

Query:  ---------------------------KENFEFLQPTAADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI
                                   K +FE L+   ++ K  +PSI EPP+LE K LPSHLKYAYLG N+ L +IIS+ L  E E LLL+ +
Subjt:  ---------------------------KENFEFLQPTAADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]1.5e-3944.94Show/hide
Query:  LPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMPII
        +P K KDPGS TIP SIGG+++GRAL DL ASINL  +S FK+L I EARPTTVTLQLADRS+   +GKIEDVLV VDKFIFP DFIILD EAD  +PII
Subjt:  LPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMPII

Query:  LG-----------------------SRGVLYNRGNHGGTPK--DDCRRLR--SKFGGCRQNRRNCASRNF---------------------------DKK
        LG                        + V +N  N    P   ++C R+         + ++  C    F                            K 
Subjt:  LG-----------------------SRGVLYNRGNHGGTPK--DDCRRLR--SKFGGCRQNRRNCASRNF---------------------------DKK

Query:  KENFEFLQPTAADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI
        K+ FE L+   ++ K  +PS  EPP+LE K LPSHLKYAYLG NDTL VII+S L  E E  LL+ +
Subjt:  KENFEFLQPTAADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI

XP_038880330.1 uncharacterized protein LOC120071970 [Benincasa hispida]1.7e-3844.92Show/hide
Query:  NPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMP
        N +P K KDP S T+P SIGGK +G  L DL ASINL  LS FK+LNI  ARPTT+ LQLADRS+   +GKIED+LV VDKFIFPVDFIILD EADI++P
Subjt:  NPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQMP

Query:  IILG-----------------------SRGVLYNRGNHGGTPKD-----------------DCRRLRSK-FGGCRQNRRNCASRNFDKKKENFEFLQPTA
        IILG                        + V +N  N    P D                    +L  K FG       NC +    + +  FE L+ + 
Subjt:  IILG-----------------------SRGVLYNRGNHGGTPKD-----------------DCRRLRSK-FGGCRQNRRNCASRNFDKKKENFEFLQPTA

Query:  ADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI
          ++  +PS+ EPP LE K+LP HLKY YLG N+TL VIIS+ L+ E E  L+Q +
Subjt:  ADLKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI

TrEMBL top hitse value%identityAlignment
A0A2G9G6G2 Reverse transcriptase1.2e-3444.49Show/hide
Query:  VGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQ
        + N LP K K+PGS TIP +IG    GRAL DL ASINL   S ++ L + EA+PT++TLQLADRSL   KG I+D+LV VDKFIFP DF++LD E DI+
Subjt:  VGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKFIFPVDFIILDCEADIQ

Query:  MPIILGSRGVLYNR------------------------GNHGGT------PKDDCRRLRSKFGGCRQNRRNC-------ASRNFDKKKENFEFLQPTAAD
        +PIILG   +   R                         N GG       P D   R         +N ++C       AS+ F  K    E L+ TA  
Subjt:  MPIILGSRGVLYNR------------------------GNHGGT------PKDDCRRLRSKFGGCRQNRRNC-------ASRNFDKKKENFEFLQPTAAD

Query:  LKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI
         K L+PSI EPP LE K LPSHL YAYLG +DTL VIISS L++     LL+ +
Subjt:  LKALQPSIIEPPELEKKTLPSHLKYAYLGLNDTLLVIISSYLTNEHESLLLQTI

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220074.7e-3955.49Show/hide
Query:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRPI------------------------
        MNRN +DPPP Q+P VN D  GE AAN+AGE+PN ILLA+N+DVAM NYVT AF+NLNSGINN  PQAAQ +L+P+                        
Subjt:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRPI------------------------

Query:  ---------------EQFYR---GLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN
                       E   R   GLDRSSRM+LNT  NGSLL+KSVNEIVDILNKM DINDQGE GRSL KK+   GIF+L+
Subjt:  ---------------EQFYR---GLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN

A0A6J1DQL0 uncharacterized protein LOC1110234232.2e-3668.22Show/hide
Query:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV
        + SK+K     +   +  C +  VG+PLP+KCKDPGS TIP SIGGKNLGRAL DL ASINL  LS FKEL I EARPTTVTLQLADRS+KK +GKIE V
Subjt:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV

Query:  LVMVDKFIFPVDFIILDCEADIQMPIILG
         V VDKFIFP  FIILDCEAD+ +PIILG
Subjt:  LVMVDKFIFPVDFIILDCEADIQMPIILG

A0A6J1DVY2 uncharacterized protein LOC1110240021.1e-3564.84Show/hide
Query:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV
        + S+ K     ++  +  C++  + NPLPVKCKDPGS TIP S+G KNLGRAL DL A INL SLS FKELNI EARPTTVTLQLADRS+KK +GKIED+
Subjt:  VTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDV

Query:  LVMVDKFIFPVDFIILDCEADIQMPIIL
        LV VD+ IFP+DFIILDCEAD+++ +IL
Subjt:  LVMVDKFIFPVDFIILDCEADIQMPIIL

A0A6J1E251 uncharacterized protein LOC1110253022.0e-3739.85Show/hide
Query:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRP-------------------------
        MNRN +DPPP Q+P VN D  GEEAAN+ GE+PN ILLA+N+DVAM NYVTHAF+NLNSGINNP PQAAQF+L+P                         
Subjt:  MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRP-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------IEQFYRGLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN
              IEQFYRGLDRSS+M+LNT  NGSLL+KSVNEIVD+LNKMTDINDQGE+GRSLPKK+   GIF+L+
Subjt:  ------IEQFYRGLDRSSRMLLNTTTNGSLLKKSVNEIVDILNKMTDINDQGEVGRSLPKKKALVGIFKLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGAAATCCACGAGATCCTCCACCTCTACAAGATCCAGCTGTGAACAGAGATAGGACAGGTGAAGAAGCAGCAAACCAAGCAGGAGAAGTGCCTAATTGGATCCT
TTTAGCTGAAAACCAAGACGTAGCCATGTGGAACTATGTCACTCATGCATTCAATAACTTAAATTCAGGGATAAATAATCCTTCACCCCAAGCCGCACAGTTCAAGCTCA
GGCCAATAGAACAGTTCTATAGGGGATTGGATCGTTCATCAAGAATGTTGTTGAACACCACAACCAATGGCTCATTGTTAAAGAAGTCGGTTAATGAGATTGTTGATATT
TTGAACAAGATGACGGACATTAATGACCAAGGAGAAGTAGGAAGGTCATTGCCAAAGAAGAAAGCATTAGTCGGAATCTTTAAGCTAAACGTCAATGTGCCTAGCGGAAT
TTCAATCTATATTCAAACACCTACAATCTTGGATGGAGGCACTATCCAAACTTCTCTTGGAGTAACCAAAGAGGAGCTAGTAGCAGTGCGTAAAGAGCAATGTAAGGCGA
TCATCACGAGAAGCAAACTGAGTTATGATGGACCCACACTTCCAGATGAAGGAACCGAAATAGCTATACCTGTTCCTGCATCAATCTGTACTCTACAACCAGAAGAGGAA
GCAGAACCTGTAACTTCAAAGGAGAAAAGTAAGAAAGCCGATAAAAGTAAGCAAGTAGTGCCTTGCACTACTCCGCAGGTAGGGAATCCATTGCCTGTCAAGTGTAAGGA
CCCAGGTAGTTCTACTATCCCTTACTCGATAGGTGGTAAGAATTTAGGAAGAGCATTGCGTGATTTATGGGCAAGCATTAATCTTAAGTCTCTTTCAGGCTTTAAAGAGT
TAAATATACGAGAAGCTCGCCCCACTACTGTGACTTTGCAACTAGCTGATAGGTCGTTAAAGAAATCGAAAGGAAAAATAGAAGATGTGCTTGTTATGGTTGATAAGTTT
ATTTTTCCCGTCGATTTCATAATTTTGGATTGTGAAGCAGATATTCAGATGCCAATCATTCTTGGGAGTCGAGGAGTGCTCTACAATAGGGGCAACCATGGAGGAACTCC
AAAAGATGATTGCAGAAGACTTAGAAGCAAATTTGGAGGCTGCAGACAAAACAGGCGAAATTGCGCCAGCCGCAATTTTGACAAAAAAAAGGAGAATTTTGAGTTTTTGC
AGCCAACAGCAGCTGATTTGAAAGCCTTGCAGCCTTCCATCATTGAACCTCCAGAATTGGAGAAGAAAACCCTACCCTCTCATTTAAAATATGCATATTTGGGTTTAAAC
GATACTTTGCTAGTTATCATTTCTTCGTATTTGACTAATGAACATGAATCTTTGCTTTTGCAGACAATTGTTGACATTTTCGAGCCCATGTACATGGTTCTTCAAATTAT
GGATAGCAAAGTTGTTCCAACCATGTCGATTATCTATAGTTTGATTGAAAATCAGATAAAAGTACATGCAAAGCAACATAATCGTCTAGCATACGAGAAGTTCCACAAAC
TTGTTTACTGCTACTACAATATAAAATTAAAAATTAGAGATATAGAGGCTGAGGATGAAAGAGTAGCAGAAATAGATTATCTTGACTTGCTAGATGTTAGAGCAGACCCC
AACGATGATGGTGATGACCCAATTTTTCAATGGGTTCGACCATTACATCTTGATGATGAATTTGGCAATACAGACCCTAAGATTATAAACACGGCTACGAATGCTGGTAT
CAATGTAGAGGTAGGTGCACATAATGAAACTGACACTTATACTGAGCTTCTGATGAAGGATTGTGTAGAAGCTAGGTATGATACACAAGAACTACATCAGATTCGAAAAG
TACTTCATCGAGGAAAACGTAAGAAAATTGCAAAGAATAATGATGATGAGGACGACAACACTGATGATAGAAATGATGCTGATGACACATCTGGTGGAGGAGGTGGGAAT
GATGCTTGCAACCAGGCAGAGGCGGGACAGTCAAGTAATTTAGGAATAAGTCTTTTTTCATGTGAGGTTGGACTTGATCATGTCACCAAAGATGAGAACCATGGATCTTG
GGTGGGTGGTGAGGGGATTGCTATTATTGGTAAAACATTTTATACACATCAAGATGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGAAATCCACGAGATCCTCCACCTCTACAAGATCCAGCTGTGAACAGAGATAGGACAGGTGAAGAAGCAGCAAACCAAGCAGGAGAAGTGCCTAATTGGATCCT
TTTAGCTGAAAACCAAGACGTAGCCATGTGGAACTATGTCACTCATGCATTCAATAACTTAAATTCAGGGATAAATAATCCTTCACCCCAAGCCGCACAGTTCAAGCTCA
GGCCAATAGAACAGTTCTATAGGGGATTGGATCGTTCATCAAGAATGTTGTTGAACACCACAACCAATGGCTCATTGTTAAAGAAGTCGGTTAATGAGATTGTTGATATT
TTGAACAAGATGACGGACATTAATGACCAAGGAGAAGTAGGAAGGTCATTGCCAAAGAAGAAAGCATTAGTCGGAATCTTTAAGCTAAACGTCAATGTGCCTAGCGGAAT
TTCAATCTATATTCAAACACCTACAATCTTGGATGGAGGCACTATCCAAACTTCTCTTGGAGTAACCAAAGAGGAGCTAGTAGCAGTGCGTAAAGAGCAATGTAAGGCGA
TCATCACGAGAAGCAAACTGAGTTATGATGGACCCACACTTCCAGATGAAGGAACCGAAATAGCTATACCTGTTCCTGCATCAATCTGTACTCTACAACCAGAAGAGGAA
GCAGAACCTGTAACTTCAAAGGAGAAAAGTAAGAAAGCCGATAAAAGTAAGCAAGTAGTGCCTTGCACTACTCCGCAGGTAGGGAATCCATTGCCTGTCAAGTGTAAGGA
CCCAGGTAGTTCTACTATCCCTTACTCGATAGGTGGTAAGAATTTAGGAAGAGCATTGCGTGATTTATGGGCAAGCATTAATCTTAAGTCTCTTTCAGGCTTTAAAGAGT
TAAATATACGAGAAGCTCGCCCCACTACTGTGACTTTGCAACTAGCTGATAGGTCGTTAAAGAAATCGAAAGGAAAAATAGAAGATGTGCTTGTTATGGTTGATAAGTTT
ATTTTTCCCGTCGATTTCATAATTTTGGATTGTGAAGCAGATATTCAGATGCCAATCATTCTTGGGAGTCGAGGAGTGCTCTACAATAGGGGCAACCATGGAGGAACTCC
AAAAGATGATTGCAGAAGACTTAGAAGCAAATTTGGAGGCTGCAGACAAAACAGGCGAAATTGCGCCAGCCGCAATTTTGACAAAAAAAAGGAGAATTTTGAGTTTTTGC
AGCCAACAGCAGCTGATTTGAAAGCCTTGCAGCCTTCCATCATTGAACCTCCAGAATTGGAGAAGAAAACCCTACCCTCTCATTTAAAATATGCATATTTGGGTTTAAAC
GATACTTTGCTAGTTATCATTTCTTCGTATTTGACTAATGAACATGAATCTTTGCTTTTGCAGACAATTGTTGACATTTTCGAGCCCATGTACATGGTTCTTCAAATTAT
GGATAGCAAAGTTGTTCCAACCATGTCGATTATCTATAGTTTGATTGAAAATCAGATAAAAGTACATGCAAAGCAACATAATCGTCTAGCATACGAGAAGTTCCACAAAC
TTGTTTACTGCTACTACAATATAAAATTAAAAATTAGAGATATAGAGGCTGAGGATGAAAGAGTAGCAGAAATAGATTATCTTGACTTGCTAGATGTTAGAGCAGACCCC
AACGATGATGGTGATGACCCAATTTTTCAATGGGTTCGACCATTACATCTTGATGATGAATTTGGCAATACAGACCCTAAGATTATAAACACGGCTACGAATGCTGGTAT
CAATGTAGAGGTAGGTGCACATAATGAAACTGACACTTATACTGAGCTTCTGATGAAGGATTGTGTAGAAGCTAGGTATGATACACAAGAACTACATCAGATTCGAAAAG
TACTTCATCGAGGAAAACGTAAGAAAATTGCAAAGAATAATGATGATGAGGACGACAACACTGATGATAGAAATGATGCTGATGACACATCTGGTGGAGGAGGTGGGAAT
GATGCTTGCAACCAGGCAGAGGCGGGACAGTCAAGTAATTTAGGAATAAGTCTTTTTTCATGTGAGGTTGGACTTGATCATGTCACCAAAGATGAGAACCATGGATCTTG
GGTGGGTGGTGAGGGGATTGCTATTATTGGTAAAACATTTTATACACATCAAGATGCATGA
Protein sequenceShow/hide protein sequence
MNRNPRDPPPLQDPAVNRDRTGEEAANQAGEVPNWILLAENQDVAMWNYVTHAFNNLNSGINNPSPQAAQFKLRPIEQFYRGLDRSSRMLLNTTTNGSLLKKSVNEIVDI
LNKMTDINDQGEVGRSLPKKKALVGIFKLNVNVPSGISIYIQTPTILDGGTIQTSLGVTKEELVAVRKEQCKAIITRSKLSYDGPTLPDEGTEIAIPVPASICTLQPEEE
AEPVTSKEKSKKADKSKQVVPCTTPQVGNPLPVKCKDPGSSTIPYSIGGKNLGRALRDLWASINLKSLSGFKELNIREARPTTVTLQLADRSLKKSKGKIEDVLVMVDKF
IFPVDFIILDCEADIQMPIILGSRGVLYNRGNHGGTPKDDCRRLRSKFGGCRQNRRNCASRNFDKKKENFEFLQPTAADLKALQPSIIEPPELEKKTLPSHLKYAYLGLN
DTLLVIISSYLTNEHESLLLQTIVDIFEPMYMVLQIMDSKVVPTMSIIYSLIENQIKVHAKQHNRLAYEKFHKLVYCYYNIKLKIRDIEAEDERVAEIDYLDLLDVRADP
NDDGDDPIFQWVRPLHLDDEFGNTDPKIINTATNAGINVEVGAHNETDTYTELLMKDCVEARYDTQELHQIRKVLHRGKRKKIAKNNDDEDDNTDDRNDADDTSGGGGGN
DACNQAEAGQSSNLGISLFSCEVGLDHVTKDENHGSWVGGEGIAIIGKTFYTHQDA