CuGenDBv2

Moc03g17710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g17710
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr3:11730961..11737143
RNA-Seq Expression: Moc03g17710
Synteny: Moc03g17710
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
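The genome location field uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it with a regular expression. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:11730961..11737143")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 6,183 bp on chr3.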


Homology
GenBank top hits
TXG48131.1 hypothetical protein EZV62_027425 [Acer yangbiense]; e value: 3.0e-30; % identity: 34.08
Query:  LANDGVNR----VVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMD
        +A  G NR    VVEQFR+LHPPSF+GT                     N + V +W   H   + KE +  +L QG + L+EYE+KFE+LS + P+L+D
Subjt:  LANDGVNR----VVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMD

Query:  TDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEGQSSSQPHKRVGYSSSGQQSECKTCGKNHTGTCYKATGA
        T  RK+  F R L+ D+RKHV    LMTY +VLQ A I++++ D     +K    +++  + ++   P               +TCGK H G C+   GA
Subjt:  TDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEGQSSSQPHKRVGYSSSGQQSECKTCGKNHTGTCYKATGA

Query:  CFLCGKIGHFINNCPLSQNTDDSKAQEKKSSKKGKVFAITKQEVNDEPNVVSVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILGYSAA
        CF CGK  HF  NCP  +N  D  AQ K     G V+A+T+QE                               A  +++ DL IL++ DFN I G    
Subjt:  CFLCGKIGHFINNCPLSQNTDDSKAQEKKSSKKGKVFAITKQEVNDEPNVVSVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILGYSAA

Query:  LLTTQEHLIKDFEK
         L+     +K FEK
Subjt:  LLTTQEHLIKDFEK

XP_022155935.1 uncharacterized protein LOC111022935 [Momordica charantia]; e value: 3.0e-59; % identity: 63.59
Query:  PPRRVNNRRTTAVEEPQDNIPIVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN------------AMIVRKWLV
        PP+R NN RT  V+EPQDNIP+VPQN + NRNA AND VNRVVEQFR LHPPSFD      +L    ++        WN             +   K+  
Subjt:  PPRRVNNRRTTAVEEPQDNIPIVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN------------AMIVRKWLV

Query:  LHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKF
        L T+RQNKE + IKLEQG+L LIEYEKKFEELSHYAPHL+DTDWRK R+FERGLRP+LRKHVA F L+T  EVLQ A ILSQDLDN KS SKDGGSKRKF
Subjt:  LHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKF

Query:  HEGQSS
        H GQSS
Subjt:  HEGQSS

XP_022158216.1 uncharacterized protein LOC111024753 [Momordica charantia]; e value: 6.3e-49; % identity: 61.11
Query:  IVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRK----------SRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI
        + PQNPR NRNA AND VN + +QFRKLHPPSFDG S   ++ +    +          + R    +  +   K+    T+RQNKE E IKLEQGRLP I
Subjt:  IVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRK----------SRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI

Query:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEG
        EYEKKFEELSH APHL++T+WRK RQFERGLRP+L+KHV  FRLMTY EVLQ   ILSQDLDNTKS  KDGGSKRKFH G
Subjt:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEG

XP_028093324.1 uncharacterized protein LOC114293449 [Camellia sinensis]; e value: 2.1e-28; % identity: 34.39
Query:  EVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDL---DNTKSFSKDGGSKRK---FHEGQ
        E  +L+QG++ ++EYE KF +L+ +APH++DT+++K ++FERGL  D+   V   +L TYVEVL  A +    L      K+   +  SKR    F +G 
Subjt:  EVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDL---DNTKSFSKDGGSKRK---FHEGQ

Query:  SS-SQPHKRVGYSSSGQQSE-----CKTCGKNHTGTCYKATGACFLCGKIGHFINNCP-----------------LSQNTDDSKAQEKKSSKKGKVFAIT
        SS S   +  G SSS  QS      C   G+ H G CY+ +GACF CGK GH I +CP                 L+  T+     E+++ ++G+VFA+ 
Subjt:  SS-SQPHKRVGYSSSGQQSE-----CKTCGKNHTGTCYKATGACFLCGKIGHFINNCP-----------------LSQNTDDSKAQEKKSSKKGKVFAIT

Query:  KQEVNDEPNVVSVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILGYSAALLTTQEHLIKDFEKLAIDVTTKSV----SSLLASLVVE--
          +V +  +VVS      ++L    V  +C +   D  L ADL  L+IT F+ ILG    LLT           + ID  +K+V      +L  + V   
Subjt:  KQEVNDEPNVVSVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILGYSAALLTTQEHLIKDFEKLAIDVTTKSV----SSLLASLVVE--

Query:  PSLISRIKQAQLSDPALKKIVDDVSKSKRVDFSVSCDGVLWFGDRI
        P++I  IK+ QL D  LK IVD+ +   R  F V  + VL F  R+
Subjt:  PSLISRIKQAQLSDPALKKIVDDVSKSKRVDFSVSCDGVLWFGDRI

XP_030942004.1 uncharacterized protein LOC115967052 [Quercus lobata]; e value: 3.3e-29; % identity: 42.35
Query:  KEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHE
        KE E I++ QG   + EYE+KF ELS +APH++DT+ RK R FE GLR +++  V+ F+L TY EV+  A I  +   N    SK    +RK     F +
Subjt:  KEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHE

Query:  GQSSSQPHKRVGYSSSGQQSECKTC---GKNHTGTCYKATGACFLCGKIGHFINNCP--LSQNTDDSKAQEKKSSKKGKVFAITKQEVNDEPNVVS
        G+SS    K+ G  +S +++  KTC   GK H+GTCY  +GACF CGK  HFI +CP   ++ T ++   ++K   +G+VFA+TKQ+    P+VVS
Subjt:  GQSSSQPHKRVGYSSSGQQSECKTC---GKNHTGTCYKATGACFLCGKIGHFINNCP--LSQNTDDSKAQEKKSSKKGKVFAITKQEVNDEPNVVS

TrEMBL top hits
A0A2N9HUA0 Uncharacterized protein; e value: 6.0e-45; % identity: 36.61
Query:  PRLN-RNAL-----ANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN---------AMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI
        PR N RN L      +D +  +++   KL PPSF G S+  L   N  ++  RL    N         A+ + K   L   R  KE E I++EQG   + 
Subjt:  PRLN-RNAL-----ANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN---------AMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI

Query:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSS
        EYE+KF ELS +APH++D + RK R FERGLR +++  V+ F+L TY EV+  A I  +   N K  SK    +RK     F +G+SS    K+ G S+S
Subjt:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSS

Query:  GQQS--ECKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQN--TDDSKAQEKKSSKKGKVFAITKQEVNDEPNVV----------------------
         +++   C  CGKNH+GTCY  TGACF CGK GHFI +CP  QN  TD +   ++K    G+VFA+TKQ+    P+VV                      
Subjt:  GQQS--ECKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQN--TDDSKAQEKKSSKKGKVFAITKQEVNDEPNVV----------------------

Query:  ---------------------SVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILG
                             SV TPS +V+  DKV +SC + ++ R L A+L +L++ +F+ ILG
Subjt:  ---------------------SVSTPSSNVLRVDKVVESCQLSIADRNLMADLTILEITDFNAILG

A0A2N9I6M6 Reverse transcriptase; e value: 8.1e-42; % identity: 32.46
Query:  PPRRVNNRRTTAVEEPQDNIPIVPQN----PRLNRNALANDGVNRVVEQFRKLHPPSFDGT----------SRIQLLLKNGSRKSRRLLTL---------
        PPR+ +     + E   D +  + Q       + ++ +        +E+FRKL PPSF G+            +  L K  + +  + +TL         
Subjt:  PPRRVNNRRTTAVEEPQDNIPIVPQN----PRLNRNALANDGVNRVVEQFRKLHPPSFDGT----------SRIQLLLKNGSRKSRRLLTL---------

Query:  ---WNAM----------------IVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTY
           W +                 + RK     + R  KE E I++EQG   + EYE+KF ELS +APH++D + RK R FERGLR +++  V+ F+L TY
Subjt:  ---WNAM----------------IVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTY

Query:  VEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSSGQQSE--CKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQNTDD
         EV+  A I  +   N K  SK    +RK     F +G+SS    KR G S+S +++   C  CGKNH+GTCY  TGACF CGK GHFI +CP  QN   
Subjt:  VEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSSGQQSE--CKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQNTDD

Query:  SKAQEKKSSKK--GKVFAITKQEVNDEPNVV-------------------------------------------SVSTPSSNVLRVDKVVESCQLSIADR
         KA E +   K  G+VFA+T+Q+    P+VV                                           SV TPS +VL  DKV +SC + ++ R
Subjt:  SKAQEKKSSKK--GKVFAITKQEVNDEPNVV-------------------------------------------SVSTPSSNVLRVDKVVESCQLSIADR

Query:  NLMADLTILEITDFNAILG
         L A+L +L++ +F+ ILG
Subjt:  NLMADLTILEITDFNAILG

A0A2N9IS48 Reverse transcriptase; e value: 4.0e-41; % identity: 32.22
Query:  PPRRVNNRRTTAVEEPQDNIPIVPQN----PRLNRNALANDGVNRVVEQFRKLHPPSFDGT----------SRIQLLLKNGSRKSRRLLTL---------
        PPR+ +     + E   D +  + Q       + ++ +        +E+FRKL PPSF G+            +  L K  + +  + +TL         
Subjt:  PPRRVNNRRTTAVEEPQDNIPIVPQN----PRLNRNALANDGVNRVVEQFRKLHPPSFDGT----------SRIQLLLKNGSRKSRRLLTL---------

Query:  ---WNAM----------------IVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTY
           W +                 + RK     + R  KE E I++EQG   + EYE+KF ELS +APH++D + RK R FERGLR +++  V+ F+L TY
Subjt:  ---WNAM----------------IVRKWLVLHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTY

Query:  VEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSSGQQSE--CKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQNTDD
         EV+  A I  +   N K  SK    +RK     F +G+SS    KR G S+S +++   C  CGKNH+GTCY  T ACF CGK GHFI +CP  QN   
Subjt:  VEVLQTAWILSQDLDNTKSFSKDGGSKRK-----FHEGQSSSQPHKRVGYSSSGQQSE--CKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQNTDD

Query:  SKAQEKKSSKK--GKVFAITKQEVNDEPNVV-------------------------------------------SVSTPSSNVLRVDKVVESCQLSIADR
         KA E +   K  G+VFA+T+Q+    P+VV                                           SV TPS +VL  DKV +SC + ++ R
Subjt:  SKAQEKKSSKK--GKVFAITKQEVNDEPNVV-------------------------------------------SVSTPSSNVLRVDKVVESCQLSIADR

Query:  NLMADLTILEITDFNAILG
         L A+L +L++ +F+ ILG
Subjt:  NLMADLTILEITDFNAILG

A0A2N9IS48 Reverse transcriptase; e value: 5.1e-12; % identity: 49.41
Query:  GYSAALLTTQEHLIKDFEKLAIDVTTKSVSSLLASLVVEPSLISRIKQAQLSDPALKKIVDDVSKSKRVDFSVSCDGVLWFGDRI
        G+SAALLTTQ+H+I D E+L ++V      S LASL V+P+LI +IK +Q  DP L KI+++V    R++F++S DG L FG+R+
Subjt:  GYSAALLTTQEHLIKDFEKLAIDVTTKSVSSLLASLVVEPSLISRIKQAQLSDPALKKIVDDVSKSKRVDFSVSCDGVLWFGDRI

A0A6J1DNX0 uncharacterized protein LOC111022935; e value: 1.5e-59; % identity: 63.59
Query:  PPRRVNNRRTTAVEEPQDNIPIVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN------------AMIVRKWLV
        PP+R NN RT  V+EPQDNIP+VPQN + NRNA AND VNRVVEQFR LHPPSFD      +L    ++        WN             +   K+  
Subjt:  PPRRVNNRRTTAVEEPQDNIPIVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWN------------AMIVRKWLV

Query:  LHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKF
        L T+RQNKE + IKLEQG+L LIEYEKKFEELSHYAPHL+DTDWRK R+FERGLRP+LRKHVA F L+T  EVLQ A ILSQDLDN KS SKDGGSKRKF
Subjt:  LHTSRQNKEVEVIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKF

Query:  HEGQSS
        H GQSS
Subjt:  HEGQSS

A0A6J1E0B5 uncharacterized protein LOC111024753; e value: 3.1e-49; % identity: 61.11
Query:  IVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRK----------SRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI
        + PQNPR NRNA AND VN + +QFRKLHPPSFDG S   ++ +    +          + R    +  +   K+    T+RQNKE E IKLEQGRLP I
Subjt:  IVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRK----------SRRLLTLWNAMIVRKWLVLHTSRQNKEVEVIKLEQGRLPLI

Query:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEG
        EYEKKFEELSH APHL++T+WRK RQFERGLRP+L+KHV  FRLMTY EVLQ   ILSQDLDNTKS  KDGGSKRKFH G
Subjt:  EYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEG

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
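The alignments listed under Homology carry a midline between the Query and Subjt rows. As a rough sketch of how an identity percentage could be derived from one aligned segment, the snippet below counts identical columns in the midline. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function name `segment_identity` is hypothetical, and the result applies to a single segment only, not to the whole-hit % identity reported for each entry.

```python
def segment_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline shows an identical residue.

    Assumption: in the midline a letter marks an identical residue, '+'
    marks a conservative substitution, and a space marks a mismatch or gap.
    Gap columns in the query count towards the alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Final segment of the XP_022155935.1 alignment, copied from the listing above.
pct = segment_identity("HEGQSS", "H GQSS")
print(f"{pct:.2f}")
```

When copying rows from the listing, the leading `Query:` label and the matching indentation of the midline have to be stripped so that both strings start at the same column.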

Sequences
CDS sequence
ATGTTTTGTAGAACAACGCCGCCACGAAGGGTTAATAATCGTAGGACAACAGCTGTTGAGGAACCACAAGATAATATCCCAATTGTTCCACAAAACCCAAGACTT
AACAGGAATGCTCTAGCTAATGATGGAGTCAATCGTGTGGTGGAACAGTTCAGAAAGCTCCATCCACCCTCTTTTGATGGTACAAGCCGGATCCAATTGTTGCTG
AAGAATGGATCGAGGAAATCGAGAAGGCTTTTGACTTTATGGAATGCAATGATCGTTAGAAAGTGGCTTGTGCTACATACATCTAGACAGAACAAGGAGGTAGAG
GTTATCAAGTTAGAGCAAGGAAGACTACCCCTGATAGAGTATGAAAAGAAGTTCGAGGAGCTTTCTCACTACGCCCCTCATTTGATGGACACTGATTGGAGGAAA
GTAAGACAATTTGAAAGAGGTTTGAGACCAGATTTAAGGAAGCATGTTGCTACCTTCAGATTGATGACCTATGTCGAGGTGTTACAAACAGCTTGGATCTTGTCA
CAGGATTTGGACAACACGAAGTCATTTTCCAAGGATGGTGGTTCAAAAAGGAAGTTCCATGAAGGACAGAGTTCGAGTCAACCTCATAAAAGGGTAGGCTATAGC
AGTAGTGGACAACAGTCCGAGTGTAAGACATGTGGCAAGAACCATACTGGGACATGCTACAAAGCGACAGGAGCCTGCTTTCTTTGTGGAAAAATAGGACACTTC
ATCAATAATTGTCCATTGAGTCAAAACACAGATGATTCGAAAGCACAAGAGAAGAAATCGAGTAAGAAGGGTAAAGTCTTTGCGATCACTAAGCAAGAGGTTAAT
GATGAGCCCAATGTTGTTTCAGTATCCACTCCTTCTAGTAACGTGCTACGTGTTGATAAGGTAGTTGAATCCTGTCAATTAAGTATTGCTGATCGAAACTTGATG
GCTGATTTGACAATTCTTGAGATTACTGATTTTAACGCAATTCTTGGTTACTCAGCAGCCTTACTCACAACTCAAGAGCATTTAATTAAAGATTTTGAAAAGTTA
GCGATTGATGTGACTACAAAAAGTGTATCGTCGTTGTTGGCTAGCTTGGTGGTTGAACCATCTTTAATCAGCAGGATCAAGCAAGCTCAACTTTCAGATCCAGCT
TTGAAGAAAATTGTCGATGATGTTAGCAAGTCTAAACGAGTTGACTTTTCAGTGTCGTGTGATGGAGTTTTATGGTTTGGTGATCGAATTGGTAGTGTAGCATTT
TCACCGATGACAATGTCTACCATGACTCTCACATTGTCCACACCTAATGGCACTCCATGA
mRNA sequence
ATGTTTTGTAGAACAACGCCGCCACGAAGGGTTAATAATCGTAGGACAACAGCTGTTGAGGAACCACAAGATAATATCCCAATTGTTCCACAAAACCCAAGACTT
AACAGGAATGCTCTAGCTAATGATGGAGTCAATCGTGTGGTGGAACAGTTCAGAAAGCTCCATCCACCCTCTTTTGATGGTACAAGCCGGATCCAATTGTTGCTG
AAGAATGGATCGAGGAAATCGAGAAGGCTTTTGACTTTATGGAATGCAATGATCGTTAGAAAGTGGCTTGTGCTACATACATCTAGACAGAACAAGGAGGTAGAG
GTTATCAAGTTAGAGCAAGGAAGACTACCCCTGATAGAGTATGAAAAGAAGTTCGAGGAGCTTTCTCACTACGCCCCTCATTTGATGGACACTGATTGGAGGAAA
GTAAGACAATTTGAAAGAGGTTTGAGACCAGATTTAAGGAAGCATGTTGCTACCTTCAGATTGATGACCTATGTCGAGGTGTTACAAACAGCTTGGATCTTGTCA
CAGGATTTGGACAACACGAAGTCATTTTCCAAGGATGGTGGTTCAAAAAGGAAGTTCCATGAAGGACAGAGTTCGAGTCAACCTCATAAAAGGGTAGGCTATAGC
AGTAGTGGACAACAGTCCGAGTGTAAGACATGTGGCAAGAACCATACTGGGACATGCTACAAAGCGACAGGAGCCTGCTTTCTTTGTGGAAAAATAGGACACTTC
ATCAATAATTGTCCATTGAGTCAAAACACAGATGATTCGAAAGCACAAGAGAAGAAATCGAGTAAGAAGGGTAAAGTCTTTGCGATCACTAAGCAAGAGGTTAAT
GATGAGCCCAATGTTGTTTCAGTATCCACTCCTTCTAGTAACGTGCTACGTGTTGATAAGGTAGTTGAATCCTGTCAATTAAGTATTGCTGATCGAAACTTGATG
GCTGATTTGACAATTCTTGAGATTACTGATTTTAACGCAATTCTTGGTTACTCAGCAGCCTTACTCACAACTCAAGAGCATTTAATTAAAGATTTTGAAAAGTTA
GCGATTGATGTGACTACAAAAAGTGTATCGTCGTTGTTGGCTAGCTTGGTGGTTGAACCATCTTTAATCAGCAGGATCAAGCAAGCTCAACTTTCAGATCCAGCT
TTGAAGAAAATTGTCGATGATGTTAGCAAGTCTAAACGAGTTGACTTTTCAGTGTCGTGTGATGGAGTTTTATGGTTTGGTGATCGAATTGGTAGTGTAGCATTT
TCACCGATGACAATGTCTACCATGACTCTCACATTGTCCACACCTAATGGCACTCCATGA
Protein sequence
MFCRTTPPRRVNNRRTTAVEEPQDNIPIVPQNPRLNRNALANDGVNRVVEQFRKLHPPSFDGTSRIQLLLKNGSRKSRRLLTLWNAMIVRKWLVLHTSRQNKEVE
VIKLEQGRLPLIEYEKKFEELSHYAPHLMDTDWRKVRQFERGLRPDLRKHVATFRLMTYVEVLQTAWILSQDLDNTKSFSKDGGSKRKFHEGQSSSQPHKRVGYS
SSGQQSECKTCGKNHTGTCYKATGACFLCGKIGHFINNCPLSQNTDDSKAQEKKSSKKGKVFAITKQEVNDEPNVVSVSTPSSNVLRVDKVVESCQLSIADRNLM
ADLTILEITDFNAILGYSAALLTTQEHLIKDFEKLAIDVTTKSVSSLLASLVVEPSLISRIKQAQLSDPALKKIVDDVSKSKRVDFSVSCDGVLWFGDRIGSVAF
SPMTMSTMTLTLSTPNGTP
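The CDS and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); the helper `translate` and the table-building approach are illustrative and not part of CuGenDBv2. It translates the first 45 nt and the last 60 nt of the CDS as listed above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 45 nt and final 60 nt of the CDS listed above.
first = translate("ATGTTTTGTAGAACAACGCCGCCACGAAGGGTTAATAATCGTAGG")
last = translate("TCACCGATGACAATGTCTACCATGACTCTCACATTGTCCACACCTAATGGCACTCCATGA")
print(first)  # MFCRTTPPRRVNNRR
print(last)   # SPMTMSTMTLTLSTPNGTP*
```

Both fragments agree with the start and end of the protein sequence shown above, with the trailing `*` corresponding to the TGA stop codon.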