; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022056 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022056
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:17163740..17173836
RNA-Seq ExpressionLag0022056
SyntenyLag0022056
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]3.0e-4342.74Show/hide
Query:  APMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQ
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL   + C+D LK++GAVFML+ +A  WW 
Subjt:  APMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQ

Query:  SVAAAEDHANDRSRGKGSRIYL----------------------------QYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEA
         VA  EDH N+      S+  L                            QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AEA
Subjt:  SVAAAEDHANDRSRGKGSRIYL----------------------------QYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEA

Query:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        + GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]7.4e-4245.98Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA    + C D  K++GAVFML+ +A  WW S+AAAEDHAN                      D   
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR

Query:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
               +G+    QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]8.8e-4346.88Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR
        E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL   + C D  K++GAVFML+ +A  WW SVAAAED+AN                      D   
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR

Query:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
               +G+    QYERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS+SGVKRK      
Subjt:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
         L+         + P+R A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]1.0e-4344.23Show/hide
Query:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL
        P G+   DPPPPP        PP PPAA   +       + +N A       + P  +     E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL 
Subjt:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN-----DRSRG-----------------------KGSRIYLQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN      R +                        +G+    QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN-----DRSRG-----------------------KGSRIYLQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        IKRF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

XP_022158302.1 uncharacterized protein LOC111024816 [Momordica charantia]3.1e-4055.29Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----DRSRGKGSRIY-LQYERKFTAL
        E+QFI+DFK YGPP+FDG SE   A+E W+ +LEA    + C D  K++GAVFML+ +A  WW S+A AED AN    D  R + SR+   QYERKFT L
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----DRSRGKGSRIY-LQYERKFTAL

Query:  SRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKR
        S FA + + T   KIKRF KGLR+ IR  V L  PA++AEA+ GALIMDK+V+ K QP LE  S+SGVKR
Subjt:  SRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKR

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196031.5e-4342.74Show/hide
Query:  APMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQ
        A + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL   + C+D LK++GAVFML+ +A  WW 
Subjt:  APMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQ

Query:  SVAAAEDHANDRSRGKGSRIYL----------------------------QYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEA
         VA  EDH N+      S+  L                            QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AEA
Subjt:  SVAAAEDHANDRSRGKGSRIYL----------------------------QYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEA

Query:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        + GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  LTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

A0A6J1DL73 uncharacterized protein LOC1110221443.6e-4245.98Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA    + C D  K++GAVFML+ +A  WW S+AAAEDHAN                      D   
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR

Query:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
               +G+    QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DNV8 uncharacterized protein LOC1110229251.5e-4044.12Show/hide
Query:  PPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDD
        PP  P   +L+  EALQ + DN        ++ P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL   + C+D  K+RGAVFML+ +
Subjt:  PPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDD

Query:  ARTWWQSVAAAEDHAN----------------------DRSRG------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP
        A  WW+SVAAAEDHAN                      +  R       + S I  QYERKFT LSRF    + T + KI +FI GLR EI+G + L  P
Subjt:  ARTWWQSVAAAEDHAN----------------------DRSRG------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRP

Query:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS
         T+A A+  AL+MDK + ++PQ     GS+SGVKRK +
Subjt:  ATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS

A0A6J1DUM2 uncharacterized protein LOC1110232474.3e-4346.88Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR
        E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL   + C D  K++GAVFML+ +A  WW SVAAAED+AN                      D   
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN----------------------DRSR

Query:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
               +G+    QYERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS+SGVKRK      
Subjt:  G------KGSRIYLQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
         L+         + P+R A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DVA0 uncharacterized protein LOC1110234245.0e-4444.23Show/hide
Query:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL
        P G+   DPPPPP        PP PPAA   +       + +N A       + P  +     E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL 
Subjt:  PPGQRRVDPPPPP-------PPPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALL

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN-----DRSRG-----------------------KGSRIYLQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN      R +                        +G+    QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHAN-----DRSRG-----------------------KGSRIYLQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        IKRF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCGGCGGCCGCCCCATCGTCGGAATTTCTTTTTTGGGTTGCGGCTGTACAGACGGTGAGTATCTTCACTGTACAGTGGGTTGTTGAACACGCAATGGTACAGTG
GGTTGCGGCTACGAGCGACAACACGAGCATGGGTTCTGGATCTGGGTTATTTGGAACAAAACCAAGATCTGGACCACCGCTGTTGAGTCATTGGGTCGAATTCGACAAGT
TCTTACAGGCGGCGAACCCAGATCTAGACCACCAGCGAGTTCTTACGGGTGATGAACGTCAACGACAGTGGGTTCATGATTTTCAGAGGCGGGAGGTGGAGGCGGAGACG
ACGAAGGATTTGGCCGGATTCGAAGGGGCGAGTTCTTACGGGCGCGAACCTCGGCGACATGGGTTTAGGATTTGGCCGGAAACGGGAGCTCCTCCTCCCCTCTTTTTTAG
TCCCTCGCTAGAAAACACACAGAGGAGGGGGGAAGTGGGTTCGCCGACAGTCAGAAAACACATAGACAAGGGGAAGAAAAGAAGGAGAAAAAATAGCTGCCGACGGTCGG
CGGTGGTGGCCGGCGACGACGACCAGCGATGGTACAGACACTGGTGGCTGAAGAAGAAGAACTTAAGAAGAAGAAGAAGAGTGGGAGAAATAAATAAAATTGCGCCGCCG
CCAGCCCTCTCCGGCGTCTTTCCCCGCGCGCATCTCTCCCTCCCTCTCGCGTGTTCGTCCGCCGGCAGCAAGGCTCGTGAAGCCTTCGTCTTCCTCTCTATCTCTCTCGC
CGTTTCCGTCCCAGCTCGAGCGTTCGTGAAGCCCAGCCGTCGTGAAGGTCGCCGCCAGCCCCAAGGTAGCTTGTGGGTAAGTCTGTTTTCCTTCGTTCTAGTTGGTTTAA
GGTTCAATCTAATCTCCTATTGCTCTTTTTGGATAACCCACGCAAGAATCCGGCCGTTTCCGTCAGTGGGTGTGCTGTCCAGCAGCGTTTTCGCTCTGTTTTGGGTCGTT
ACAGCGTCACCTAACAATTCCGTGTGTTCGGTTGCTGTTCATCGAGCATTTGATCTCAAAGATCGTGCTCAGGCCTCGGGTATAAAAGGTCGGGAACTGATATATCACTA
TTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTC
CTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCT
TGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCTCTCTCCCCCCAACTACCAGACTTTGCAGGTTATGAGGACTGCATGGACCATGGTGATGCAGAGGA
GTTAGTAGTTGTTGTTGTGTGGCCTGTCTGGCTGTCAGTTCTGTGGAGTTCTGCTGTGTTGCAGTGTGGTGAAGTTGTTGATTCAAGAGTTAGTAATGTCGCTGGGTTAG
CTTCTAAAATCCTGGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTGGTTTAGGGTTATGGTCTTCCTCGTTC
TCCTCTCCATCACCAGGAGTAGGATGCTCCCATAGATTGCTAGATAGCTTTTGGTTCTTCTTGTGGTTGTTTTGTGCAATGTTGCTTGTTAGTAGGATCAGAGGAAGTTA
CTACTCAGCAGGGGTCGATCCTCTGGCTCCCCCTATGCAGGAGGCTAATCCCCTGATTCCTCCCGGTCAGCGTAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCC
CTCCTGCGGCTCCTATGCTGATCACTCCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACTGCGAAACCCTAATTGGGTACCTGAGAAC
GCGGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGC
ACTGCTTGACCTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACC
ATGCTAACGACCGATCTCGTGGGAAAGGTTCAAGGATCTATTTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCAGAGCGG
AAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTGGCCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAA
GAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCC
CAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAATCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGTCTTGCTTGTGTATGTTGTTGCA
TCGGCTAGAACTTCCATTGCCTTTGAAGTTAGTGGGCTAGTTGGTTGGTTTCACGCCCTGCTAATCATTGCCTATGTCATAGGAATCCAGAGATTTTCTTCGTTTGCATC
TATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTGGACTTGCTCATGAATGCTCTTGGTGGACCGGGATGAGCATTCTTGAGGTTCATGGAAGGTGGTTGTGTTATG
TCAAGCTAAAGTCTAAGTTGACGTTATCAGCATGTGATCCTTCAAGTGGCATCGTTAGAATGAGCCTCATGTGCAAGAACCAGTTCGGATTGGTTTTTCAGACCTTTGGT
GTTCTTAATGCACTTGGTCAAGTTGCCGCAGCCGCAGATCCCCCCCCTCTGCGTGTTTTTCGCCGCCGCCCAGCCCCCTCCGGCCGTCTTCCCGCCGCGCGTCTCTCCCT
CCTCTCGCGTGTTCGTCGCCGCAGCAGGCTCGGCAGCCCGTCTCCTCTCTCCTCTCGCGTTTCCGTCCTCGAGTTCGTGGTCCGTCGTGAGGTCGCCGCCAGCCGTGAGC
CGCGCGTAAAGCCTCGCACTTCCGTCTCGCTGTGTCGTGTTGTCTCCTTGCGTTTTCAGCCAAGTAGCTTGTGGAATCCGGCCGTTTCGTCAGTGGTGTGCTGTCCAGCA
GCGTTTCGCTCTGTTTGGGTCGTTACAGCGTCACCTAACATTCCGCGCCGTCTAAGTGTCGATTGGGTTCGAACACTGGCCTCGGGTATAAAAGGTCGGGGACTGATATA
TCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGA
TGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCG
AGGCCTTGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCCTCTCTCCCCCCAACTACCAGATTTTGCAGTTAGTAGTTGTTGTTGTGTGGCCTGTCTG
GCTGTCAGTTCTGTGGAGTTCTGCTGTGTTGCAGTGTGGTGAAGTTGTTGATTCAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCGGCGGCCGCCCCATCGTCGGAATTTCTTTTTTGGGTTGCGGCTGTACAGACGGTGAGTATCTTCACTGTACAGTGGGTTGTTGAACACGCAATGGTACAGTG
GGTTGCGGCTACGAGCGACAACACGAGCATGGGTTCTGGATCTGGGTTATTTGGAACAAAACCAAGATCTGGACCACCGCTGTTGAGTCATTGGGTCGAATTCGACAAGT
TCTTACAGGCGGCGAACCCAGATCTAGACCACCAGCGAGTTCTTACGGGTGATGAACGTCAACGACAGTGGGTTCATGATTTTCAGAGGCGGGAGGTGGAGGCGGAGACG
ACGAAGGATTTGGCCGGATTCGAAGGGGCGAGTTCTTACGGGCGCGAACCTCGGCGACATGGGTTTAGGATTTGGCCGGAAACGGGAGCTCCTCCTCCCCTCTTTTTTAG
TCCCTCGCTAGAAAACACACAGAGGAGGGGGGAAGTGGGTTCGCCGACAGTCAGAAAACACATAGACAAGGGGAAGAAAAGAAGGAGAAAAAATAGCTGCCGACGGTCGG
CGGTGGTGGCCGGCGACGACGACCAGCGATGGTACAGACACTGGTGGCTGAAGAAGAAGAACTTAAGAAGAAGAAGAAGAGTGGGAGAAATAAATAAAATTGCGCCGCCG
CCAGCCCTCTCCGGCGTCTTTCCCCGCGCGCATCTCTCCCTCCCTCTCGCGTGTTCGTCCGCCGGCAGCAAGGCTCGTGAAGCCTTCGTCTTCCTCTCTATCTCTCTCGC
CGTTTCCGTCCCAGCTCGAGCGTTCGTGAAGCCCAGCCGTCGTGAAGGTCGCCGCCAGCCCCAAGGTAGCTTGTGGGTAAGTCTGTTTTCCTTCGTTCTAGTTGGTTTAA
GGTTCAATCTAATCTCCTATTGCTCTTTTTGGATAACCCACGCAAGAATCCGGCCGTTTCCGTCAGTGGGTGTGCTGTCCAGCAGCGTTTTCGCTCTGTTTTGGGTCGTT
ACAGCGTCACCTAACAATTCCGTGTGTTCGGTTGCTGTTCATCGAGCATTTGATCTCAAAGATCGTGCTCAGGCCTCGGGTATAAAAGGTCGGGAACTGATATATCACTA
TTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTC
CTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCT
TGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCTCTCTCCCCCCAACTACCAGACTTTGCAGGTTATGAGGACTGCATGGACCATGGTGATGCAGAGGA
GTTAGTAGTTGTTGTTGTGTGGCCTGTCTGGCTGTCAGTTCTGTGGAGTTCTGCTGTGTTGCAGTGTGGTGAAGTTGTTGATTCAAGAGTTAGTAATGTCGCTGGGTTAG
CTTCTAAAATCCTGGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTGGTTTAGGGTTATGGTCTTCCTCGTTC
TCCTCTCCATCACCAGGAGTAGGATGCTCCCATAGATTGCTAGATAGCTTTTGGTTCTTCTTGTGGTTGTTTTGTGCAATGTTGCTTGTTAGTAGGATCAGAGGAAGTTA
CTACTCAGCAGGGGTCGATCCTCTGGCTCCCCCTATGCAGGAGGCTAATCCCCTGATTCCTCCCGGTCAGCGTAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCC
CTCCTGCGGCTCCTATGCTGATCACTCCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACTGCGAAACCCTAATTGGGTACCTGAGAAC
GCGGAGGAATCCCAGTTCATTAGGGACTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGC
ACTGCTTGACCTCATGAACTGTAATGATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACC
ATGCTAACGACCGATCTCGTGGGAAAGGTTCAAGGATCTATTTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCAGAGCGG
AAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTGGCCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAA
GAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGAGTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCC
CAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAATCCTTAAAGGTGGGAAGTCGGTTTCACCTCTCATTTCCAGTCTTGCTTGTGTATGTTGTTGCA
TCGGCTAGAACTTCCATTGCCTTTGAAGTTAGTGGGCTAGTTGGTTGGTTTCACGCCCTGCTAATCATTGCCTATGTCATAGGAATCCAGAGATTTTCTTCGTTTGCATC
TATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTGGACTTGCTCATGAATGCTCTTGGTGGACCGGGATGAGCATTCTTGAGGTTCATGGAAGGTGGTTGTGTTATG
TCAAGCTAAAGTCTAAGTTGACGTTATCAGCATGTGATCCTTCAAGTGGCATCGTTAGAATGAGCCTCATGTGCAAGAACCAGTTCGGATTGGTTTTTCAGACCTTTGGT
GTTCTTAATGCACTTGGTCAAGTTGCCGCAGCCGCAGATCCCCCCCCTCTGCGTGTTTTTCGCCGCCGCCCAGCCCCCTCCGGCCGTCTTCCCGCCGCGCGTCTCTCCCT
CCTCTCGCGTGTTCGTCGCCGCAGCAGGCTCGGCAGCCCGTCTCCTCTCTCCTCTCGCGTTTCCGTCCTCGAGTTCGTGGTCCGTCGTGAGGTCGCCGCCAGCCGTGAGC
CGCGCGTAAAGCCTCGCACTTCCGTCTCGCTGTGTCGTGTTGTCTCCTTGCGTTTTCAGCCAAGTAGCTTGTGGAATCCGGCCGTTTCGTCAGTGGTGTGCTGTCCAGCA
GCGTTTCGCTCTGTTTGGGTCGTTACAGCGTCACCTAACATTCCGCGCCGTCTAAGTGTCGATTGGGTTCGAACACTGGCCTCGGGTATAAAAGGTCGGGGACTGATATA
TCACTATTGGTGTCGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGA
TGAGTCCTGAGGCAAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCG
AGGCCTTGGGTAGGGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCCTCTCTCCCCCCAACTACCAGATTTTGCAGTTAGTAGTTGTTGTTGTGTGGCCTGTCTG
GCTGTCAGTTCTGTGGAGTTCTGCTGTGTTGCAGTGTGGTGAAGTTGTTGATTCAAGTTAA
Protein sequenceShow/hide protein sequence
MPPAAAPSSEFLFWVAAVQTVSIFTVQWVVEHAMVQWVAATSDNTSMGSGSGLFGTKPRSGPPLLSHWVEFDKFLQAANPDLDHQRVLTGDERQRQWVHDFQRREVEAET
TKDLAGFEGASSYGREPRRHGFRIWPETGAPPPLFFSPSLENTQRRGEVGSPTVRKHIDKGKKRRRKNSCRRSAVVAGDDDQRWYRHWWLKKKNLRRRRRVGEINKIAPP
PALSGVFPRAHLSLPLACSSAGSKAREAFVFLSISLAVSVPARAFVKPSRREGRRQPQGSLWVSLFSFVLVGLRFNLISYCSFWITHARIRPFPSVGVLSSSVFALFWVV
TASPNNSVCSVAVHRAFDLKDRAQASGIKGRELIYHYWCRCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGSVRCLVIEALGINGQGSMHSSRP
WVGAAYQYLSVLTPSLSPQLPDFAGYEDCMDHGDAEELVVVVVWPVWLSVLWSSAVLQCGEVVDSRVSNVAGLASKILGRYSWYQSRVVPVDWPRKSRLFGGLGLWSSSF
SSPSPGVGCSHRLLDSFWFFLWLFCAMLLVSRIRGSYYSAGVDPLAPPMQEANPLIPPGQRRVDPPPPPPPPAPPAAPMLITPEALQTMFDNMAQRNARPLRNPNWVPEN
AEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALLDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANDRSRGKGSRIYLQYERKFTALSRFAPDLVSTPER
KIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSMESLKVGSRFHLSFPVLLVYVVA
SARTSIAFEVSGLVGWFHALLIIAYVIGIQRFSSFASIMESMNQVSRSFGLAHECSWWTGMSILEVHGRWLCYVKLKSKLTLSACDPSSGIVRMSLMCKNQFGLVFQTFG
VLNALGQVAAAADPPPLRVFRRRPAPSGRLPAARLSLLSRVRRRSRLGSPSPLSSRVSVLEFVVRREVAASREPRVKPRTSVSLCRVVSLRFQPSSLWNPAVSSVVCCPA
AFRSVWVVTASPNIPRRLSVDWVRTLASGIKGRGLIYHYWCRCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGSVRCLVIEALGINGQGSMHSS
RPWVGAAYQYLSVLTPSLSPPNYQILQLVVVVVWPVWLSVLWSSAVLQCGEVVDSS